|
|
|
|
How to search the Database?
The database is exclusively based on peer-reviewed literature reports
and represents a repository for all published TP53 mutation data. Users
should be aware of the following limitations and possible biases that
may affect the analysis of the database:
Trends in reporting and publishing mutations
Publication of the same set of mutations in different papers by the
same authors is a serious problem that has led to duplicates entries in
the database in the past. We now perform systematic searches of the
database under the author’s name to identify earlier entries that may
correspond to the same mutations. We have also extensively reviewed the
entire dataset in order to find and eliminate these duplications.
However, despite these efforts, some duplicates may remain in the
database and their identification is an ongoing task.
Another problem arises from the fact that publications may contain data
that are either erroneous or affected by experimental biases. Our policy
is to enter all published data into the database, with the idea that the
database should represent a repository for all the published TP53
mutation data. It is the responsibility of database users to make their
own critical judgement on the validity of the data that they include in
their analyses. Indeed, it is possible that sets of data that seem
aberrant will later appear to correspond to very rare mutations
occurring only in specific population groups.
Editorial policies have a major impact on the reporting of TP53
mutations. In particular, studies that confirm earlier findings in
common cancers are often considered as repetitive and thus rejected.
Moreover, many recent studies do not provide detailed information on
each mutation detected but rather report their results in the form of
summary tables or graphs. This information can't be entered in the
database, resulting in the loss of important data. Thus, database users
should be aware that the dataset cannot be considered as an exhaustive
description of the frequency, nature and distribution of TP53 mutations
in human cancers.
Study design
Most of the data included in the database are from retrospective
studies where tumor cases are selected on the basis of relevance to
pathological questions. Less than 10% of publications are molecular
epidemiological studies with information on individual exposures. The
lack of detailed information on exposure is a major concern for the
correct interpretation of mutation patterns.
Another potential source of bias lies in the methodology used for
mutation detection. The mode of tissue fixation and conservation may
affect the sensitivity of the detection. Each of the commonly used,
pre-screening methods (SSCP, DGGE, yeast functional assay) has its own
limitations in terms of sensitivity and specificity. Many studies are
limited to exons 5-8, because initial reports have shown that mutations
tend to cluster in this region. Database users should also be aware that
some studies might be affected by important technical artifacts. Such
artifacts may have an impact on the interpretations of rare mutation
patterns.
Database annotations
Data annotation (on tumor pathology, individual exposures, clinical
parameters and cancer outcome) involves an effort of standardization.
For example, we have adopted the International
Classification of Disease for Oncology as a standard for tumor sample
description. As many studies use different classifications, we often
have to translate these classifications to ICD-O standards. The
same applies to data on individual exposure (such as for example tobacco
or alcohol consumption, viral infections, etc...), and to the description of
mutations. Mutations are sometimes poorly described or only described at the RNA or protein
level, while we annotate them at the genomic level (more in the FAQ).
In the past years, we have updated our annotation system several times in order to
improve its accuracy.
We recommend reading the two following publications before using the
database:
Petitjean A, Mathe E, Kato S, Ishioka C, Tavtigian SV, Hainaut P, Olivier M. Impact of mutant p53 functional properties on TP53 mutation patterns and tumor phenotype:
lessons from recent developments in the IARC TP53 database. Hum Mutat. 2007 Jun;28(6):622-9 PubMed
Hernandez-Boussard T, Montesano R, Hainaut P. Sources of bias in the
detection and reporting of p53 mutations in human cancer: analysis of
the IARC p53 mutation database. Genet Anal. 1999;14(5-6):229-33. PubMed
The web site has been optimized for PC computers using Internet Explorer 5 or
later versions. Searching the database with a Mac computer may not function properly (some functions work
with Mac OS X operating system and Safari browser).
The mutation pattern option produces a pie chart showing the proportion of the different types of mutations
(classified by their nature: base change, insertions, deletions....) in the selected set of tumors (calculated as follows:
number of mutations of each type divided by the total number of mutations selected).
You can get a mutation pattern pie chart from the Basic search or Advanced search options. The Basic search allow you to select
a tumor site (Topography) and/or a tumor type (Morphology) whereas with the Advanced search a precise set of data can be selected
by defining the type of tumor, the population affected and the references in which the mutations are described.
Strand-bias
This option displays a table showing the description of the mutation
event by mutation type for single base substitutions. This allows the
calculation of the strand-bias (non-equal distribution of a mutation on
each strand of DNA) for each mutation event.
The Mutation Effect option produces a pie chart showing the proportion of different types of mutations
(classified by their effect: missense, nonsense,....) in a selected set of tumors (calculated as follows: number of
mutations of each type divided by the total number of mutations selected).
You can get a Mutation effect pie chart from the Basic search or Advanced search options. The Basic search allow you to select
a tumor site (Topography) and/or a tumor type (Morphology) whereas with the Advanced search a precise set of data can be selected
by defining the type of tumor, the population affected and the references in which the mutations are described.
The codon distribution option produces a histogram showing the proportion of single base substitutions at each codon in a selected set
of tumors (calculated as follows: number of single base substitutions at each codon divided by the total number of single base substitutions in
the selected dataset).
The display option allows the setting of a minimum percentage value for a codon to be labeled on the graph. A 3D view of most
frequent mutations within the DNA-binding domain is also displayed. This view requires that you download and install
CHIME plug-in (unzip with the free software ZipGenius).
You can get a Codon distribution from the Basic search or Advanced search options. The Basic search allow you to select
a tumor site (Topography) and/or a tumor type (Morphology) whereas with the Advanced search a precise set of data can be selected
by defining the type of tumor, the population affected and the references in which the mutations are described.
Only single base substitutions within the coding sequence are included in this graph (selection of other mutations will return no results).
The Function pattern option produces a scatter plot showing the proportion of single base substitutions in a
selected set of tumors, labeled with functional consequences and ordered according to their mutation rate. Different functional
labelling can be selected: (1) transactivation activity on 8 different promoters in yeast assays (as reported by
Kato et al.),
(2) functional predictions by tow different methods (SIFT and AGVGD) that are based on residue conservation from fish to placental mammals,
(3) functional predictions of missense mutations in the DNA-binding domain derived from structural features of mutants that are based
on a four-body statistical potential scores calculated from the Delaunay tessellation of the wild-type p53 structure.
The mutation rate for an amino-acid substitution is computed by taking the sum of the mutation probabilities of all individual nucleotide substitutions
that lead to this amino-acid substitution. See mutation annotations for details on these functional classes and on the computation of the mutation rates.
The mutation frequency was calculated as follows: number of each specific mutant divided by the total number of mutants with available functional
classification in the selected dataset.
You can get a Function pattern from the Basic search or Advanced search options. The Basic search allow you to select
a tumor site (Topography) and/or a tumor type (Morphology) whereas with the Advanced search a precise set of data can be selected
by defining the type of tumor, the population affected and the references in which the mutations are described.
Only missense, nonsense and silent mutations with mutation rates are included in these graphs (selection of other mutations will return no results).
The Tumor Spectrum option produces a histogram showing (1) for somatic mutations, the proportion of the selected
mutations by tumor sites (percent of the total number of mutations present in the database for each tumor site), (2) for germline
mutations, the type of tumors that have been reported in individuals who are confirmed constitutive carriers of the selected mutation(s)
(percent of the total number of tumors reported in these individuals).
This option allows you to select a precise set of data by including and/or excluding several criteria.
There are two search windows with the same criteria, the first one displays the 'criteria to
include' and the second one the 'criteria to exclude'. There are four categories of
criteria: criteria to describe the mutation, criteria to describe the tumor sample, criteria to define the patients
and criteria to select references in which the mutations have been reported. The criteria for 'Mutation', 'Sample' and 'Individual'
are displayed in the 'Advanced Search' window, whereas the 'Reference' criteria are displayed in a wizard, opened by clicking
on the 'Add' reference button. Once you have selected the criteria to include and/or exclude, you can get the same type of graphs
as for the basic search option.
Reference search
Click on 'Add' in the Reference criteria to open the reference search window. Enter a selection keyword (authors, journal name,
title, publication year, ...). All references that include this keyword in the selected field will be displayed one by one in
the wizard. Use the 'next' and 'previous' buttons to navigate between references. By default, all references are included.
To exclude a reference, uncheck the check box. If the wizard was opened from the 'criteria to include' window, only the mutations
described in the selected references will be included in the advanced search. If the wizard was opened from the 'criteria to
exclude' window, the mutations described in the selected references will be excluded from the advanced search.
This option allows you to retrieve the TP53 gene status of cell-lines that have been screened for TP53 and reported in the literature or in the Sanger database.
You may select any of the available criteria. A brief description of the retrieved cell-lines is displayed in the result window and detailed information can
be downloaded.
Search tips: As different names or description may have been used (A431 vs A-431, or MDA-MB-231 vs MDA-MB231) for the same cell-line,
it is not recommended to type the entire name of a cell-line, especially if it contains symbols such as - or /, . Use a partial name instead (431 or MDA),
as the cell-line name criteria automatically includes wild cards on both sides of the input.
This option allows the search of functional activities of p53 mutant proteins. You can select a mutation or a
group of mutations and get a table that shows the number of experimental data available for each category of functional properties.
From this table, select a category to get a table displaying experimental data obtained in yeast or human cells. With the download
option, you will get all functional properties of the selected mutations. Mutations that may not be easily selected with the available
criteria (Complex mutations) can be selected with a scrollbar menu.
This option allow to retrieve the proportion of mutated samples by cancer type (topography or morphology) and by country of
origin of the patients. You can select a type of cancer, a population and the method of mutation screening.
You have to select the type of graph you want to display. The prevalences shown on the result graphs are calculated from the
prevalence dataset (number of mutated samples divided by number of sample analyzed). Note that silent
mutations are counted as mutation event in this dataset. Note also that mutation prevalence is highly affected by the method of
detection, so please download and check the detailed information before interpreting graph results.
This option allows to select for a specific mutation and retrieve standard nomenclatures and database information for this mutation.
Different input format can be used to select for a mutation. The information retrieved includes the precise description of the mutation at the
DNA and protein level, observed (in experimental cell assays) or predicted (by amino-acid conservation rules, structural
analysis, or splicing prediction) functional impacts of the mutation, number of occurence as somatic or germline mutation, and mouse-model with engineered p53 mutant.
To input the correct numbers for mutation position, please check the reference sequences:
TP53 coding sequence
TP53 gene sequence
This option provides tools to visualize primary and tertiary sequences of p53 wild-type or mutant proteins. For the 3D viewer
and to display solved mutant structures, you need to download and install
CHIME plug-in (unzip with the free software
ZipGenius).
The Computational geometry analysis tools draw a histogram representing the Residual Score Profile of a selected mutant. Principle
and methods of the Computational geometry analysis is available here.
The full datasets can be downloaded from the "Database downloads" pages and selected subsets can be downloaded from the graph
pages with the "Database search" options. The 'Save as table' option allows to download the table containing the
data used to generate the graph and the 'Download data' option allows to download the complete set of data linked
to the selected data. Data are downloaded as tab-delimited text file compressed in a zip format. To open the
zip file, use the free software ZipGenius. Graphs image can be saved using the mouse right-click button over
the graph image. Description of the column contents are given in this help page.
|
| Top |
Description of datasets
Somatic dataset: Type of TP53 somatic mutations in human cancer
The dataset of Somatic Mutation contains exclusively TP53 somatic mutations that have been identified in
human tissues by sequencing and published in the peer-reviewed literature. This includes mutations found in normal, pre-neoplastic
and neoplastic tissues, including metastases, as well as in cell lines derived from such tissues. The database does not
include (1) individual data on human tissues that are reported as negative with respect to p53 mutation, (2) mutations not precisely
identified by sequencing (e.g. mutations identified only by SSCP, DGGE or restriction digestion), (3) experimentally-induced
mutations in tumor cells or cell lines in vitro, (4) p53 mutations in animal tumors.
TP53SomaticR14 is a compressed file in a zip format. It contains a Disclaimer and
two tab-delimited text files: TP53SomaticR14 that includes mutation data and
TP53SomaticRefR14 that includes the references in which are described the mutations. To open this file, use the free software
ZipGenius.
TP53SomaticR14
Each row in the table represents a single mutation with an arbitrarily assigned unique identification number.
A unique identification number is also attributed to the tumor sample and to the patient.
| Column |
Column head |
Description |
| Columns 1-16 describe
the mutation. |
| 1 |
Mutation_ID |
Unique identification number for the
mutation. Tandem mutations (two adjacent base substitutions) are
considered as one mutation event and are entered together;
therefore tandem mutations have only one identification number and
are a single record. |
| 2 |
MUT_ID |
see annotations |
| 3 |
ExonIntron |
see annotations |
| 4 |
Genomic_nt |
see annotations |
| 5 |
Codon_number |
see annotations |
| 6 |
Splice_site |
see annotations |
| 7 |
CpG_site |
see annotations |
| 8 |
Type |
see annotations |
| 9 |
Description |
see annotations |
| 10 |
c_description |
see annotations |
| 11 |
Mut-rate |
see annotations |
| 12 |
WT_codon |
see annotations |
| 13 |
Mutant_codon |
see annotations |
| 14 |
WT_AA |
see annotations |
| 15 |
Mutant_AA |
see annotations |
| 16 |
ProtDescription |
see annotations |
| 17 |
Mut_rateAA |
see annotations |
| 18 |
Effect |
see annotations |
| 19 |
Structural_motif |
see annotations |
| 20 |
Putative_stop |
see annotations |
Columns 21-35 are
assigned to the description of the organ site, tissue and type of
lesion where the mutation has been identified. The descriptions
given in the publication are translated into the standards
of the International Classification of Diseases
for Oncology (ICD-O 3rd Edition, World Health Organization, Geneva, 2000) and SNOMED.
For information on tumor classification, grading and staging, check out
ICD-O training at SEER,
Cancer
Information at NCI and Oncologychannel.com. |
| 21 |
Sample_Name |
Name or number given by the authors to
the tumor sample or cell line. If the name is not distinctive,
e.g., if the publication refers to samples as tumors 1,2,3, etc,
we have arbitrarily assigned a name, usually the first letters of
the first author's name, followed by the numbers in the series.
The same name or number can occur several times as in some samples
more than one mutation has been reported. |
| 22 |
Sample_ID |
Unique sample identification number.
This number allows the automatic retrieval of samples with
multiple mutations. |
| 23 |
Sample_source |
see annotations |
| 24 |
Tumor_origin |
see annotations |
| 25 |
Topography |
see annotations |
| 26 |
Short_topo |
see annotations |
| 27 |
Topo_code |
see annotations |
| 28 |
Sub_topography |
see annotations |
| 29 |
Morphology |
see annotations |
| 30 |
Morpho_code |
see annotations |
| 31 |
Grade |
see annotations |
| 32 |
Stage |
see annotations |
| 33 |
TNM |
see annotations |
| 34 |
p53_IHC |
p53 immunostaining graded as
‘positive’, ‘negative’ or ‘+/-‘. ND stands for not
done. |
| 35 |
Add_Info |
Any useful additional information is
entered here. |
Columns 36-49 are
assigned to the description of the patient origin and life-style.
They contain heterogeneous notes, usually comments emphasized by
authors reporting the mutations. The diversity of the terms
occurring in columns 36-40 reflects the heterogeneity of the
information available in the publications from which the data are
compiled. It should be noted that this information is generally
qualitative. No quantitative information on exposure of risk
factors is included in the database. This information does not
presuppose that a formal, causal link has been established between
such factors and the mutation described. Moreover, for most
exogenous risk factors, individual exposure has not been
monitored. This information is given solely
to (i) permit the retrieval of mutations found in patients
belonging to defined groups or having specific risk factors, and
(ii) facilitate access to the corresponding publications. For
detailed comparison between exposure groups, users are invited to
perform their own analysis based on the information given in the
original publication. |
| 36 |
Individual_ID |
Unique identification number for an
individual included in the database. It is automatically assigned
by the database system. |
| 37 |
Sex |
see annotations |
| 38 |
Age |
see annotations |
| 39 |
Ethnicity |
see annotations |
| 40 |
Population |
see annotations |
| 41 |
Country |
see
annotations |
| 42 |
Geo_area |
see
annotations |
| 43 |
TP53polymorphism |
Presence of a polymorphism in TP53
gene. |
| 44 |
Germline_mutation |
Presence of a germline mutation in
TP53 or any other gene. |
| 45 |
Family_history |
Information on the presence or absence
of cancers in the family of the patient. |
| 46 |
Tobacco |
Information on the smoking status of
the patient. Terms occurring in this column are 'smoker' (with
qualitative amount in brackets), 'non-smoker', 'passive-smoker'
and chewer. |
| 47 |
Alcohol |
Information on the drinking status of
the patient. Terms occurring in this column are ‘drinker’
(with qualitative amount in brackets), and 'non-drinker'. |
| 48 |
Exposure |
Risk factors to which the patient has
been exposed to, such as aflatoxins, radon, thorotrast, etc… |
| 49 |
Infectious_agent |
Pathogen (virus or bacteria) detected
in the patient. |
| 50 |
Ref_ID |
Unique identification number for the
reference in which the mutation is described. The citation is
given in the reference file RefR5. |
| 51 |
PubMed |
PubMed reference number provided by
NCI. |
TP53SomaticRefR14
This file lists the publications in which are described the mutations
and gives the method used to detect the mutations. Each row (record) represents a
citation with an arbitrarily assigned unique identification number
(Ref_ID). See standardized annotations for the
description of the column content.
|
| Top |
|
Prevalence dataset: Prevalence of TP53 somatic mutations in cancer
The Prevalence Table provides information on the prevalence of TP53
mutation by tumor type. It contains all publications having
identified the mutations by sequencing. This includes publications
contained in TP53SomaticR14Ref, but also publications that do not give a
detailed description of the mutations (many recent studies do not
provide detailed information on each mutation detected but rather report
their results in the form of summary tables), and publications with
negative results (no mutation found).
For each study, the total number of tumors or tissue samples analyzed,
and the number of these samples which were found to contain a mutation
are recorded. The reference paper, method of mutation detection and
country of origin of the patients are also indicated. When the same
research team published several papers that describe the same set of samples,
data from the most recent paper are used.
The file TP53PrevalenceR14.zip is
compressed in a zip format. To open this file, use the free software ZipGenius. It contains a Disclaimer,
and a tab-delimited text file, TP53PrevalenceR14.txt.
| Column |
Column
head |
Description |
| Columns 1-11 describe
the samples and mutation prevalence |
| 1 |
Prevalence_ID |
Unique entry identification number. |
| 2 |
Topography |
see annotations |
| 3 |
Topo_code |
see annotations |
| 4 |
Morphology |
see annotations |
| 5 |
Morpho_code |
see annotations |
| 6 |
Sample_analyzed |
Number of tumor samples analyzed for
TP53 mutations. |
| 7 |
Sample_mutated |
Number of tumor samples with a
mutation in TP53. |
| 8 |
Population |
see annotations |
| 9 |
Country |
see annotations |
| 10 |
Comment |
Any relevant information. |
| Columns 11-35 describe
the citation and the method used to identify the mutations (see
annotations). |
|
Top |
| Prognosis dataset: Prognostic value of TP53 somatic mutations
A "Prognosis" information unit has been added to the
Somatic Mutation Database since the R7 release. This feature consists in
a table that includes information on all studies that have analyzed the
relationship between p53 mutations and cancer prognosis. For each study,
the patient cohort, study settings and a summary of the results are
described. When the same research team published several papers with
increasing number of patients, the most recent paper with the largest
dataset is used.
Many of these studies do not provide detailed information on each
mutation detected but rather report their results in the form of summary
tables. Such publications have been included in the prognosis table,
without mutation data attached (these references will not appear in the
SomaticR14 file or through the database search system). For some of them, the mutations
have been published in a previous paper and can be retrieved with the
Cross_Ref_ID (see below).
The file TP53PrognosisR14.zip is compressed in a zip
format. To open this file, use the free software ZipGenius. It contains a Disclaimer,
and data either in a tab-delimited text file, TP53PrognosisR14.txt,
or in a graphical output format in Microsoft Access, TP53PrognosisR14.mdb.
The text file is compatible with softwares that handle tab-delimited
formats, whereas the mdb file can only be opened with Microsoft
Access 2000 software. Data in Microsoft Access format are easy
to view and search with the filter function of Access forms (click
here for help with the form filter function).
The table contains the following information:
| Column |
Column
head |
Description |
| Columns 1-15 describe
the patient cohort, the study settings and the results of the
study |
| 1 |
Prognosis_ID |
Unique entry identification number. |
| 2 |
Topography |
see annotations |
| 3 |
Morphology |
see annotations |
| 4 |
Population |
see annotations |
| 5 |
Country |
see annotations |
| 6 |
Institution |
Name of the hospital(s) where the
patients have been recruited. |
| 7 |
Period |
Time period (year) during which the
patients have been recruited. |
| 8 |
Inclusion criteria |
ICD-O
(3rd edition) or SNOMED code for morphology |
| 9 |
Treatment |
Treatment protocol used for most of
the patients. SU, surgery; CX, chemotherapy; RX, radiotherapy; pre-op, pre-operative; CP, cyclophosphamide; CISP, cisplatin; doxo, doxorubicin; 5-FU, 5-fluorouracil
|
| 10 |
Median FU |
Median follow-up time of the patients
in month. |
| 11 |
Range FU |
Range follow-up time of the patients
in month. |
| 12 |
Cohort |
Number of patients/tumors analyzed for
TP53 mutations. |
| 13 |
P53 mutations |
Number
of patients/tumors with a mutation in TP53. |
| 14 |
Percent mutated |
Proportion of mutated tumors (%). |
| 15 |
Parameter_analyzed |
Clinical parameter analyzed (patient
survival and/or tumor response to treatment). |
| 16 |
Association |
Summary result: association with the
presence of a TP53 mutation. |
| 17 |
Result |
Main findings. |
| Columns 18-41 describe
the reference and the method used to identify the mutations (see
annotations). |
|
Top |
|
Germline dataset: Database of TP53 germline mutations and Li-Fraumeni syndrome
TP53GermlineR14 is a compressed file in a zip format.
It contains a disclaimer and two tab-delimited
text files: TP53GermlineDataR14 including
the mutation data and TP53GermlineRefR14
including the references.
To open this file, use the free softwareZipGenius.
TP53GermlineDataR14
Each row (record) represents a tumor found in an individual having a TP53
germline mutation. The columns contain the following information and
abbreviations:
note: na= non applicable
| Column |
Column
head |
Description |
| 1 |
Family_ID |
Unique family identification number. |
| 2 |
Family_Code |
Name or number given in the original
publication or an arbitrarily-assigned name, usually the 3 first
letters of the first author's name and the publication date. |
| 3 |
Country |
see annotations |
| 4 |
Class |
Family classification:
LFS = strict clinical definition of Li-Fraumeni
syndrome (defined by Li and Fraumeni as a Proband with sarcoma
<45 years with a first degree relative with cancer at <45
and another first/second degree relative with cancer at <45 or
sarcoma at any age);
LFL = Li-Fraumeni like for the extended clinical
definition of Li-Fraumeni (including Birch definition:
proband with any childhood cancer or sarcoma, brain tumor or
adrenocortical carcinoma at <45 years, with one first or second
degree relative with sarcoma, breast cancer, brain tumor,
leukemia, or adrenocortical carcinoma at any age, plus one first
or second degree relative in the same lineage with any cancer
diagnosed under age 60; Eeles definition E1: two different tumors
which are part of extended LFS in first or second degree relatives
at any age (sarcoma, breast cancer, brain tumor, leukemia,
adrenocortical tumor, melanoma, prostate cancer, pancreatic
cancer); Eeles definition E2: sarcoma at any age in the proband
with two of the following (two of the tumors may be in the same
individual): breast cancer at <50 years and/or brain tumor,
leukemia, adrenocortical tumor, melanoma, prostate cancer,
pancreatic cancer at <60 years or sarcoma at any age).
FH: family history of cancer which does not
fulfil LFS or any of the LFL definitions (Birch, Eeles E1 or E2,);
No FH: no family history of cancer.; FH= Family
history of cancer (not fulfilling the definition of LFS/LFL); No=
no family history of cancer; ?= unknown. |
| 5 |
Generations_analyzed |
Number of generations analyzed in the
family. |
| 6 |
Germline_mutation |
A TP53 germline mutation has been
identified. |
| 7 |
ExonIntron |
see annotations |
| 8 |
Genomic_nt |
see annotations |
| 9 |
Type |
see annotations |
| 10 |
Codon_number |
see
annotations |
| 11 |
Description |
see annotations |
| 12 |
c_description |
see annotations |
| 13 |
WT_codon |
see annotations |
| 14 |
Mutant_codon |
see annotations |
| 15 |
WT_AA |
see annotations |
| 16 |
Mutant_AA |
see annotations |
| 17 |
Effect |
see annotations |
| 18 |
Individual_ID |
Unique identification number for an
individual included in the database. It is automatically assigned
by the database system. |
| 19 |
Individual_code |
Code or number given in the original
publication or an arbitrarily-asigned code, usually the family
code followed by the position of the individual in the family
tree. |
| 20 |
FamilyCase |
Family case in the pedigree, such as proband (index case), mother, father,.... |
| 21 |
Generation |
Position of the individual in the
family tree. |
| 22 |
Sex |
Sex of the individual. |
| 23 |
Germline_carrier |
TP53 mutation status of the individual:
confirmed= the individual has been tested for the presence of the mutation and the mutation has been found;
obligatory= the individual has not been tested for the presence of the mutation but must be carrier based on
the mutation status of the other individuals in the pedigree;
50%prob.= there is a chance of 50% that the individual is a mutation carrier;
negative= the individual has been tested for the presence of the mutation and the mutation has not been found;
NA= the individual has not been tested for the presence of the mutation. |
| 24 |
Mode_of_inheritance |
Parent from which the mutation has
been inherited. P=paternal, M=maternal, M&P=maternal and paternal, de novo= new mutation that has not been inherited, na=not known. |
| 25 |
Dead |
Living status of the individual at
time of follow-up. 0=alive; 1=dead |
| 26 |
Unaffected |
Disease status of the individual at
time of follow-up. 0 = affected by cancer; 1 = not affected by
cancer. |
| 27 |
Age |
Age of the individual at the time of
follow-up. |
| 28 |
Topography |
see annotations |
| 29 |
Morphology |
see annotations |
| 30 |
Age at diagnosis |
Age of the individual at the time of
diagnosis of the tumor. |
| 31 |
Ref_ID |
Reference number indicating the
publication in which the mutation is described. This number
corresponds to the Ref_ID number in the GermlineRefR14 file. |
GermlineRefR14
Each row represents a reference identified by a unique identification
number (Ref_ID). See standard annotations
for a description of the column content.
|
| Top |
|
P53MutFunction datasets: Functional properties of p53 mutant proteins
The files TP53MUTfunction1R14 and TP53MUTfunction2R14 are compressed in a zip
format. To open these files, use the free software ZipGenius. They contain a Disclaimer
and a tab-delimited text file, TP3MUTfunction1/2R14.txt.
Data were extracted from publications that report functional assessment of p53 mutant proteins in human or yeast cells, either
by transfection and overexpression of mutant proteins, or by assessment of endogenous mutants.
Comparison between mutants requires caution since functional assays
differ from one study to the other, in particular with respect to
the expression vector (which influences the level of
expression of the mutant protein), the p53-responsive elements
(generic consensus sequence versus gene-specific response elements from
WAF1, BAX or PIG3), and the recipient cells that have been used. Indeed,
p53 mutant functional properties appear to be dependent upon the type of
cells and response elements.
P53MUTfunction1R14
The functional properties of mutant proteins that are included in this dataset are:
- transcriptional activities on various well-described p53-RE (see list here),
- dominant negative effects on the activities of wild-type p53,
- capacity to induce apoptosis, cell-cycle arrest or checkpoints in human cells,
- capacity to transactivate promoters that are not induced by wild-type p53,
- ability to promote cell growth and confer tumorigenicity,
- sensitivity to temperature changes regarding their ability to transactivate specific promoters.
The functional results have been organised in 5 columns for (1) conserved
wild-type properties, (2) complete or partial loss of wild-type properties, (3) dominant-negative effects, (4) gain
of function and (5) temperature sensitivity. The cell system is indicated in two columns and a detailed reference to the
published report is given.
The columns contain the following
information and abbreviations:
Note: NA= Non Applicable
| Column |
Column
head |
Description |
| 1 |
Function_ID |
Unique identification number for each
entry. |
| 2 |
ProtDescription |
see
annotations |
| 3 |
WT_AA |
see annotations |
| 4 |
Codon number |
see annotations |
| 5 |
Mutant_AA |
see annotations |
| 6 |
SwissProtLink |
see annotations |
| 7 |
Structural_motif |
see annotations |
| 8 |
Codon 72 |
Amino-acid at codon 72 of p53 (polymorphism) |
| 9 |
Conserved
WT function |
Functional property of mutant that is similar to the activity of the wild-type protein.
- Activities of mutant proteins in human or yeast cells:
DNAb = DNA binding capacity tested by gel-shift or ChIP assay;
TA = transactivation of a reporter gene under the control of a p53-response element
(indicated in brakets, see list here);
TR = transrepression of a reporter gene under the control of a gene-specific response-element
(name of gene indicated in brakets)
TETR = capacity to form tetramers;
x binding = interaction with protein x;
drug sensitivity = conserved capacity to mediate cytotoxic effect of drug (specific drug used is indicated, see list of abbreviations).
- Activities of over-expressed mutant proteins in human cells:
APO = induction of apoptosis;
GS = growth suppression measured by colony forming assay (CFA), establishment of stable clones, or other proliferation assay;
GA = cell cycle arrest measured by FACS;
TUMOR- = inhibition of tumorigenicity in nude mice;
up/downregulation = induction or repression of an endogenous GENE (in upper-case letters) or protein (in lowercase letters);
HR repression = inhibition of Homologous Recombination;
- Biological effect after over-expression in mouse or rat embryonic fibroblasts:
TRANSF- = ability to counteract the transformation of primary cells induced by the co-transfection of ras or another
transformant oncogene, such as HPV E7;
"super" indicates that the activity of mutant protein is higher than the one of wtp53 (on transactivation, induction of apoptosis, DNA binding or growth suppression).
|
| 10 |
Loss of
Function |
WTp53 functional property that is lost by the mutant protein.
Same annotations as in column 9.
"partial" indicates that the loss of function is partial (residual activity).
|
| 11 |
Dominant
negative activity |
Inhibition of the wild-type protein by
mutant proteins in transactivation or cell growth assays.
- Yes = the mutant protein counteract the activity of the wild-type
protein when the two proteins are co-expressed in human or yeast
cells (the p53-response element or cell growth assay performed is indicated in brakets);
- No = the mutant protein does not counteract the effects of the
wild-type protein.
"moderate" indicates that the mutant protein has a partial inhibiting effect on
the wild-type protein. |
| 12 |
Gain of
Function |
Functional properties displayed by the
mutant but not by the wild-type protein.
- Activities of over-expressed mutant proteins in human or yeast cells:
same annotations as in column 9, plus:
TUMOR+ = confer tumorigenic property (in nude mice) to transfected cells;
p73 interference = ability to counteract p73 activity when both proteins are expressed in a cell system;
Drug resistance = confer resistance to a cytotoxic drug (see list of abbreviations);
Growth advantage = increase growth rate.
- Biological effect after over-expression in mouse or rat embryonic fibroblasts:
TRANSF+ = ability to cooperate with ras or another transformant oncogene, such as HPV E7, in the transformation of
primary cells.
"moderate" indicates that the mutant protein has a partial effect on the activity studied;
"no" indicates that the mutant protein has no effect on the activity studied.
|
| 13 |
Temperature
sensitivity |
Sensitivity of mutant to temperature changes in transactivation assays
(the p53-RE is indicated in brackets), and in other experimental assays (specified in brackets).
Yes = the activity of the mutant protein is affected by the
temperature at which is preformed the test;
mut_H = the protein is inactive (mutant) at higher temperatures;
mut_L = the protein is inactive (mutant) at lower temperatures;
No = the activity of the mutant protein is NOT affected by the
temperature at which is preformed the test.
Note that functional tests are performed at different temperature
in yeast (30ÝC) and human (37ÝC) cells. Detailed annotation rules are available here. |
| 14 |
Temp_ref |
Temperature at which experiments have been performed or which has been used as reference for temperature sensitivity assays. |
| 15 |
Cell assay |
Human = the activity of the mutant
protein has been tested in human cells.
Yeast = the activity of the mutant protein has been tested in the
yeast. |
| 16 |
Cell-lines |
Name of cell-line(s) that have been
used for testing mutant activities. "(endo)" indicates that activities have been tested on endogenous mutants. |
| 17 |
Assay design |
Indicates if the assay has been performed with or without wtp53 as control, or if activity has been tested on endogenous mutant. |
| 18 |
Method |
Details on type of experimental assay that was performed to assess function. |
| Columns 19-26
describe the citation identified by a unique identification number
(FRef_ID)(see annotations). |
P53MUTfunction2R14
The functional data that are included in this dataset were provided by Chikashi Ishioka and have been published in
Kato et al.,
Kakudo et al., and
Kawaguchi et al..
The columns contain the following information and abbreviations (for the detailed experimental procedures, please see the above references):
Note: NA= Non Applicable
| Column |
Column
head |
Description |
| 1 |
ProtDescription |
see
annotations |
| 2 |
Codon number |
see annotations |
| 3-10 |
WAF1nWT, MDM2nWT, BAXnWT,... |
Promoter-specific transcriptional activity measured in yeast functional assays
and expressed as percent of wild-type activity. |
| 11-16 |
WAF1nWT_Saos2, MDM2nWT_Saos2,... |
Promoter-specific transcriptional activity measured in the human cell-line Saos-2 and expressed as percent of wild-type activity. |
| 17 |
SubG1nWT_Saos2 |
Induction of apoptosis by overexpression in Saos-2 cells expressed as percent of wild-type activity. |
| 18 |
Oligomerisation_yeast |
Capacity of mutant protein to form oligomer:
TETR=can form tetramer, DIM=can form dimer but not tetramer, MON= can not oligomerarize. |
|
Top |
| Cell-lines dataset: TP53 status of Human cell-lines
The file TP53CellLinesR14 is compressed in a zip
format. To open this file, use the free software ZipGenius. It contains a Disclaimer
and a tab-delimited text file,TP53CellLineR14.txt,
which is compatible with softwares that handle tab-delimited formats.
The current list of cell-lines include cell-lines that have been screened for TP53 mutation and are either
published in the scientific literature or listed in the
Sanger cell-line database. Cell-lines with a wild-type or mutated TP53 gene
are listed.
The file contains contains the following information:
note: na= non applicable
| Column |
Column head |
Description |
| 1 |
Sample_name |
Name of the cell-line. |
| 2 |
ATCC_ID |
Identification number of the
ATCC database. |
| 3 |
Cosmic_ID |
Identification number of the
Cancer Cell Line Project and COSMIC databases of the Sanger Institute. |
| 4 |
Short_topo |
see annotations |
| 5 |
Morphology |
see annotations |
| 6 |
Tumor_origin |
see annotations |
| 7 |
Add_info |
|
| 8 |
Sex |
Gender of the patient from whom the cell-line has been isolated. |
| 9 |
Age |
Age at cancer diagnosis of the patient from whom the cell-line has been isolated. |
| 10 |
Country |
see annotations
|
| 11 |
Population |
see annotations
|
| 12 |
Germline_mutation |
Germline mutation in TP53 or any other gene carried by the individual from which the cell-line has been isolated. |
| 13 |
Infectious_agent |
Infectious agent (virus or bacteria) detected in the individual from which the cell-line has been isolated. |
| 14 |
Tobacco |
Smoking habit of the individual from which the cell-line has been isolated. |
| 15 |
Alcohol |
Drinking habit of the individual from which the cell-line has been isolated. |
| 16 |
Exposure |
Reported exposure of the individual from which the cell-line has been isolated. |
| 17 |
KRAS_status |
Status of KRAS gene. WT=
wild-type; MUT=mutant (base change indicated in brackets) |
| 18 |
Other_mutations |
Name of other genes in which a mutation has been identified. |
| 19 |
TP53status |
Status of TP53 gene. WT=
wild-type gene sequence; MUT=mutated gene sequence; NULL=entire gene deletion |
| 20 |
p53_IHC |
p53 immuno-staining status. |
| 21 |
p53_LOH |
Loss of heterozygocity at p53 locus. Yes= LOH, No= no LOH, NI= non informative, NA= no information |
| Columns 22-44
TP53 mutation description and functional properties (see
annotations). |
| Columns 45-50
reference from which the information on the cell-line has been extracted (same as somatic Ref_ID)(see
annotations). |
| Columns 51-63
methods for mutation detection (see
annotations). |
|
Top |
| Polymorphisms: gene variations observed in unaffected human populations
The list of polymorphisms shown in the table includes gene variations that have been reported in publications or SNP databases
and validated at least by frequency in unaffected human populations.
The table contains the following information:
note: na= non applicable
| Column |
Column head |
Description |
| Columns 1-5
TP53 mutation location and descriptions at the genomic, coding and protein level (see
annotations). |
| 6 |
Avg Het |
Frequency (average) of heterozygotes found in analyzed population(s). For detailed information, follow the dbSNP or PubMed link. |
| 7 |
SNP_link |
Link to NCBI SNP database |
| 8 |
P53KB_link |
Link to the p53Knowledgebase that provides information on population frequencies and association with disease for the selected SNP. |
| 9 |
PubMed |
Link to PubMed of the publication that first described the SNP when applicable. |
|
Top |
| Mouse models: mouse with engineered Trp53 gene
The table contains mouse models with engineered p53 that are compiled in the caMOD database or reported in the scientific literature.
Data curated at caMOD were courteously provided by the caMOD team. A direct link to the caMOD web site is available for each model for a detailed
description of model genetics and phenotypes. Data reported in the literature but not compiled in caMOD are curated at IARC and a link to PubMed abstract is provided.
For a detailed description of model genetics and phenotypes, please refer to caMOD and/or original publication
The table contains contains the following information:
note: na= non applicable
| Column |
Column head |
Description |
| 1 |
Model descriptor |
Model name as indicated in caMOD or original publication. |
| 2 |
Affected organs |
List of organs affected or targeted by transgene. |
| 3 |
AA change in human |
Amino-acid substitution. Note that amino-acids are numbered according to the human sequence. |
| 4 |
caMOD link |
Model ID from caMOD database with direct link to caMOD database (https://cancermodels.nci.nih.gov/camod/login.do). |
| 5 |
PubMed |
PMID with link to PubMed abstract that describe the model. |
|
| Top |
Mutation
| Column head |
Description |
| MUT_ID |
Unique identifier of each gene variation reported in the database.
This identifier is used in all datasets (somatic, polymorhisms, germline).
|
| ExonIntron |
Location of the mutation in the
introns or exons in TP53 gene. Terms occurring in this column
are "1-intron"' to "11-intron" and
"2-exon" to "11-exon". An "i" or "e" in front mean that the mutation
is located within the indicated intron or exon with no information on the precise location.
|
| Genomic_nt |
Genomic nucleotide position of the
mutation based on the Genbank NC_000017 (7512445..7531642) reference sequence. |
| Codon_number |
For mutations in exons, codon number
at which the mutation is located (1-393). If a mutation spans more
than one codon, (e.g. tandem mutation or deletion of several
bases) only the first (5') codon is entered. For mutations in
introns, 0 is entered. |
| Splice_site |
Indicate if the position of the mutation falls within conserved nucleotides of p53 consensus, criptic or alternative splice sites:
consensus SD or SA= conserved dinucleotides of p53 consensus splice sites (SD for splice donor site, SA for splice acceptor site) that produce the full-lenght p53 protein (TA isoform);
criptic SD or SA= conserved dinucleotides of splice sites (gt or ag) that have been observed experimentally in p53 sequences in which a consensus plice site was mutated;
alternative SD or SA= conserved dinucleotides of splice sites (gt or ag) responsible for producing p53 isoforms beta and gamma;
alternative = nucleotides in the "cassette" sequence responsible for producing the p53 delta isoform;
no= position is outside the above mentioned nucleotides.
Information on splice site can be found here. |
| CpG_site |
Yes or No indicate if the position of the mutation falls within a CpG
sequence or not. To see the position of all CpG sites in TP53 coding
sequence, click here. |
| Base |
Wild-type base at the position of the mutation. |
| Mutant_nucleotide |
Mutant base. |
| Description |
For substitutions, nucleotide
change read from the coding strand by convention. For
deletions and insertions, the number of bases deleted (del) or
inserted (ins) is given. For more complex mutation events, a full
description is given as indicated in the original publication.
Note that the annotation system has been modified on February 7th, 2007
to fix errors that were previously generated in the dowloaded files.
|
| c_description |
Mutation nomenclature according to HGVS standards with the coding sequence as reference. |
| Type |
Nature of the mutation. The terms
occurring in this column are "A:T>C:G" (A to C or T
to G base change), "A:T>G:C" (A to G or T to C base
change), "A:T>T:A" (A to T or T to A base change),
"G:C>A:T" (G to A or C to T base change at non CpG
sites), "G:C>A:T at CpG" (G to A or C to T base
change at CpG sites),
"G:C>C:G" (G to C or C to G base change),
"G:C>T:A" (G to T or C to A base change),
"tandem" (two consecutive base changes), "ins"
(insertion), "del" (deletion) and "complex"
(complex changes). |
| Mut_rate |
Substitution rates were calculated for all single base substitutions in the coding sequence of p53 according to the dinucleotide substitution rates
derived from human-mouse aligned sequences of chromosomes 21 and 10 (Lunter and Hein 2004).
The mutation probabilities for a given single nucleotide substitution are calculated by averaging the dinucleotide substitution
rates at that position for the forward and reverse strands. |
| WT_codon |
For mutations in exons, normal base
sequence of the codon in which the mutation occurred. |
| Mutant_codon |
Base sequence of the mutated codon. |
| WT_AA |
Wild-type amino acid encoded at the
codon in which the mutation occurred (three-letter or one-letter amino acid
abbreviation). AA letter code Genetic code |
| Mutant_AA |
Mutated amino acid encoded at the
codon in which the mutation occurred (three-letter or one-letter amino acid
abbreviation). The chain terminating mutations due to single base
substitutions are designated by "stop". AA letter code Genetic code |
| ProtDescription |
Mutation description at the protein
level as recommended by HGVS |
| Mut_rateAA |
Mutation rate of amino-acid substitution calculated by summing up the nucleotide substitution rates (see above).
This value is only valid for amino-acid substitutions resulting from single nucleotide substitutions. |
| Effect |
Effect of the mutation. The terms
occurring in this column are: missense (change of one amino-acid), nonsense (stop codon), FS
(frameshift), silent (no change in the protein sequence),
splice (mutations located in the two first and two last conserved nucleotides of the introns and are thus predicted to alter splicing,
or mutations that have been shown to alter splicing experimentally), other (inframe deletions or insertions, mutations
in introns) |
| Polymorphism |
Polymorphic status of the gene variation.
Validated : validated by frequency in control population(s) with freq >0.5% according to dbSNP, PubMed or IARC data;
No : validated by frequency in control population(s) with freq <0.5% according to dbSNP, PubMed or IARC data,
or mutation present in IARC somatic dataset with n>=20 and no frequency data in control populations,
or STOP, FS and splice mutations;
or mutations associated with LFS/LFL-B/Chompret criteria.
Unknown : all other mutations.
|
| Validation |
Type of validation for polymorphism, as indicated in NCBI SNP database.
|
| SNPlink |
Link to NCBI SNP database.
|
| Residue function |
Known function of the wild-type residue. When the function is not known but the structure
is known, the solvent accessibility (SA) of the residue is indicated by the terms buried, exposed or partially exposed
(SA calculated with Naccess and 1TSR (chain B) structure of p53: <20 = buried, >=20 and <50 = partially exposed, >=50 = exposed). |
| Domain function |
Function of the domain in which the mutated residue is located. |
| Structural_motif |
2D and 3D motifs where the mutation is located according to structures described in Cho
et al. (1994) and May and May (1999) |
| SA |
Solvent accessibility of the wild-type residue as calculated with
Naccess and the 1TSR (chain B) structure of p53. |
| GV |
Grantham variation. GV is a measure of the amount of observed biochemical
variation at a particular position in a multiple sequence alignment. GV was calculated with an alignment containing 9 sequences of p53 from fish to placental mammals with
the A-GVGD program. More details here. |
| GD |
Grantham deviation. GD is a measure of the deviation of the mutated residue from the different types of residues observed at that postion in a multiple sequence alignment.
It is derived from GV and the Grantham matrix. More details here. |
| AGVGDclass |
Prediction of functional impact based on protein sequence conservation, taking into acount GV and GD (see above). Mutations are classified as "neutral", "deleterious" or "unclassified". More details here. |
| SIFTclass |
Functional classification based on SIFT program using the same sequence alignment as for AGVGDclass and program default settings. Missense mutations are classified as "deleterious" or "neutral". |
| Transactivation |
Promoter-specific transcriptional activity measured in yeast functional assays and expressed as percent of wild-type activity.
Data from Kato et al. |
| Transactivation class |
Functional classification based on the overall transcriptional activity on 8 different promoters as measured by Kato et al.
For each mutant, the median of the 8 promoter-specific activities (expressed as percent of the wild-type protein) is calculated and missense mutations are classified as "non-functional" if the median is <=20,
"partially functional" if the median is >20 and <=75, "functional" if the median is >75 and <=140, and "supertrans" if the median is >140. |
| Structure/Function class |
Functional predictions derived from a computer model that takes into account the 3D structure of WT and mutant proteins and is trained on the transactivaton dataset from
Kato et al. Mutations are classified as "functional" or "non-functional". More details here. |
| DNE class |
Dominant-negative activity over wild-type p53. Classification established for 121 mutants either with available DNE status on both WAF1 and RGC promoters (these promoters were the most frequently used in different
studies to assess DNE status), or mutants from a large systematic study by Dearth et al that included 76 mutants.
Mutants were classified as "Yes" if they had dominant-negative activity on both WAF1 and RGC promoters, or on all promoters in the large study, "Moderate" if they had dominant-negative activity on some promoters and not on others, and "No" if they had no dominant-negative activity on both WAF1 and RGC promoters, or none of the promoters in the large study. |
| EffectGroup3 |
Mutation classification based on protein 3D structure and mutation type.
This classification has been used to derive gentoype-phenotype correlations in sporadic breast cancers (Olivier et al., 2006).
0=silent+intron;
1=missense in DNA-binding loops(L2,H1,L3,L1,S2,S2',H2);
2=other missense;
3=inFrame del/ins;
4=FS+splice+nonsense.
|
| SwissProtLink |
SwissProt identification number with link to the variant page of the SwissProt database. |
|
Top |
| Tumor site, type and origin
Tumor samples are classified according to standards
of the International Classification of Diseases
for Oncology (ICD-O 3rd Edition, World Health Organization, Geneva, 2000) and SNOMED.
For information on tumor classification, grading and staging, check out
ICD-O training at SEER,
Collaborative staging initiative,
Cancer Information at NCI,
and Oncologychannel.com.
| Column
head |
Description |
| Sample_source |
Nature of the sample from which the
mutation has been identified: cell-line, surgery (surgical or
autopsy specimen, including fresh samples and archival, pathology
specimen), biopsy, xenograft, body fluid (blood, saliva,
urine...). |
| Tumor_Origin |
Origin of the tumor sample. Terms
occurring in this column are: primary, secondary (second primary
tumor in the same patient), metastasis (with the localisation of
the metastasis in brackets), recurrent (tumor recurrence). |
| Topography |
Site of the tumor defined by organ or
group of organs, according to the ICD-O nomenclature. (examples:
"colon", "brain", "bronchus and
lung"). Note that some tumors are annoted
"Head&Neck,NOS" or "Colorectum,NOS"
because no detail is given in the original publication (NOS= not
otherwise specified).
For the database search tool, a short name is used in place of the
ICD-O name (example: "Lung" for "bronchus and
lung"). See a
numerical list of topographies.
For metastasis, the topography corresponds to the primary
site of the tumor and the site of metastasis is indicated in
brackets in the tumor_origin field. |
| Short_topography |
For the database search tool, a short
name is used in place of the ICD-O name (example: "Lung"
for "bronchus and lung"). See
a numerical list of topographies. |
| Topo_code |
ICD-O
code for topography. More here. |
| Sub_topography |
Precise identification of anatomic
site, organ or tissue. The description given in the publication is
translated to ICD-O nomenclature. |
| Morphology |
Tumor type, including morphology
and/or histologic type. The terminology used is based on ICD-O (2nd
and 3rd editions) and SNOMED classifications. Terms have been
added, such as 'normal tissue' or 'na'. See
alphabetical list of morphologies. |
| Morpho_code |
ICD-O
or SNOMED codes for morphology. More here. |
| Grade |
Information on tumor grade, as given
in the cited publication. |
| Stage |
Information on tumor stage, as given
in the cited publication. |
| TNM |
TNM classification (Tumor size, Node
status, Metastasis status) for staging. For information on this
classification system, click here. |
|
Top |
Demographic information of patient/individual
| Column
head |
Description |
| Sex |
Sex of the patient (M for male, F for
female). |
| Age |
Age of the patient at the time of
diagnosis. |
| Ethnicity |
Ethnicity of the patient (when
available). Groups are defined as: Asian, Black, Caucasian... |
| Population |
Patient population group. To see the
country/population classification click
here. |
| Country |
Country in which the patient was
living at the time of surgery. When not otherwise specified in the
original publication, the country corresponding to the address of
the hospital is entered. |
| Geo_area |
City or region within the country of
living of the patient. When not specified in the original
publication, the city where the surgery has been done is entered. |
|
Top |
Reference
The same references (same Ref_ID) are used for the somatic,
prevalence and prognosis data sets. Independent references are used for
the Function and Germline data sets.
|
Top |
Mutation detection method
| Column
head |
Description |
| Tissue_processing |
Indicates if the sample analysed was
fresh, fixed or frozen. |
| Start_material |
Indicates if DNA or RNA was used to
search for mutations. |
| Prescreening |
Method used to detect the mutations:
‘SSCP’ for single strand polymorphism, ‘DGE’ for
denaturant gel electrophoresis, ‘Yeast’ for yeast assay,
‘none’ for direct sequencing’, etc… |
| Material_sequenced |
Indicates if the DNA or RNA was cloned
or not before sequencing. |
| Exon2-11 |
"-1" or "TRUE" indicate that the exon has been
screened and "0" or "FALSE" indicate that it has not been screened. |
|
Top |
List of abbreviations used in the database
A...E
| 5-aza-CdR |
5-aza-2’-deoxycytidine |
| AILD |
Angioimmunoblastic Lymphadenopathy
with Dysproteinemia |
| ALL |
Acute lymphoblastic leukemia |
| AML |
Acute myeloid leukemia |
| APC |
Adenomatous polyposis coli gene |
| BLEO |
bleomycin |
| CHEM |
Chemotherapy |
| CIS |
Carcinoma in situ |
| CDDP |
cisplatin |
| CML |
Chronic Myeloid Leukemia |
| CMML |
Chronic Myelomonocytic Leukemia |
| CPT |
camptothecin |
| ddF |
Dideoxyfingerprinting |
| DFI |
Disease-free interval |
| DGE |
Denaturant gel electrophoresis |
| Dif. |
Differentiated |
| DBM |
DNA-binding motif |
| DOX |
doxorubicin |
| Duke’s |
Classification for colon
cancer |
| EBV |
Epstein-Barr Virus |
| ER |
Estrogen Receptor |
| ETO |
etoposide |
|
Top |
F...J
| |
FAB classification for ALL |
| FAP |
Familial adenomatous polyposis |
| FGO |
Female Genital Organs |
| FH |
Family history |
| FIGO |
Classification for gynecological
cancers |
a
| GEM |
gemcitabine |
| H’sD,
HD |
Hodgkin’s disease |
| HBV |
Hepatitis B virus |
| HCC |
Hepatocellular carcinoma |
| HCV |
Hepatitis C virus |
| HNPCC |
Hereditary Nonpolyposis Colorectal
Cancer |
| HPV |
Human papilloma virus |
| HNSCC |
Head and neck squamous cell carcinoma |
| ICD-O |
International classification of
disease for oncology |
| IHC |
Immunohistochemistry |
|
Top |
K...O
| KAT |
Potassium Antimony Tartrate |
| LFL |
Li-Fraumeni like syndrome |
| LFS |
Li-Fraumeni syndrome |
| LN |
Lymph node |
| MTX |
methotrexate |
| MDE |
Hydrolink mutation detection
enhancement |
| MDS |
Myelodysplastic syndrome |
| MGO |
Male Genital Organs |
| Moderately dif. |
Moderately differentiated |
| MMR |
MisMatch Repair |
| MSI |
Microsatellite instable/td>
|
| MSS |
Microsatellite stable/td>
|
| Mx |
Presence of distant metastasis
according to the TNM classification |
| NA |
not applicable/ not available |
| ND |
not done |
| NDBL |
Non-DNA-binding loop |
| neg. |
negative |
| NES |
Nuclear Export Signal |
| NIM |
nimustine |
| NLS |
Nuclear Localization Signal |
| NHL |
Non-Hodgkin’s lymphoma |
| NOS |
Not otherwise specified |
| NOS2 |
Nitric oxide synthase 2 |
| NSCLC |
Non-small cell lung cancer |
| Nx |
Extent of regional LN metastasis
according to the TNM
classification |
| OXA |
oxaliplatin |
|
Top |
P...T
| PJS |
Peutz-Jeghers Syndrome |
| Poorly dif. |
Poorly differentiated |
| pos. |
positive |
| PR |
Progesterone Receptor |
| RA, RAEB, RAS |
Classification for Myelodysplastic
syndrome |
| RAD |
Radiotherapy |
| RFLP |
Restriction fragment length polymorphism |
| RSP |
Residual score profile. ? |
| SCLC |
Small cell lung cancer |
| SSCP |
Single strand conformation
polymorphism |
| TAX |
taxol |
| TGGE |
Temperature gradient gel
electrophoresis |
| TOP |
topotecan |
| Tx |
Extent of primary tumor according to
the TNM classification |
| TxNxMx |
TNM
classification (tumor stage, presence or absence of LN
metastasis, presence or absence of distant metastasis) |
|
Top |
U...Z
| Undif. |
Undifferentiated |
| VIN |
vinblastine |
| VINC |
vincristine |
| VM26 |
teniposide |
| Well dif. |
Well differentiated |
| XP |
Xeroderma
pigmentosum |
| Yeast |
Yeast assay |
| yo |
Years old |
|
|