
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Spotlight on Molecular Profiling
Detailed DNA methylation profiles of the E-cadherin promoter in the NCI-60 cancer cells
1 Genomics and Bioinformatics Group, Laboratory of Molecular Pharmacology, Center for Cancer Research, National Cancer Institute, NIH, Bethesda, Maryland; 2 Gene Logic, Gaithersburg, Maryland; 3 Science Applications International Corporation-Frederick, Inc., National Cancer Institute at Frederick, Frederick, Maryland; and 4 Epigenetics Unit, Department of Medicine, Johns Hopkins University School of Medicine, Baltimore, Maryland
Requests for reprints: William C. Reinhold, Laboratory of Molecular Pharmacology, Center for Cancer Research, National Cancer Institute, NIH, Building 37, Room 5056, Bethesda, MD 20892-4255. Phone: 301-496-9572; Fax: 301-402-0752. E-mail: wcr{at}mail.nih.gov
Abstract
E-cadherin (E-cad) is a transmembrane adhesion glycoprotein, the expression of which is often reduced in invasive or metastatic tumors. To assess E-cad's distribution among different types of cancer cells, we used bisulfite-sequencing for detailed, base-by-base measurement of CpG methylation in E-cad's promoter region in the NCI-60 cell lines. The mean methylation levels of the cell lines were distributed bimodally, with values pushed toward either the high or low end of the methylation scale. The 38 epithelial cell lines showed substantially lower (28%) mean methylation levels compared with the nonepithelial cell lines (58%). The CpG site at -143 with respect to the transcriptional start was commonly methylated at intermediate levels, even in cell lines with low overall DNA methylation. We also profiled the NCI-60 cell lines using Affymetrix U133 microarrays and found E-cad expression to be correlated with E-cad methylation at highly statistically significant levels. Above a threshold of
20% to 30% mean methylation, the expression of E-cad was effectively silenced. Overall, this study provides a type of detailed analysis of methylation that can also be applied to other cancer-related genes. As has been shown in recent years, DNA methylation status can serve as a biomarker for use in choosing therapy. [Mol Cancer Ther 2007;6(2):391403]
Introduction
E-cadherin (E-cad) is a transmembrane glycoprotein normally expressed in the plasma membranes of epithelial cells, in which it mediates homophilic, Ca2+-dependent intercellular adhesion in adherens junctions (1). It interacts with the intracellular
, ß, and
catenins (2) and, through those molecules, is connected to the actin cytoskeleton. It acts as a tumor and invasion suppressor (3, 4), the loss of which has been associated with tumorigenesis (5) and increased metastatic potential (6, 7). E-cad is down-regulated in a wide variety of tumors originating from epithelial cells (5, 810). Its loss is an indicator of poor prognosis in both breast (11) and prostate (7, 12) cancers.
Multiple mechanisms can reduce E-cad expression. Silencing or reduction of expression has been associated with germ line mutations (13, 14), single nucleotide polymorphisms (15), frame shift and splice site mutations (16, 17), gene deletion (at 16q22.1; ref. 18), and epigenetic events such as histone deacetylation (19), chromatin condensation (20), and promoter region methylation in epithelial tumors (8, 9, 16, 21). Such epigenetic modifications can play important roles in cancer initiation and progression. Those modifications include global or gene-specific promoter region hypomethylation or hypermethylation, chromatin modification, and loss of imprinting (22). Promoter region hypermethylation sometimes provides the "second hit" on a remaining intact allele (23) in the context of Knudson's two-hit model of tumor suppressor gene inactivation. E-cad silencing associated with promoter region methylation was first described in gastric cancer (10).
The E-cad gene has a CpG island that includes the promoter, exon 1, intron 1, and exon 2 (24). That island is methylated in some epithelial tumors (25). A general association between methylation status and transcript level of E-cad in epithelial tumors has been established for renal (9, 26), bladder (21), and prostate cancers (8). In nonepithelial cell types, the role of E-cad is varied. Glial cells and leukocytes generally do not express E-cad. However, in melanocytes, cell-cell relationships in the skin are determined in part by E-cad (27). In skin, normal melanocytes interact with keratinocytes (28). During the transition to melanoma, E-cad tends to be lost, with concurrent increased expression of N-cadherin, resulting in increased communication between melanoma cells, increased communication between melanocytes and fibroblasts, and loss of association between melanocytes and keratinocytes (2830).
The current study makes use of the NCI-60 panel that consists of 60 diverse human cancer cell lines which have been used by the National Cancer Institute's Developmental Therapeutics Program to screen and profile >100,000 chemically defined compounds (plus a large number of natural product extracts) since 1990 (31, 32). Included are 38 epithelial and 22 nonepithelial lines derived primarily from patients with advanced and/or metastatic disease. In large part because of the link to molecular pharmacology and drug discovery, the NCI-60 have been more extensively and diversely profiled at the molecular level than any other set of cells in existence (3340). In November 2006, Molecular Cancer Therapeutics launched a new series under the rubric "Spotlight on Molecular Profiling" with three articles on molecular characterization of the NCI-60 (4143).
Here, in the context of the Spotlight series, we present detailed profiles of E-cad methylation in the NCI-60 cell lines obtained by the "gold-standard" bisulfite DNA sequencing method. For those studies, we designed PCR primers such that the amplicon would include all 25 CpG sites in the E-cad "minimal promoter region," 191 to +94 bp relative to the transcriptional start (10), plus four additional CpG's at the 3' end. In addition, we profiled E-cad expression levels in the NCI-60 using Affymetrix U133 microarrays and compared the results with those for E-cad methylation. We were not surprised to find a statistically highly significant negative correlation between the two. We were surprised, however, by the shape of the relationship, which was L-shaped; there seemed to be a "turn-off" methylation threshold level of
20% to 30% above which E-cad expression is essentially abolished.
Materials and Methods
Cell Lines
The NCI-60 cell lines were obtained from the NCI Developmental Therapeutics Program5 and cultured as described previously (36). Briefly, they were thawed from frozen stocks and cultured in RPMI 1640 (Cambrex, Walkersville, MD) with 5% FCS (Atlantic Biologicals, Norcross, GA) and 2 mmol/L of glutamine (Life Technologies, Inc., Rockville, MD). They were grown to
80% confluence in T-175 flasks (the last 24 h in fresh medium) before harvest.
RNA and DNA Isolation
RNA was isolated as described previously (36). Briefly, total RNA was purified using the RNeasy purification kit (Qiagen, Inc., Valencia, CA) according to the instructions of the manufacturer. Genomic DNA was purified from cells using the QIAamp DNA Blood Maxi kit (Qiagen) according to the instructions of the manufacturer. Samples were resuspended in 10 mmol/L of Tris and 1 mmol/L of EDTA (pH 8.0). Purified DNA was quantitated by spectrophotometry and aliquoted for storage at 80°C.
U133 Affymetrix Microarray Analysis of Transcript Expression
U133 A and B chips provide analysis of 22,215 features (including
14,500 known genes) and 22,577 features (including 9,606 known genes), respectively. Robust Multichip Analysis was used to process the data. The expression profiling was done in collaboration with U. Scherf, D. Dolginow, and colleagues at Gene Logic, Inc. (Gaithersburg, MD). The methods and the results for all genes on the A-chip are described elsewhere.6
Sodium Bisulfite DNA Modification
Genomic DNA (5 µg) from each cell line was treated with sodium bisulfite at 50°C for 17 h using the CpGenome DNA Modification kit from Chemicon International (Temecula, CA) according to the instructions of the manufacturer (except for a 5x volume scale-up through the first washing step on day 2). The DNA was then resuspended in 125 µL of 10 mmol/L Tris with 1 mmol/L of EDTA (pH 7.4; K-D Medical, Columbia, MD). The protocol produced enough material for
100 PCR sequencing reactions.
PCR Amplification and Sequencing
Nested PCR amplification and sequencing of the DNA were carried out using either converted or unconverted DNA as template for the PCR. Primers were based on the published consensus E-cad promoter DNA sequence (GenBank accession no. L34545). Two pairs of primers were used. For the bisulfite-converted DNA, the first pair consisted of E-cad-nest1 GATTTTAGGTTTTAGTGAGTT upstream (sequence position 397 to 377) and E-cad-nest2/4 GGAAACAGCTATGACCATGAA CTCCAAAAACCCATAACTAA downstream (sequence position 6 to +16). These were the outer primers used to anneal and amplify a 413 bp fragment of deaminated DNA in the first round of PCR. The second pair, E-cad-nest3 GTAAAACGACGGCCAGTTATTTAGATTTTAGTAATTTT (upstream, sequence position 319 to 299) and E-cad-nest4 (same as E-cad-nest2) inner primers (with 5' m13 tails) were then used to amplify a smaller (335 nucleotide) but higher-quality product. For the unconverted DNA, the same locations of primers were used, with E-cad-nest1 GATCCC AGGTCTTAGTGAGCC, E-cad-nest2/4 GGAAACAGCTATGACCATGTTCTCCAAGGGCCCATG GCTAA, and E-cad-nest3 GTAAAACGACGGCCAGCCACCTAGACCCTAGCAACTCC. The primers did not contain CpG's and thus would be expected to amplify the DNA without regard to its methylation status. Their design assumed complete C-to-T conversion after bisulfite-treatment. One-strand automated sequencing of the PCR products was done.
Analysis and Visualization of Sequences Using MethMiner
Because no available software satisfied our requirement for high-throughput analysis and visualization of the bisulfite sequencing results, we developed the MethMiner program package.7 MethMiner is a multifunctional tool that (a) determines the levels of CpG and non-CpG cytosine methylation, (b) incorporates sequence information from nonconverted DNA into the assessment of methylation, (c) creates aligned sequence representations, (d) creates single-page depictions of CpG and non-CpG methylation patterns, and (e) provides numerical representations of methylation status for statistical analysis. MethMiner therefore serves as an aid to the identification of patterns in the many hundreds of sequence reads generated by this project. However, it was not designed to do quality control for automated identification of sequencing errors or mutations, except insofar as they are suggested by visualization of the cytosine by cytosine patterns (see below). We did the essential quality control steps by going manually through each sequence tracing, then comparing the results with those from bisulfite sequencing of corresponding DNA not treated with bisulfite and with the normal human sequence from the Entrez Nucleotide public database.8
The input to MethMiner included both chromatograph trace data and sequence information. After multiple alignment of the sequences using Clustal-W (version 1.74) software,9 and uploading of peak information from the sequence traces, we used MethMiner to analyze the levels of C-to-T conversion [expressed as C / (C + T) ratios] for CpG and non-CpG cytosines. The tracing for each sequencing reaction was inspected visually for obvious flaws, including small peak size, lack of peak separation, and high background. The sequences were sorted first by cell line, then by reaction set. That is, sequence reads were grouped if they had both the same bisulfite conversion date and the same sequencing date. If there were multiple sequencing reads in one of those groups, we calculated a group mean value for each of the 29 CpG methylation levels. The final value for methylation level of a CpG site in a particular cell line was then taken (without correction for incomplete conversion) as the mean of the group mean values. Figure 2 visually indicates the reproducibility of the procedure for a set of reads (38 in all) representing different dates of bisulfite reaction and chromatographic sequencings for one of the cell lines.
|
|
To test whether different CpG sites showed different distributions of methylation, we applied the Kolmogorov-Smirnov test to each pair of sites. Because many tests were being done in parallel, a multiple-comparisons correction was made. To estimate the joint distribution of Kolmogorov-Smirnov statistics, under the assumption of no differences in distribution, we filled in a 60 x 29 matrix of values by sampling randomly from the combined distributions of all CpG sites. We then computed the Kolmogorov-Smirnov statistics for all 29 x 28 / 2 possible comparisons and saved the maximum of those values. The procedure was repeated 10,000 times to estimate the null distribution of maximal Kolmogorov-Smirnov statistics under the assumption of no differences in distribution.
Results
DNA Methylation Profiles
We used bisulfite sequencing to assess the methylation profiles of E-cad in the NCI-60 cell lines. The mean background level of non-CpG C-to-T conversion after the bisulfite reaction (over all data that passed quality control tests) was 95.4 ± 2.4% (mean ± SD), indicating highly efficient chemical conversion (data not shown). Table 1 presents the percentage of methylation, 100% x C / (C + T), for each of the 29 CpG sites for each of the NCI-60 cell lines, as well as their mean. Included among the NCI-60 are nine tissue-of-origin types: breast (BR), central nervous system glial (CNS), colon (CO), nonsmall cell lung, ovarian (OV), prostate (PR), and renal (RE) cancers plus leukemias (LE) and melanomas (ME). Overall, the cell lines showed a wide range (from 5% to 99%) of mean methylation levels over the 29 CpG sites. Mean methylation levels for the entire NCI-60, the epithelial cell lines, and the nonepithelial cell lines were 40%, 28%, and 58%, respectively. For the purposes of this study, the cell line MDA-MB435 and its ERBB2-transfectant derivative, MDA-N, were classified as melanomas despite the fact that MDA-MB435 was apparently obtained from the pleural effusion of a patient with breast cancer. MDA-MB435 has been reported to express milk fat proteins and cytokeratin markers characteristic of epithelial cells (46). However, we have found that the two cell lines are extraordinarily similar to the five NCI-60 melanotic melanomas in their profiles of sensitivity to thousands of drugs in the NCI screen (36), their transcript expression profiles (as assessed using six different microarray and RT-PCR platforms; refs. 36, 4751), and their protein expression profiles as assessed using two-dimensional gels (34) and reverse-phase lysate arrays (52, 53). Independent evidence supporting melanocytic origin has now been presented by others (54). Despite its original classification as MCF7 breast cancerderived, OVCAR-8/ADR will be considered here as ovarian in origin because of compelling evidence from our karyotypic analyses (35, 55) that it is a (drug-resistant) derivative of OVCAR-8. That conclusion has been corroborated by our gene expression studies and by our analyses of single nucleotide polymorphisms (43).
Figure 1A and B show two visualizations of CpG methylation of the E-cad promoter region. Both visualizations were generated by the MethMiner program package.7 Figure 1A is a base by base visualization of the methylation status of the NCI-60 for the portion of the promoter that contains CpG site nos. 5 to 10. Those CpG sites appear as the vertical stripes colored according to the figure legend (also see Supplemental Fig. S1 in which the non-CpG converted cytosines appear as vertical strips).11 In Fig. 1B, the CpG methylation profiles are summarized graphically for all 29 sites sequenced. The technical reproducibility of our bisulfite-sequencing protocol is indicated in Fig. 2 . Thirty-eight sequencing reactions from four independent bisulfite conversions, sequenced over a 67-day period, are displayed. The mean SD of the % methylation over the 29 CpG sites is 6.7, and the root mean square error is 7.8.
|
|
|
E-cad Expression Is Correlated with E-cad Methylation and Is Silenced Above
20% to 30% Methylation
Table 1 presents the E-cad transcript levels as measured by the Affymetrix U133 chip type.7 The mean methylation and expression patterns for E-cad correlated inversely at statistically significant levels (bootstrap two-tailed P < 0.05). Figure 5
, which summarizes the comparison of E-cad transcript and methylation patterns, shows an "L-shaped" relationship between mean percentage of methylation and mRNA expression. The transcript expression level is undetectable once the methylation level increases to >20% to 30%. When the level of methylation is below that level, the full range of E-cad expression levels is observed. The threshold and L-shape remain clear-cut if E-cad expression is represented on a linear, rather than logarithmic, scale. As we report elsewhere,12 several other gene expression platforms yield the same L-shaped relationship.
|
In this study, we assessed the promoter region CpG methylation profiles of E-cad across the NCI-60 cell lines, and then correlated the data with E-cad transcript expression. Because it proved almost impossible to analyze the patterns of methylation in that large amount of data (241 interpretable sequences), we developed the MethMiner program package.7 MethMiner aligns the sequences, produces several different types of color-coded graphics for pattern discernment (e.g., Fig. 1A and B), performs mathematical calculations, and facilitates checks of the four-color sequencer tracings for quality control.
After preliminary analysis of the data, we first asked how mean methylation levels differed from cell type to cell type. The most striking difference was that between epithelial cells (28% on average) and nonepithelial cells (58% on average), presumably reflecting the differences inherent in their normal counterparts. As indicated in Fig. 1B, there were also differences among organs of origin unrelated to the epithelial-nonepithelial dichotomy.
We next analyzed the distribution of methylation across the NCI-60 and found it to be bimodal (at both cell-mean and individual-site levels; Fig. 3A, B, and C). That observation is consistent with the concept that either high or low (but not intermediate) methylation may be the most stable genomic state (56).
We also asked if there were differences among the 29 individual CpG sites in their methylation levels. The answer seems to be yes. CpG site no. 4 (cytosine-143) stands out as being more highly methylated in many of the cell lines that otherwise have moderate to low levels of mean methylation. The Kolmogorov-Smirnov test with multiple comparisons correction yielded a statistically robust difference (P < 0.05) in pattern for CpG site no. 4. Cell lines with
25% mean methylation (close to the E-cad expression threshold from Fig. 5) have a mean methylation level of 12.9%, whereas site no. 4 for the same cell lines has a mean of 41.2%, with a low value of 12%. The dichotomy becomes even more pronounced for cell lines with the lowest (
10%) levels of methylation, 9 of 13 of which show measurable E-cad expression levels (Fig. 3). For those cells, an overall mean methylation level of 7.7% contrasts with the site no. 4 mean of 40.5%. Those findings are consistent with the concept of a "seeding" CpG site for E-cad that is methylated prior to other sites (57, 58), even in the presence of active transcription.
After the foregoing analysis of the methylation patterns themselves, we looked more closely to see how those patterns relate to E-cad transcript expression. Pearson correlation of the mean methylation pattern with E-cad transcript expression (Table 1) was quite strongly negative, at 0.38 (bootstrap P < 0.01). Because the Pearson correlation coefficient is a measure of linear association, it underestimates the degree of association, given the L-shape of the profile (Fig. 5). The approximately 13 cell lines with detectable E-cad expression (Table 1; Fig. 5) have methylation levels that range from 6% to 21%. E-cad transcript is not detectable in any of the cell lines with higher levels of methylation. Those findings suggest either that active E-cad transcription suppresses more extensive methylation within the promoter or that transcription is strongly inhibited by higher methylation. We had expected the general negative correlation between methylation and expression, in accord with the extensive literature on gene silencing, but the apparent threshold was a surprise.
We next asked whether any of the 29 individual E-cad CpG sites were especially predictive of E-cad expression level. Twenty-eight out of the 29 were statistically significant in their negative correlation with expression, the exception being site no. 4 (data not shown).
The current study doesn't attempt to address the normal versus cancer or normal versus normal question. Rather, we view the current work as a profiling study of E-cad methylation across the NCI-60 cancer cells. Accrual of differences in methylation during carcinogenesis is well accepted in the field, having been studied by several other groups, and documented extensively (5, 6, 810, 16, 21, 44, 59).
In conclusion, this review provides a detailed profile of the promoter region methylation status of E-cad in the NCI-60 cell lines and delineates the relationship of that methylation to silencing of the gene. It reports a bimodal distribution of methylation and a major difference in methylation between epithelial and nonepithelial cancer cells types. It also indicates that CpG site no. 4 is partially methylated in cell lines that both transcribe E-cad and have low overall methylation levels, consistent with the idea that site no. 4 is a seed for the methylation process, and perhaps that there is differential methylation of the two DNA strands. Analysis of the association between promoter region methylation and expression of E-cad led to the novel finding of an apparent threshold at
20% to 30% methylation beyond which E-cad expression is effectively silenced. Based on the results, we have analyzed the relationship of E-cad methylation and expression to the sensitivity of compounds tested in the NCI-60 screen. The results will be presented separately.13 Overall, this study provides a type of detailed analysis of promoter region methylation that can be applied to additional cancer-related genes. The implications for therapy are clear in that the DNA methylation states of individual genes have proved useful as biomarkers for individualization of therapy (60, 61).
Footnotes
Grant support: In part by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research, and in part by the NCI under contract no. NO1-CO-12400.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
Disclaimer: By acceptance of this article, the publisher or recipient acknowledges the right of the United State Government to retain a nonexclusive, royalty-free license and to any copyright covering the article. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organization imply endorsement by the U.S. Government.
5 See http://dtp.nci.nih.gov/. ![]()
6 U. Shankavaram, W. Reinhold, S. Nishizuka, et al. Transcript and protein expression profiles of the NCI-60 cancer cell panel: an integromic microarray analysis. Mol Cancer Ther 2006. Submitted for publication. ![]()
7 S. Kim, manuscript in preparation. ![]()
8 Available from: http://www.ncbi.nlm.nih.gov/entrez/. ![]()
9 Available from: http://molbio.info.nih.gov. ![]()
10 Available from: http://www.r-project.org/. ![]()
11 Supplementary material for this article is available at Molecular Cancer Therapeutics Online (http://mct.aacrjournals.org/). ![]()
12 Reinhold et al., in preparation. ![]()
13 Reinhold et al., manuscript in in preparation. ![]()
Received 10/ 3/06; revised 11/27/06; accepted 12/19/06.
References
This article has been cited by other articles:
![]() |
R. A. Irizarry, C. Ladd-Acosta, B. Carvalho, H. Wu, S. A. Brandenburg, J. A. Jeddeloh, B. Wen, and A. P. Feinberg Comprehensive high-throughput arrays for relative methylation (CHARM) Genome Res., May 1, 2008; 18(5): 780 - 790. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. S. Kristensen, T. Mikeska, M. Krypuy, and A. Dobrovic Sensitive Melting Analysis after Real Time- Methylation Specific PCR (SMART-MSP): high-throughput and probe-free quantitative DNA methylation detection Nucleic Acids Res., April 1, 2008; 36(7): e42 - e42. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |