an integrated database of chemosensitivity to 55 ...[cancer research 62, 1139–1147, february 15,...

10
[CANCER RESEARCH 62, 1139 –1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene Expression Profiles of 39 Human Cancer Cell Lines 1 Shingo Dan, Tatsuhiko Tsunoda, Osamu Kitahara, Rempei Yanagawa, Hitoshi Zembutsu, Toyomasa Katagiri, Kanami Yamazaki, Yusuke Nakamura, and Takao Yamori 2 Division of Molecular Pharmacology, Cancer Chemotherapy Center, Japanese Foundation for Cancer Research, Toshima-ku, Tokyo 170-8455 [S. D., K. Y., T. Y.]; Laboratory of Molecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo 108-8639 [O. K., R. Y., H. Z., T. K., Y. N.]; and SNP Research Center, RIKEN (Institute of Physical and Chemical Research), Minato-ku, Tokyo 108-8639 [T. T.], Japan ABSTRACT To explore genes that determine the sensitivity of cancer cells to anticancer drugs, we investigated using cDNA microarrays the expression of 9216 genes in 39 human cancer cell lines pharmacologically characterized on treatment with various anticancer drugs. A bioinformatical approach was then ex- ploited to identify genes related to anticancer drug sensitivity. An integrated database of gene expression and drug sensitivity profiles was constructed and used to identify genes with expression patterns that showed significant cor- relation to patterns of drug responsiveness. As a result, sets of genes were extracted for each of the 55 anticancer drugs examined. Whereas some genes commonly correlated with various classes of anticancer drugs, other genes correlated only with specific drugs with similar mechanisms of action. This latter group of genes may reflect the efficacy of each class of drugs. Therefore, the integrated database approach of gene expression and chemosensitivity profiles may be useful in the development of systems to predict drug efficacies of cancer cells by examining the expression levels of particular genes. INTRODUCTION Despite enormous efforts spent in the development of cancer chemo- therapies, often these therapies are effective only in a relatively small proportion of cancer patients. Whereas it has been long recognized that the effectiveness of anticancer drugs can vary significantly among indi- vidual patients, the same protocols are often applied without consider- ation of the different cancer cell characteristics of each patient. Because at present, there is no way to predict the effectiveness of a cancer chemotherapy for each case, cancer patients are still treated with the same protocols despite the possibility that the treatment may not be appropriate. Two major factors need to be considered when regarding the efficacy of particular anticancer drugs: (a) local blood or tissue drug concentra- tions can be influenced by the activities of metabolic enzymes, which may activate or inactivate the drug, and actions of drug transporters; and (b) differences in individual drug efficacies may also reflect the intrinsic susceptibility of cancer cells to those anticancer drugs. In the latter case, a number of genes has been reported to influence chemosensitivity, e.g., cancer cells expressing high levels of P-glycoprotein encoded by the ABCB1 (formerly MDR-1) gene are often resistant to a large subset of anticancer drugs, including vincristine, etoposide, and paclitaxel (taxol; Ref. 1). However, it has become obvious that the susceptibility of cancer cells to particular anticancer drugs cannot be predicted by a single factor but is determined by many factors that influence overall sensitivity. Therefore, to establish an appropriate protocol for the prediction of chemosensitivity, we need to accumulate information on the sets of genes that pharmacologically characterize cancer cells on treatment with par- ticular anticancer drugs. Taking advantage of a panel of 39 well-characterized human cancer cell lines (2) and a cDNA microarray system consisting of 9216 genes (3), we attempted to reveal genes associated with cancer cell chemo- sensitivity. Here, we report the construction of an integrated database of gene expression profiles and drug sensitivity patterns for 39 human cancer cell lines and the identification of gene sets that are likely to be involved in chemosensitivity. MATERIALS AND METHODS Cell Lines and Cell Culture. The human cancer cell line panel has been described previously (2) and consists of the following 39 human cancer cell lines: lung cancer, NCI 3 -H23, NCI-H226, NCI-H522, NCI-H460, A549, DMS273, and DMS114; colorectal cancer, HCC-2998, KM-12, HT-29, HCT- 15, and HCT-116; gastric cancer, MKN-1, MKN-7, MKN-28, MKN-45, MKN-74, and St-4; ovarian cancer, OVCAR-3, OVCAR-4, OVCAR-5, OVCAR-8, and SK-OV-3; breast cancer, BSY-1, HBC-4, HBC-5, MDA-MB- 231, and MCF-7; renal cancer, RXF-631L and ACHN; melanoma, LOX- IMVI; glioma, U251, SF-295, SF-539, SF-268, SNB-75, and SNB-78; and prostate cancer, DU-145 and PC-3. All cell lines were cultured in RPMI 1640 supplemented with 5% fetal bovine serum, penicillin (100 units/ml), and streptomycin (100 mg/ml) at 37°C in humidified air containing 5% CO 2 . The cell line panel has been used in anticancer drug screening programs at the Japanese Foundation for Cancer Research (2) and for modeling systems at the Developmental Therapeutics Program of the NCI (4) and has been exam- ined for susceptibility to a number of chemical compounds, including current anticancer drugs. Growth Inhibition Assay and Data Processing. Growth inhibition was assessed as changes in total cellular protein after 48 h of drug treatment using a sulforhodamine B assay. The GI 50 was calculated as described previously (2, 4). Identification of Gene Expression Profiles by cDNA Microarray. Cell lines grown as monolayers to log phase were washed twice with PBS, and total RNA was extracted with TRIzol reagent (Life Technologies, Inc.). After treatment with DNase I (Boehringer Mannheim) to remove contaminating genomic DNA, poly(A) RNA was extracted using an mRNA Purification Kit (Amersham Pharmacia Biotech). Probe cDNA-labeling reactions for each sample were per- formed as described previously (5), except that an oligodeoxythymidylate primer was used instead of random hexamers. Each sample was reverse transcribed in the presence of Cy5-labeled dCTP. A mixture of mRNA from all 39 cell lines was prepared as the control probe. The mRNA mixture was amplified by T7-based amplification and reverse transcribed in the presence of Cy3-labeled dCTP, as described previously (5–7). Sample and control-labeled probes were mixed to- gether and hybridized to cDNA microarray slides that contained 9216 human genes (3). Hybridized slides were scanned, and the fluorescence intensities of Cy5 (sample) and Cy3 (control) for each gene spot quantified by using a GenIII microarray scanner (Amersham Pharmacia Biotech) with Array Vision software (Imaging Research, Inc.). Each slide contained 52 housekeeping genes to normal- ize the signal intensities of the fluorescent dyes. The intensities of Cy5 and Cy3 were adjusted so that the mean Cy5 and Cy3 intensities of probe cDNA binding to the housekeeping genes were equivalent. Classification of Anticancer Drugs According to Drug Activity Pattern Against Human Cancer Cell Lines. A hierarchical clustering method was applied to the chemosensitivity data. Before applying the clustering algorithm, Received 7/2/01; accepted 12/10/01. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked advertisement in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 1 Supported by Grant-in-Aid for Scientific Research on Priority Areas (C) from the Ministry of Education, Culture, Sports, Science and Technology, Japan. 2 To whom requests for reprints should be addressed, at Division of Molecular Pharmacology, Cancer Chemotherapy Center, Japanese Foundation for Cancer Research, 1-37-1 Kami-Ikebukuro, Toshima-ku Tokyo 170-8455, Japan. Phone: 81-3-5394-4068; Fax: 81-3-3918-3716; E-mail: [email protected]. 3 The abbreviations used are: NCI, National Cancer Institute; GI 50 , drug concentration required for 50% growth inhibition; topo, DNA topoisomerase; TYMS, thymidylate synthetase. 1139 on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Upload: others

Post on 31-May-2020

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

[CANCER RESEARCH 62, 1139–1147, February 15, 2002]

An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and GeneExpression Profiles of 39 Human Cancer Cell Lines1

Shingo Dan, Tatsuhiko Tsunoda, Osamu Kitahara, Rempei Yanagawa, Hitoshi Zembutsu, Toyomasa Katagiri,Kanami Yamazaki, Yusuke Nakamura, and Takao Yamori2

Division of Molecular Pharmacology, Cancer Chemotherapy Center, Japanese Foundation for Cancer Research, Toshima-ku, Tokyo 170-8455 [S. D., K. Y., T. Y.]; Laboratory ofMolecular Medicine, Human Genome Center, Institute of Medical Science, The University of Tokyo, Minato-ku, Tokyo 108-8639 [O. K., R. Y., H. Z., T. K., Y. N.]; and SNPResearch Center, RIKEN (Institute of Physical and Chemical Research), Minato-ku, Tokyo 108-8639 [T. T.], Japan

ABSTRACT

To explore genes that determine the sensitivity of cancer cells to anticancerdrugs, we investigated using cDNA microarrays the expression of 9216 genesin 39 human cancer cell lines pharmacologically characterized on treatmentwith various anticancer drugs. A bioinformatical approach was then ex-ploited to identify genes related to anticancer drug sensitivity. An integrateddatabase of gene expression and drug sensitivity profiles was constructed andused to identify genes with expression patterns that showed significant cor-relation to patterns of drug responsiveness. As a result, sets of genes wereextracted for each of the 55 anticancer drugs examined. Whereas some genescommonly correlated with various classes of anticancer drugs, other genescorrelated only with specific drugs with similar mechanisms of action. Thislatter group of genes may reflect the efficacy of each class of drugs. Therefore,the integrated database approach of gene expression and chemosensitivityprofiles may be useful in the development of systems to predict drug efficaciesof cancer cells by examining the expression levels of particular genes.

INTRODUCTION

Despite enormous efforts spent in the development of cancer chemo-therapies, often these therapies are effective only in a relatively smallproportion of cancer patients. Whereas it has been long recognized thatthe effectiveness of anticancer drugs can vary significantly among indi-vidual patients, the same protocols are often applied without consider-ation of the different cancer cell characteristics of each patient. Becauseat present, there is no way to predict the effectiveness of a cancerchemotherapy for each case, cancer patients are still treated with the sameprotocols despite the possibility that the treatment may not be appropriate.

Two major factors need to be considered when regarding the efficacyof particular anticancer drugs: (a) local blood or tissue drug concentra-tions can be influenced by the activities of metabolic enzymes, whichmay activate or inactivate the drug, and actions of drug transporters; and(b) differences in individual drug efficacies may also reflect the intrinsicsusceptibility of cancer cells to those anticancer drugs. In the latter case,a number of genes has been reported to influence chemosensitivity, e.g.,cancer cells expressing high levels of P-glycoprotein encoded by theABCB1 (formerly MDR-1) gene are often resistant to a large subset ofanticancer drugs, including vincristine, etoposide, and paclitaxel (taxol;Ref. 1). However, it has become obvious that the susceptibility of cancercells to particular anticancer drugs cannot be predicted by a single factorbut is determined by many factors that influence overall sensitivity.Therefore, to establish an appropriate protocol for the prediction ofchemosensitivity, we need to accumulate information on the sets of genesthat pharmacologically characterize cancer cells on treatment with par-ticular anticancer drugs.

Taking advantage of a panel of 39 well-characterized human cancercell lines (2) and a cDNA microarray system consisting of 9216 genes(3), we attempted to reveal genes associated with cancer cell chemo-sensitivity. Here, we report the construction of an integrated databaseof gene expression profiles and drug sensitivity patterns for 39 humancancer cell lines and the identification of gene sets that are likely to beinvolved in chemosensitivity.

MATERIALS AND METHODS

Cell Lines and Cell Culture. The human cancer cell line panel has beendescribed previously (2) and consists of the following 39 human cancer celllines: lung cancer, NCI3-H23, NCI-H226, NCI-H522, NCI-H460, A549,DMS273, and DMS114; colorectal cancer, HCC-2998, KM-12, HT-29, HCT-15, and HCT-116; gastric cancer, MKN-1, MKN-7, MKN-28, MKN-45,MKN-74, and St-4; ovarian cancer, OVCAR-3, OVCAR-4, OVCAR-5,OVCAR-8, and SK-OV-3; breast cancer, BSY-1, HBC-4, HBC-5, MDA-MB-231, and MCF-7; renal cancer, RXF-631L and ACHN; melanoma, LOX-IMVI; glioma, U251, SF-295, SF-539, SF-268, SNB-75, and SNB-78; andprostate cancer, DU-145 and PC-3. All cell lines were cultured in RPMI 1640supplemented with 5% fetal bovine serum, penicillin (100 units/ml), andstreptomycin (100 mg/ml) at 37°C in humidified air containing 5% CO2. Thecell line panel has been used in anticancer drug screening programs atthe Japanese Foundation for Cancer Research (2) and for modeling systems atthe Developmental Therapeutics Program of the NCI (4) and has been exam-ined for susceptibility to a number of chemical compounds, including currentanticancer drugs.

Growth Inhibition Assay and Data Processing. Growth inhibition wasassessed as changes in total cellular protein after 48 h of drug treatment using asulforhodamine B assay. The GI50 was calculated as described previously (2, 4).

Identification of Gene Expression Profiles by cDNA Microarray. Celllines grown as monolayers to log phase were washed twice with PBS, and totalRNA was extracted with TRIzol reagent (Life Technologies, Inc.). After treatmentwith DNase I (Boehringer Mannheim) to remove contaminating genomic DNA,poly(A)� RNA was extracted using an mRNA Purification Kit (AmershamPharmacia Biotech). Probe cDNA-labeling reactions for each sample were per-formed as described previously (5), except that an oligodeoxythymidylate primerwas used instead of random hexamers. Each sample was reverse transcribed in thepresence of Cy5-labeled dCTP. A mixture of mRNA from all 39 cell lines wasprepared as the control probe. The mRNA mixture was amplified by T7-basedamplification and reverse transcribed in the presence of Cy3-labeled dCTP, asdescribed previously (5–7). Sample and control-labeled probes were mixed to-gether and hybridized to cDNA microarray slides that contained 9216 humangenes (3). Hybridized slides were scanned, and the fluorescence intensities of Cy5(sample) and Cy3 (control) for each gene spot quantified by using a GenIIImicroarray scanner (Amersham Pharmacia Biotech) with Array Vision software(Imaging Research, Inc.). Each slide contained 52 housekeeping genes to normal-ize the signal intensities of the fluorescent dyes. The intensities of Cy5 and Cy3were adjusted so that the mean Cy5 and Cy3 intensities of probe cDNA bindingto the housekeeping genes were equivalent.

Classification of Anticancer Drugs According to Drug Activity PatternAgainst Human Cancer Cell Lines. A hierarchical clustering method wasapplied to the chemosensitivity data. Before applying the clustering algorithm,

Received 7/2/01; accepted 12/10/01.The costs of publication of this article were defrayed in part by the payment of page

charges. This article must therefore be hereby marked advertisement in accordance with18 U.S.C. Section 1734 solely to indicate this fact.

1 Supported by Grant-in-Aid for Scientific Research on Priority Areas (C) from theMinistry of Education, Culture, Sports, Science and Technology, Japan.

2 To whom requests for reprints should be addressed, at Division of MolecularPharmacology, Cancer Chemotherapy Center, Japanese Foundation for Cancer Research,1-37-1 Kami-Ikebukuro, Toshima-ku Tokyo 170-8455, Japan. Phone: 81-3-5394-4068;Fax: 81-3-3918-3716; E-mail: [email protected].

3 The abbreviations used are: NCI, National Cancer Institute; GI50, drug concentrationrequired for 50% growth inhibition; topo, DNA topoisomerase; TYMS, thymidylatesynthetase.

1139

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 2: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

the GI50 values were log transformed, and the absolute values (�log10GI50�)were used for correlation analysis. Correlations were assessed by the Pearsoncorrelation coefficient as described previously (2, 8). Analyses were performedusing “Gene Spring” software (Silicon Genetics).

Classification of Human Cancer Cell Lines According to Gene Expres-sion Profiles. A hierarchical clustering method was applied to the geneexpression data. To obtain reproducible clusters, we selected only the 991genes that passed the cutoff filter [control Cy3 signals were �50,000 relativefluorescent units in �80% of cases, i.e., 32 of 39 cell lines examined; thefluorescence ratio (Cy5:Cy3) was greater than the mean ratio by at least 5-foldin at least 3 of 39 cell lines]. Analysis was performed using Gene Spring.Before applying the clustering algorithm, the fluorescence ratio for each spotwas log transformed (log2Cy5:Cy3), and the data for each sample centered toremove experimental biases. Correlations were assessed by the Pearson cor-relation coefficient as described (9, 10).

Correlation Analysis between Gene Expression Profiles and Chemosen-sitivity Profiles. We calculated the degree of similarity between drug activityand gene expression pattern using the Pearson correlation coefficient by thefollowing formula:

r ��� xi � xm� � yi � ym�

�� xi � xm�2 �� yi � ym�2

where xi represented the log expression ratio (log2Cy5:Cy3) of gene x in celli, whereas yi was the log sensitivity (�log10GI50�) of cell i to drug y. xm

represented the mean of the log expression ratio of gene x, and ym representedthe mean sensitivity (�log10GI50�) of the drug. In this analysis, we selectedgenes that passed the cutoff filter (signal intensities were �25,000 relativefluorescent units or signal:noise ratios were �3 in either Cy3 or Cy5 in �80%of cases, i.e., 32 of 39 cell lines examined). We then selected genes withexpression patterns that showed significant correlation to drug activity pat-terns. A significant correlation was defined as a P � 0.05 and a slope of theregression line �1.5, where the difference of the �log10GI50� values betweenthe most and the least sensitive cell lines was fixed as 1.

Clustering Analysis of Drug and Gene Profiles on the Basis of PearsonCorrelation Coefficients between Drugs and Genes. We performed cluster-ing analysis on the drugs and genes on the basis of the Pearson correlationcoefficients, as described previously (10). We used all 1071 genes that passedthe selection criteria described in the previous section for at least 1 of 55 drugs.The algorithm used to calculate the correlation between drug a and drug b wasthe standard correlation coefficient by the following formula:

r ��aibi

��ai2 �bi

2

where ai represented the Pearson correlation coefficient between drug a andgene i, whereas bi was the coefficient between drug b and gene i.

Similarly, the algorithm used to calculate the correlation between gene c andgene d was the standard correlation coefficient by the following formula:

r ��cjdj

��cj2 �dj

2

where cj represented the Pearson correlation coefficient between gene c anddrug j, whereas dj was the coefficient between gene d and drug j.

RESULTS

Developing Gene Expression and Chemosensitivity DatabasesUsing 39 Human Cancer Cell Lines. We examined the growth in-hibitory effect of 55 current anticancer drugs (Table 1) on 39 humancancer cell lines used to evaluate the efficacy of novel anticancer com-pounds (2). The sensitivity of each of the 39 cell lines to each drug wasassessed by GI50. The GI50 values were log transformed, and the absolutevalues (�log10GI50�) were used for further analysis. Hierarchical cluster-ing was performed on the 55 anticancer drugs based on their effect on the39 cell lines (Fig. 1). As a result, the 55 drugs were classified into several

clusters, each consisting of drugs with similar modes of action, e.g., onecluster included topo I inhibitors, such as camptothecin, irinotecan, andSN-38. The second cluster consisted of doxorubicin and other anthracy-cline compounds. 5-fluorouracil and its derivatives also clustered into asingle group. Interestingly, actinomycin D, which is not known to act ontubulin, clustered with tubulin binders, including taxanes and Vincaalkaloids, suggesting actinomycin D might partly share the mode ofaction with these drugs. Indeed, both actinomycin D and Vinca alkaloidswere shown previously to inhibit RNA synthesis in the cells. Theseresults indicated that our system using the 39 cancer cell lines was ableto classify drugs on the basis of mechanism of action. On the other hand,clustering analysis performed on the cancer cell lines according to che-mosensitivity data did not always reflect the tissue of origin, in agreementwith previous findings (10).

We then performed cDNA microarray analysis to investigate theexpression profiles of 9216 genes in each of the 39 cell lines. Theexpression of each gene was assessed by the ratio of expression levelin the sample against a universal control, and the values log trans-

Table 1 Anticancer drugs used in analysis

Type Drug name Target/mode of action

Vinca alkaloids Camptothecin Topo IIrinotecan Topo ISN-38 Topo IVincristine TubulinVinblastine TubulinVindecine Tubulin

Taxanes Paclitaxel TubulinDocetaxel Tubulin

Antibiotics Actinomycin D RNA synthesisBleomycin DNA synthesisPeplomycin DNA synthesisMitoxantrone DNA synthesisMitomycin C DNA alkylator

Anthracyclines Epirubicin DNA synthesis/Topo IIDoxorubicin DNA synthesis/Topo IIDaunorubicin DNA synthesis/Topo IIPirarubicin DNA synthesis/Topo IIAclarubicin DNA/RNA synthesis

Antimetabolites Methotrexate DHFRa

10EdAM DHFR5-fluorouracil PyrimidineDoxifluridine PyrimidineCarmofur PyrimidineTegafur PyrimidineCytarabine PyrimidineGemcitabine PyrimidineEnocitabine Pyrimidine6-thioguanine Purine6-mercaptopurine Purine

DNA alkylators Cisplatin DNA cross-linkerCarboplatin DNA cross-linkerOxaliplatin DNA cross-linkerBusulfan DNA cross-linkerMelphalan DNA cross-linkerThiotepa DNA cross-linkerDacarbazine DNA synthesisCarboquone DNA synthesisNitrogen mustard DNA synthesis

Other DNA-damaging agents Etoposide Topo IIAmsacrine Topo IIGenistein Topo IIEllipticine Topo IIICRF154 Topo IINeocarzinostatin DNA synthesisCladribine DNA synthesisTomudex DNA synthesis

Hormone analogues Tamoxifen Estrogen receptorToremifene Estrogen receptorEstramustine Estradiol/nitrogen mustard

Interferons Interferon-� Biological response modifierInterferon-� Biological response modifierInterferon-� Biological response modifier

Others STI 571 Abl kinase inhibitorL-asparaginase Protein synthesisPSC833 P-glycoprotein

a DHFR, dihydrofolate reductase; 10EdAM, 10ethyl-lodeaza-aminopterin.

1140

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 3: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

formed before analysis [log2 (sample)/(control)]. We then applied thehierarchical clustering method to the cell line data. Cell lines with thesame tissue of origin tended to form clusters, except for breast andovarian cancers (Fig. 2). We examined the expression profiles of thecell lines NCI-H23, OVCAR3, OVCAR5, and OVCAR8 twice usingindependent mRNA preparations and confirmed that the four cell linesclustered very closely with each other, indicating the reproducibilityof our experimental procedure (data not shown).

Correlation Analysis between Drugs and Genes Using an Inte-grated Database of Gene Expression and Chemosensitivity. Toscreen for genes that may be involved in chemosensitivity, we integratedthe two databases and performed correlation analysis between the geneexpression and drug sensitivity datasets. Comprehensive calculations forPearson correlation coefficients were performed on the gene expressionand drug sensitivity patterns of the 39 cell lines. Genes were selected thatexhibited significant correlations that satisfied the following criteria: a Pof the correlation �0.05 and a slope of the regression line �1.5, wherethe difference of the log10 GI50 values between the most and the leastsensitive cell lines was fixed as 1. As a result, different sets of genes wereextracted with respect to each of the 55 drugs tested (data not shown). Wethen examined for genes that were associated with �10 of the 55 drugs

Fig. 1. Two-dimensional hierarchical clustering was applied to growth inhibitory activity data of 55 anticancer drugs determined in 39 cell lines. Whereas drugs with similarmechanisms of action tended to cluster together, cell lines from the same tissue of origin did not. Cell line names are colored to reflect the tissue of origin. The tissue of origin of eachcell line is described in the abbreviated symbols: LU, lung cancer; CO, colorectal cancer; GA, gastric cancer; OV, ovarian cancer; BR, breast cancer; GL, glioma; RE, renal cancer; ME,melanoma; PR, prostate cancer. Colors on the imaging map represent the relative sensitivity of the cell line against the drug. Numbers beside the drug names refer to the differencein �log10GI50� from the mean �log10GI50� for each drug, e.g., the cell line with 3.0 (in red) represents 1000-fold more sensitivity than the mean.

Fig. 2. Hierarchical clustering was applied to cell lines on the basis of expression datafrom a set of 991 cDNAs measured across 39 cell lines. The 991 cDNAs were selectedfrom a total of 9216 genes by criteria described in “Materials and Methods.”

1141

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 4: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

and found 50 genes that were significantly correlated to sensitivity to arelatively wide range of anticancer drugs (Fig. 3). Aldose reductase(AKR1B1), which catalyzes the conversion of glucose to sorbitol (11),was most commonly associated with drug sensitivity, revealing a signif-icant correlation with 24 drugs. Damage-specific DNA binding protein 2

(DDB2), involved in DNA repair triggered by UV radiation (12), showeda positive correlation to 20 drugs. These genes were suggested to becommon predictive markers of chemosensitivity. LIM domain kinase 2(LIMK2), involved in actin skeleton remodeling (13), showed a negativecorrelation to 18 drugs, suggesting that LIMK2 may be a common

Fig. 3. Genes that significantly correlated with multiple drugs. Genes that correlated with �10 drugs are shown. Multiplicity is noted on the left of the gene symbols; the order ofdrugs is the same as for Fig. 4. Red, a positive correlation; green, a negative correlation. The shade of the color represents the P of the correlation.

1142

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 5: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

Fig. 4. Two-dimensional clustering analysis of genes and drugs on the basis of Pearson correlation coefficients (see “Materials and Methods”). Each row, a drug; each column, agene. Colors are same as used in Fig. 3. As for clustering analysis based on drug activity data, drugs with similar modes of action clustered.

1143

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 6: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

Fig. 5. Gene sets that commonly correlated with drugs with similar mechanisms of action. Colors are same as used in Fig. 3. The drugs examined were as follows: camptothecin and itsderivatives irinotecan and SN-38 (A); bleomycin and its derivative peplomycin (B); anthracycline compounds doxorubicin, epirubicin, and daunorubicin (C); and 5-fluorouracil and its derivativescarmofur, tegafur, and doxifluridine (D). We selected genes with common significant correlations to two of two drugs (B) or two or more drugs of three (A and C) or four (D).

1144

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 7: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

predictive marker of drug resistance. Interestingly, whereas overall pro-files of gene expression in cancer cells reflected their tissues of origin,these genes are not strongly associated with tissue of origin (data notshown).

Correlation of Gene Expression and Drug Sensitivity on theBasis of Pearson Correlation Coefficient between Genes andDrugs. To clearly identify correlations between gene expression pro-files and the sensitivity to anticancer drugs, we applied a hierarchicalclustering method on both drugs and genes, based on calculations ofPearson correlation coefficients between drug efficiency and geneexpression (Fig. 4). This data mining method was first developed byScherf et al. (10, 14). We used 1,071 genes that revealed significantcorrelations, by the criteria mentioned above, to at least one drug (see

“Materials and Methods,” data not shown). Over 50,000 (1,071 � 55)correlations were calculated and clustered. Similar to the clusteringanalysis performed using drug sensitivity data, sets of drugs withsimilar mechanisms of action tended to cluster with each other;however, some clusters were reorganized (Figs. 1 and 4). The resultindicated that the correlating gene sets tended to overlap with respectto drugs with similar mechanisms of action.

Putative Genes Involved in Chemosensitivity. We then searchedfor genes that correlated with clusters of drugs with similar mecha-nisms of action. For the 5-fluorouracil derivatives, we identified genesthat revealed significant correlations with at least two of the fourdrugs, 5-fluorouracil, carmofur, tegafur, and doxifluridine, as shownin Fig. 5D. Among the 51 genes that showed a negative correlation,

Fig. 5D. Continued.

1145

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 8: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

the gene encoding cyclin C (CCNC) exhibited a broad negativecorrelation, not only with 5-fluorouracil derivatives but also with topoinhibitors, anthracyclines, antifolates, and tubulin binders. In contrast,the genes coding KIAA0097 protein and survivin (BIRC5) showedspecific correlation only to fluoropyrimidines. Genes with positivecorrelations included three aldo-keto reductase family members(AKR1C1, AKR1C3, and AKR1B11), two aldehyde dehydrogenases(ALDH1 and ALDH3), and galectin 4 (LGALS4, in duplicate).

We performed similar analyses with respect to other sets of drugs,such as topo I inhibitors (camptothecin, irinotecan, and SN-38; Fig.5A), bleomycin derivatives (bleomycin and peplomycin; Fig. 5B), andanthracyclines (doxorubicin, epirubicin, and daunorubicin; Fig. 5C).As a result, we generated different sets of genes that revealed positiveand negative correlations with respect to different clusters of drugs. Ofthese, topo I inhibitors, bleomycin derivatives, and anthracyclinesclosely clustered with each other on the basis of both drug activity andPearson correlation coefficient, suggesting these compounds act bysimilar mechanisms (Figs. 1 and 4), e.g., genes encoding 14-3-3 �(SFN), LIM domain kinase 2 (LIMK2), and cathepsin H (CTSH)commonly showed negative correlations, whereas genes encodingDDB2 (DDB2) and the ALL-fused gene from chromosome1q (AF1Q)revealed positive correlations (Fig. 5, A–C). However, we also ob-served some differences, e.g., keratins (KRT7–8 and -19) correlatedwith camptothecins and bleomycin derivatives but not with anthracy-clines, whereas expression of the gene encoding transducer of ErbB21 (TOB1) positively correlated with anthracycline but not with bleo-mycin or camptothecins. Our results suggested that drugs with similarmechanisms of action shared sets of genes involved in cancer cellsusceptibility and that these gene sets varied with respect to clusters ofdrugs with similar mechanisms of action.

DISCUSSION

In this study, we investigated the gene expression profiles of 39human cancer cell lines that were pharmacologically characterized ontreatment with various anticancer drugs, using a cDNA microarraysystem consisting of 9216 genes. In conjunction with chemosensitiv-ity data, we identified genes expressed in the 39 cell lines withexpression patterns that correlated significantly with the susceptibilityprofiles of each of the 55 anticancer drugs evaluated. Positivelycorrelating genes are likely to reflect chemosensitivity as the higherthe expression level of these genes, the lower the GI50. Similarly,genes that showed negative correlations are likely to reflect chemore-sistance. Thus, the expression levels of these genes may be useful aspredictive markers of drug effectiveness.

Our database, comprising both gene expression data and drugactivity data for the same set of cell lines, was first developed by Rosset al. and Scherf et al. (9, 10) using 60 human cancer cell lines(NCI60). These studies examined gene expression profiles in 60 celllines that were pharmacologically characterized on treatment withvarious kinds of anticancer drugs. The number of cell lines used in thepresent study was 39. Of these, 9 cell lines (6 derived from gastriccancers and 3 from breast cancers) were not included in the NCI60panel (4). Using this different set of 39 cell lines, we performedhierarchical clustering with respect to 55 anticancer drugs on the basisof growth inhibition and Pearson correlation coefficients betweendrugs and genes. As shown in Figs. 1 and 4, drugs with similarmechanisms of action clustered most of the time, in agreement withresults of an earlier study that used a different set of cell lines (10).Moreover, we have shown previously that the mode of action of anovel compound can be predicted by examining its growth inhibitoryactivities against the panel of 39 cell lines (2, 8). This indicated thatour cell line panel had sufficient power to classify drugs according to

mode of action. The previous study by Scherf et al. (10) showedexpected correlations, such as that between 5-fluorouracil and dihy-dropyrimidine dehydrogenasae (DPYD). A moderately negative cor-relation between 5-fluorouracil and TYMS was demonstrated in ourstudy (r 0.29, P � 0.1). In addition, a stronger negative corre-lation between 5-fluorouracil derivative carmofur and TYMS was alsoobserved (r 0.38, P � 0.01; data not shown). Furthermore,positive correlations between the DNA topo II inhibitor etoposide andDNA topo II � (TOP2A, r 0.39, P � 0.05) and � (TOP2B, r 0.29,P � 0.1) were also found (data not shown). Although the Pearsoncorrelation coefficients were not high, together these moderate corre-lations might be sufficient as the sensitivity of cancer cells to anti-cancer drugs appears not to be determined by the expression of asingle gene.

Clustering analysis of cell lines revealed that overall profiles ofgene expression in cancer cells reflected their tissues of origin (Fig.2). Therefore, some of the retrieved genes preferentially expressed incells derived from specific tissues, e.g., keratin genes related toepithelial cell type (data not shown). Similarly, some of the drugs usedin our study tended to show tissue-oriented efficacy (Fig. 1). There-fore, some of the genes reflecting cells’ tissue of origin correlatingwith chemosensitivity might be associated with sensitivity to suchdrugs. On that occasion, the correlation should be validated withineach set of cell lines derived from the same tissues. On the other hand,there was a substantial population of genes that was not stronglyassociated with tissue of origin, including AKR1B1, LIMK2, andDDB2 (data not shown). These genes are conceivably good candidatesfor predictive markers of drug efficacy.

Clustering analysis of the drugs on the basis of Pearson correlationcoefficients between drug activity and gene expression levels revealedthat drugs with similar modes of action clustered into the same groupsand that the sets of chemosensitivity-associated genes were similarbetween each drug within a group (Fig. 4). Thus, we were able toretrieve genes that commonly correlated within each of the groups ofdrugs with similar functions (Fig. 5, A–D, data not shown), e.g., weidentified the genes encoding survivin (BIRC5) and C-IAP1 of apop-tosis1 (BIRC2) that revealed negative correlations with 5-fluorouracilderivatives, although survivin showed either no significant negativecorrelations or moderately positive correlations to the other genotoxicanticancer drugs (Fig. 5D). The inhibitor of apoptosis family ofproteins, including survivin, has been shown to play important roles inthe suppression of apoptosis and is conceivably a chemoresistancefactor (15, 16). Thus, the most important aspect of our analysis is thatwe were able to retrieve different gene sets that may be used asputative diagnostic markers for chemosensitivity prediction with re-spect to each class of drugs with similar mechanisms of action. Thenext step will be to confirm the predictive power of our findings. Ina recently published study, Staunton et al. (17) identified putativepredictive markers of chemosensitivity and showed the feasibility ofchemosensitivity prediction by transcriptional profiling. In our currentstudy, using a different methodology, we could narrow down thenumber of candidate genes from thousands to dozens that predictchemosensitivity, although we have not completed to evaluate ourfindings. The validation studies using dozens of cell lines outside thepanel are under way.

In summary, we have identified different sets of genes that may actas predictive markers for chemosensitivity to drugs with similarmechanisms of action. Data-mining methodologies described here andelsewhere (17) may be useful to develop systems to select suitablechemotherapies in the future. However, statistical correlations are notthe proof of causal relationships between gene expression and che-mosensitivity. If these causal relationships can be demonstrated ex-perimentally, the resultant gene products may explain the mechanisms

1146

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 9: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

of anticancer drugs. In addition, to elucidate the mechanisms ofanticancer drugs, it is also necessary to determine gene expressionchanges after administration of drugs (18). These studies will help usto discover new targets of cancer chemotherapy, as well as predictivemarkers for conventional drugs.

ACKNOWLEDGMENTS

We thank Dr. Jun-ichi Okutsu for helpful discussions and Hiroko Bando,Noriko Nemoto, and Noriko Sudo for the fabrication of cDNA microarray. Wealso thank Yumiko Nishimura, Mariko Seki, Keiko Sato, Fujiko Ohashi, YokoYoshida, Naohide Sunose, Sachiko Udagawa, and Akiko Kuramochi-Komi forthe determination of chemosensitivity.

REFERENCES

1. Chen, C. J., Chin, J. E., Ueda, K., Clark, D. P., Pastan, I., Gottesman, M. M., andRoninson, I. B. Internal duplication and homology with bacterial transport proteins inthe mdr1 (P-glycoprotein) gene from multidrug-resistant human cells. Cell, 47:381–389, 1986.

2. Yamori, T., Matsunaga, A., Sato, S., Yamazaki, K., Komi, A., Ishizu, K., Mita, I.,Edatsugi, H., Matsuba, Y., Takezawa, K., Nakanishi, O., Kohno, H., Nakajima, Y.,Komatsu, H., Andoh, T., and Tsuruo, T. Potent antitumor activity of MS-247, a novelDNA minor groove binder, evaluated by an in vitro and in vivo human cancer cell linepanel. Cancer Res., 59: 4042–4049, 1999.

3. Ono, K., Tanaka, T., Tsunoda, T., Kitahara, O., Kihara, C., Okamoto, A., Ochiai, K.,Takagi, T., and Nakamura, Y. Identification by cDNA microarray of genes involvedin ovarian carcinogenesis. Cancer Res., 60: 5007–5011, 2000.

4. Monks, A., Scudiero, D., Skehan, P., Shoemaker, R., Paull, K., Vistica, D., Hose, C.,Langley, J., Cronise, P., Vaigro-Wolff, A., et al. Feasibility of a high-flux anticancerdrug screen using a diverse panel of cultured human tumor cell lines. J. Natl. CancerInst. (Bethesda), 83: 757–766, 1991.

5. Kitahara, O., Furukawa, Y., Tanaka, T., Kihara, C., Ono, K., Yanagawa, R., Nita,M. E., Takagi, T., Nakamura, Y., and Tsunoda, T. Alterations of gene expressionduring colorectal carcinogenesis revealed by cDNA microarrays after laser-capturemicrodissection of tumor tissues and normal epithelia. Cancer Res., 61: 3544–3549,2001.

6. Luo, L., Salunga, R. C., Guo, H., Bittner, A., Joy, K. C., Galindo, J. E., Xiao, H.,Rogers, K. E., Wan, J. S., Jackson, M. R., and Erlander, M. G. Gene expressionprofiles of laser-captured adjacent neuronal subtypes. Nat. Med., 5: 117–122, 1999.

7. Van Gelder, R. N., von Zastrow, M. E., Yool, A., Dement, W. C., Barchas, J. D., andEberwine, J. H. Amplified RNA synthesized from limited quantities of heterogeneouscDNA. Proc. Natl. Acad. Sci. USA, 87: 1663–1667, 1990.

8. Naasani, I., Seimiya, H., Yamori, T., and Tsuruo, T. FJ5002: a potent telomeraseinhibitor identified by exploiting the disease-oriented screening program with COM-PARE analysis. Cancer Res., 59: 4004–4011, 1999.

9. Ross, D. T., Scherf, U., Eisen, M. B., Perou, C. M., Rees, C., Spellman, P., Iyer, V.,Jeffrey, S. S., Van de Rijn, M., Waltham, M., Pergamenschikov, A., Lee, J. C.,Lashkari, D., Shalon, D., Myers, T. G., Weinstein, J. N., Botstein, D., and Brown,P. O. Systematic variation in gene expression patterns in human cancer cell lines. Nat.Genet., 24: 227–235, 2000.

10. Scherf, U., Ross, D. T., Waltham, M., Smith, L. H., Lee, J. K., Tanabe, L., Kohn,K. W., Reinhold, W. C., Myers, T. G., Andrews, D. T., Scudiero, D. A., Eisen, M. B.,Sausville, E. A., Pommier, Y., Botstein, D., Brown, P. O., and Weinstein, J. N. A geneexpression database for the molecular pharmacology of cancer. Nat. Genet., 24:236–244, 2000.

11. Bohren, K. M., Bullock, B., Wermuth, B., and Gabbay, K. H. The aldo-keto reductasesuperfamily. cDNAs and deduced amino acid sequences of human aldehyde andaldose reductases. J. Biol. Chem., 264: 9547–9551, 1989.

12. Tang, J. Y., Hwang, B. J., Ford, J. M., Hanawalt, P. C., and Chu, G. Xerodermapigmentosum p48 gene enhances global genomic repair and suppresses UV-inducedmutagenesis. Mol. Cell, 5: 737–744, 2000.

13. Maekawa, M., Ishizaki, T., Boku, S., Watanabe, N., Fujita, A., Iwamatsu, A., Obinata,T., Ohashi, K., Mizuno, K., and Narumiya, S. Signaling from Rho to the actincytoskeleton through protein kinases ROCK and LIM-kinase. Science (Wash. DC),285: 895–898, 1999.

14. Pinkel, D. Cancer cells, chemotherapy and gene clusters. Nat. Genet., 24: 208–209,2000.

15. Ambrosini, G., Adida, C., and Altieri, D. C. A novel anti-apoptosis gene, survivin,expressed in cancer and lymphoma. Nat. Med., 3: 917–921, 1997.

16. Green, D. R. Apoptotic pathways: paper wraps stone blunts scissors. Cell, 102: 1–4,2000.

17. Staunton, J. E., Slonim, D. K., Coller, H. A., Tamayo, P., Angelo, M. J., Park, J.,Scherf, U., Lee, J. K., Reinhold, W. O., Weinstein, J. N., Mesirov, J. P., Lander, E. S.,and Golub, T. R. Chemosensitivity prediction by transcriptional profiling. Proc. Natl.Acad. Sci. USA, 98: 10787–18792, 2001.

18. Dan, S., and Yamori, T. Repression of cyclin B1 expression after treatment withadriamycin, but not cisplatin in human lung cancer A549 cells. Biochem. Biophys.Res. Commun., 280: 861–867, 2001.

1147

CHEMOSENSITIVITY AND GENE EXPRESSION OF CANCER CELLS

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from

Page 10: An Integrated Database of Chemosensitivity to 55 ...[CANCER RESEARCH 62, 1139–1147, February 15, 2002] An Integrated Database of Chemosensitivity to 55 Anticancer Drugs and Gene

2002;62:1139-1147. Cancer Res   Shingo Dan, Tatsuhiko Tsunoda, Osamu Kitahara, et al.   LinesDrugs and Gene Expression Profiles of 39 Human Cancer Cell An Integrated Database of Chemosensitivity to 55 Anticancer

  Updated version

  http://cancerres.aacrjournals.org/content/62/4/1139

Access the most recent version of this article at:

   

   

  Cited articles

  http://cancerres.aacrjournals.org/content/62/4/1139.full#ref-list-1

This article cites 18 articles, 8 of which you can access for free at:

  Citing articles

  http://cancerres.aacrjournals.org/content/62/4/1139.full#related-urls

This article has been cited by 27 HighWire-hosted articles. Access the articles at:

   

  E-mail alerts related to this article or journal.Sign up to receive free email-alerts

  Subscriptions

Reprints and

  [email protected] at

To order reprints of this article or to subscribe to the journal, contact the AACR Publications

  Permissions

  Rightslink site. Click on "Request Permissions" which will take you to the Copyright Clearance Center's (CCC)

.http://cancerres.aacrjournals.org/content/62/4/1139To request permission to re-use all or part of this article, use this link

on June 4, 2020. © 2002 American Association for Cancer Research. cancerres.aacrjournals.org Downloaded from