[CANCER RESEARCH 64, 7201–7204, October 15, 2004]
Advances in Brief
A Gene Expression Signature Associated with Metastatic Outcome in
Human Leiomyosarcomas
Yin-Fai Lee,
1
Megan John,
1
Alison Falconer,
1
Sandra Edwards,
1
Jeremy Clark,
1
Penny Flohr,
1
Toby Roe,
1
Rubin Wang,
1
Janet Shipley,
1
Robert J. Grimer,
2
D. Chas Mangham,
2
J. Meirion Thomas,
3
Cyril Fisher,
3
Ian Judson,
1
and Colin S. Cooper
1
1
The Male Urological Cancer Research Centre, Institute of Cancer Research, Surrey;
2
Department of Musculoskeletal Pathology, The Royal Orthopedic Hospital National Health
Service Trust, Birmingham; and
3
Department of Histopathology, The Royal Marsden National Health Service Trust, London, United Kingdom
Abstract
Metastasis is a major factor associated with poor prognosis in cancer,
but little is known of its molecular mechanisms. Although the clinical
behavior of soft tissue sarcomas is highly variable, few reliable determi-
nants of outcome have been identified. New markers that predict clinical
outcome, in particular the ability of primary tumors to develop metastatic
tumors, are urgently needed. Here, we have chosen leiomyosarcoma as a
model for examining the relationship between gene expression profile and
the development of metastasis in soft tissue sarcomas. Using cDNA mi-
croarray, we have identified a gene expression signature associated with
metastasis in sarcoma that allowed prediction of the future development
of metastases of primary tumors (Kaplan-Meier analysis P 0.001). Our
finding may aid the tailoring of therapy for individual sarcoma patients,
where the aggressiveness of treatment is affected by the predicted outcome
of disease.
Introduction
Soft tissue tumors are a heterogeneous group of mesenchymal
tumors that arise as soft tissue masses and that frequently exhibit the
differentiated features of adult soft tissue (1). Major histologic cate-
gories of malignant soft tissue sarcomas include leiomyosarcoma
(smooth muscle), rhabdomyosarcoma (striated muscle), liposarcoma
(fatty tissue), synovial sarcoma, and malignant fibrous histiocytoma.
The disease accounts for 1% of all cancers and is associated with a
substantial mortality rate of 50%, which is related in part to its
propensity for metastasis (1, 2). The clinical behavior of soft tissue
sarcomas is highly variable, but few reliable determinants of outcome
have been identified (2, 3). New markers that predict clinical outcome,
in particular the propensity of primary tumors to develop metastatic
tumors, are urgently needed and would be of great clinical use,
allowing for more selective treatment strategies. In this study, we have
chosen leiomyosarcoma as a model to assess the relationship between
gene expression profiles determined on cDNA microarray and the
clinical outcome of metastasis in soft tissue sarcomas.
Materials and Methods
Microarray Procedures. Sarcoma tissue samples were collected from
patients undergoing surgery at the Royal Marsden National Health Service
Trust, London and the Royal Orthopedic Hospital National Health Service
Trust, Birmingham, United Kingdom. Diagnoses were performed by patholo-
gists with conventional criteria, immunohistochemistry, and electron micros-
copy. This study was done with approval from our local ethics committee.
Tumor and control RNA preparation, cDNA microarray slide preparation,
RNA labeling, and microarray hybridization were performed as in Lee et al.
(4). Hybridized microarray slides were scanned in a GenePix 4000B scanner
(Axon Instruments, Foster City, CA). Slides were scanned at photomultiplier
tube voltage levels that provided a Cy5:Cy3 hybridization ratio across the slide
of 1. We used the GenePix Pro 3.0.6 software (Axon Instruments) to
determine ratios of fluorescent intensities (Cy5:Cy3) for individual cDNA after
subtraction of background. We had previously established the reliability of
these microarray procedures; 12 of 12 genes showing alterations in expression
in microarray exhibited similar alterations when examined by Northern blot
analyses (5).
Analysis of Microarray Data. The scanned image was analyzed with the
GenePix Pro 3.0.6 software (Axon Instruments). Fluorescent signals for both
channels of the spots were determined. A local background in each channel
was also determined for each spot, which is the median fluorescence of pixels
in a halo surrounding the same array spot. Spots or areas of array with defects
were flagged bad and excluded from subsequent analysis. To enhance the
reliability of the expression data, another round of quality filtering was done.
Spots with fluorescent spot intensity in each channel, which were 1.4 times
the local background (medians) of that channel, were considered well meas-
ured (6), and the data were additionally filtered to include only these spots. The
median background intensity was subtracted from the median spot intensity
to generate the background-corrected signal intensity for use in additional
analysis.
We used the GeneSpring software (Silicon Genetics, Redwood City, CA) to
carry out additional microarray analyses. BRB-ArrayTools (Biometric Re-
search Branch, National Cancer Institute, Bethesda, MD) was used for class
comparison analyses (univariate F test/two-sample t test). Fluorescent intensity
ratios of Cy5:Cy3 for individual spots of the filtered data were determined by
dividing the background-corrected intensity for the Cy5 by that of the Cy3
channel. These ratios were then normalized by making the median of all
measurements in each sample to be 1. Genes that have expression data in less
than half of the samples were filtered out before the class comparison analysis.
The samples were log
2
-transformed, and we compared the gene expressions of
the 20 primary tumors (P) and 7 metastatic tumors (M) to find genes differ-
entially expressed between the two classes using supervised class comparison
analysis with a univariate F test (two-sample t test) with randomized variance
model and multivariate permutation tests to control the number of false
discoveries (based on 1000 random permutations of the class labels of the
experiments and controlling the number of false positive to be 30 genes 50%
of the time, univariate P equals 0.0122).
Two-dimensional hierarchical clustering was then applied to the log-trans-
formed data with average-linkage clustering with Pearson correlation around
zero as the similarity metric for the 335 genes identified as differentially
expressed between primary and metastatic sarcomas. This analysis divided
thirty nonmetastatic tumors (P, PM, and LR) into two categories (groups 1 and
2). It is considered unlikely that exposure to chemotherapy would influence our
analysis because only a single patient received chemotherapy that finished 5
weeks before surgery.
We refined the 335 differentially expressed gene list with two different
supervised learning methods to find a reduced set of discriminating genes best
Received 5/12/04; revised 8/20/04; accepted 8/31/04.
Grant support: Cancer Research UK and the Alexander Boag Sarcoma Fund.
The costs of publication of this article were defrayed in part by the payment of page
charges. This article must therefore be hereby marked advertisement in accordance with
18 U.S.C. Section 1734 solely to indicate this fact.
Note: Additional minimum information about microarray experiment (MIAME) com-
pliant data will be available at ArrayExpress (www.ebi.ac.uk/arrayexpress) at a later date.
Requests for reprints: Yin-Fai Lee, The Male Urological Cancer Research Centre,
Institute of Cancer Research, 15 Cotswold Road, Belmont, Sutton, Surrey SM2 5NG,
United Kingdom. E-mail: Yin-Fai.Lee@icr.ac.uk.
©2004 American Association for Cancer Research.
7201
Research.
on November 18, 2015. © 2004 American Association for Cancer cancerres.aacrjournals.org Downloaded from