prostate {integrativeME} | R Documentation |
Clinical and a subset of gene expression data from the Stephenson et al. (2005) study.
data(prostate)
A list containing the following components:
type
cont
indep
MElogreg
or MEindep
model in the mixture of experts methodology.loc
MEloc
model in the mixture of experts methodology.loc.ind
The data set from Stephenson et al. (2005) was built from tissue samples obtained from 79 patients all treated by radical prostatectomy. There were 37 samples which were classified as recurrent and 42 as non-recurrent primary prostate tumor. Samples were snap frozen and gene expression analysis was carried out using the Affymetrix U133A human gene array which has 22,283 features. After a prefiltering step, the analyzed data set contained 7,884 features. The clinical data and microarray data were measured on the same set of 79 patients.
For the location model, variables 'semi-vesicle invasion' and 'lymph node involvement' were merged into a single categorical variable (called the location variable).
The data set was obtained upon request to the authors of the study. The original data with 7,884 transcripts can be downloaded as an .RData file from http://www.math.univ-toulouse.fr/~lecao/package.html
Stephenson, A.J., Smith, A., Kattan, M.W., Satagopan, J., Reuter, V.E., Scardino, P.T. and Gerald, W.L. (2005). Integration of gene expression profiling and clinical variables to predict prostate carcinoma recurrence after radical prostatectomy. Cancer, 104, 2, 290-298.
Hunt, L. and Jorgensen, M. (1999). Mixture model clustering using the MULTIMIX program. Australian & New Zealand Journal of Statistics, 41, 2, 154–171.