Additional file 3.
A table listing tissue-specific predictors of clinical characteristics based on gene expression in adjacent epithelium. The poor quality of the predictors is readily visible from the predictors' error rates in the first column of the table. The error rate is the fraction of times the predictor misclassifies a sample under cross-validation. Predictors were trained using gene sets obtained from class distinction with SAM or LIMMA. For some combinations of clinical characteristic and class distinction algorithm, no genes passed the filtering criteria and no predictor could be trained; those rows are omitted from the table. The gene set size is the initial size of the candidate gene set from which a predictor is built; this set is also selected under cross-validation. The training error is the misclassification rate for samples included in the training set. The PAM cross-validation error rate, as reported by the PAM algorithm, does not account for the selection of the candidate gene set under cross-validation. The predictor size is the number of genes in the predictor.
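The difference between an error rate that includes candidate gene selection in cross-validation and one that does not can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' pipeline: a simple ranking by difference of class means stands in for SAM or LIMMA, a plain nearest-centroid rule stands in for PAM, and all function and parameter names (`select_genes`, `cv_error`, `select_inside`, and so on) are hypothetical.

```python
import random
from statistics import mean


def select_genes(X, y, k):
    """Rank genes by absolute difference of class means; return top-k indices.

    Assumption: a crude stand-in for a class distinction method such as SAM or LIMMA.
    """
    scores = []
    for g in range(len(X[0])):
        a = [row[g] for row, lab in zip(X, y) if lab == 0]
        b = [row[g] for row, lab in zip(X, y) if lab == 1]
        scores.append((abs(mean(a) - mean(b)), g))
    scores.sort(reverse=True)
    return [g for _, g in scores[:k]]


def fit_centroids(X, y, genes):
    """Compute one centroid per class over the selected genes."""
    centroids = {}
    for lab in (0, 1):
        rows = [row for row, l in zip(X, y) if l == lab]
        centroids[lab] = [mean(r[g] for r in rows) for g in genes]
    return centroids


def predict(centroids, genes, row):
    """Assign the class whose centroid is nearest (squared Euclidean distance)."""
    def dist(lab):
        return sum((row[g] - c) ** 2 for g, c in zip(genes, centroids[lab]))
    return min(centroids, key=dist)


def cv_error(X, y, k_genes, n_folds=5, select_inside=True, seed=0):
    """Fraction of samples misclassified when held out.

    select_inside=True  : genes are re-selected from each training fold.
    select_inside=False : genes are selected once from all samples, so the
                          held-out samples have already influenced the gene set.
    """
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::n_folds] for i in range(n_folds)]
    genes_all = select_genes(X, y, k_genes)
    errors = 0
    for fold in folds:
        held = set(fold)
        train = [i for i in idx if i not in held]
        X_tr = [X[i] for i in train]
        y_tr = [y[i] for i in train]
        genes = select_genes(X_tr, y_tr, k_genes) if select_inside else genes_all
        centroids = fit_centroids(X_tr, y_tr, genes)
        errors += sum(predict(centroids, genes, X[i]) != y[i] for i in fold)
    return errors / len(X)


if __name__ == "__main__":
    # Pure-noise data: no gene carries real information about the labels.
    rng = random.Random(1)
    y = [i % 2 for i in range(40)]
    X = [[rng.gauss(0, 1) for _ in range(500)] for _ in y]
    print("selection inside CV :", cv_error(X, y, 10, select_inside=True))
    print("selection outside CV:", cv_error(X, y, 10, select_inside=False))
```

On data with no real signal, selecting genes from all samples before cross-validation tends to give an optimistically low error, whereas repeating the selection within each fold tends to give an error near chance. The exact numbers depend on the random data and are not meant to reproduce any value in the table.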
Format: PDF Size: 23KB
Finak et al. Breast Cancer Research 2006 8:R58 doi:10.1186/bcr1608