Keywords
classification, GBM, Glioblastoma multiforme, survival, prognosis, cancer, microarray, DNA methylation, somatic mutation, machine learning, data mining, bioinformatics
Abstract
Glioblastoma multiforme (GBM), a highly aggressive form of brain cancer, results in a median survival of 12–15 months. For decades, researchers have explored the effects of clinical and molecular factors on this disease and have identified several candidate prognostic markers. In this study, we evaluated the use of multivariate classification models for differentiating between subsets of patients who survive a relatively long or short time. Data for this study came from The Cancer Genome Atlas (TCGA), a public repository containing clinical, treatment, histological and biomolecular variables for hundreds of patients. We applied variable-selection and classification algorithms in a cross-validated design and observed that predictive performance of the resulting models varied substantially across the algorithms and categories of data. The best-performing models were based on age, treatments and global DNA methylation. In this paper, we summarise our findings, discuss lessons learned in analysing TCGA data and offer recommendations for performing such analyses.
Original Publication Citation
Piccolo SR, Frey LJ. “Clinical and molecular models of glioblastoma multiforme survival.” International Journal of Data Mining and Bioinformatics 2013, 7:3, 245-265
BYU ScholarsArchive Citation
Stephen, R. Piccolo and Lewis, J. Frey, "Clinical and Molecular Models of Glioblastoma Multiforme Survival" (2013). Faculty Publications. 7506.
https://scholarsarchive.byu.edu/facpub/7506
Document Type
Peer-Reviewed Article
Publication Date
2013-04-17
Publisher
Inderscience
Language
English
College
Life Sciences
Department
Biology
Copyright Use Information
https://lib.byu.edu/about/copyright/