Keywords

classification, GBM, Glioblastoma multiforme, survival, prognosis, cancer, microarray, DNA methylation, somatic mutation, machine learning, data mining, bioinformatics

Abstract

Glioblastoma multiforme (GBM), a highly aggressive form of brain cancer, results in a median survival of 12–15 months. For decades, researchers have explored the effects of clinical and molecular factors on this disease and have identified several candidate prognostic markers. In this study, we evaluated the use of multivariate classification models for differentiating between subsets of patients who survive a relatively long or short time. Data for this study came from The Cancer Genome Atlas (TCGA), a public repository containing clinical, treatment, histological and biomolecular variables for hundreds of patients. We applied variable-selection and classification algorithms in a cross-validated design and observed that predictive performance of the resulting models varied substantially across the algorithms and categories of data. The best-performing models were based on age, treatments and global DNA methylation. In this paper, we summarise our findings, discuss lessons learned in analysing TCGA data and offer recommendations for performing such analyses.

Original Publication Citation

Piccolo SR, Frey LJ. “Clinical and molecular models of glioblastoma multiforme survival.” International Journal of Data Mining and Bioinformatics 2013, 7:3, 245-265

Document Type

Peer-Reviewed Article

Publication Date

2013-04-17

Publisher

Inderscience

Language

English

College

Life Sciences

Department

Biology

University Standing at Time of Publication

Associate Professor

Included in

Biology Commons

Share

COinS