Prognostic Gene Discovery in Glioblastoma Patients using Deep Learning

Wong, Kelvin K.; Rostomily, Robert; Wong, Stephen T. C.

doi:10.3390/cancers11010053

by Kelvin K. Wong ¹,²,³,*, Robert Rostomily ⁴ and Stephen T. C. Wong ¹,³,⁵,⁶

¹ Department of Systems Medicine and Bioengineering, Houston Methodist, Houston, TX 77030, USA

² Department of Neurological Surgery, Weill Cornell Medicine, New York, NY 10065, USA

³ Department of Radiology, Weill Cornell Medicine, New York, NY 10065, USA

⁴ Department of Neurosurgery, Houston Methodist Neurological Institute, Houston, TX 77030, USA

⁵ Department of Neuroscience, Weill Cornell Medicine, New York, NY 10065, USA

⁶ Department of Pathology and Laboratory Medicine, Weill Cornell Medicine, New York, NY 10065, USA

* Author to whom correspondence should be addressed.

Cancers 2019, 11(1), 53; https://doi.org/10.3390/cancers11010053

Submission received: 14 November 2018 / Revised: 16 December 2018 / Accepted: 24 December 2018 / Published: 8 January 2019

(This article belongs to the Special Issue Glioblastoma: State of the Art and Future Perspectives)

Abstract

This study aims to discover genes with prognostic potential for glioblastoma (GBM) patients’ survival in a patient group that has gone through standard of care treatments including surgeries and chemotherapies, using tumor gene expression at initial diagnosis before treatment. The Cancer Genome Atlas (TCGA) GBM gene expression data are used as inputs to build a deep multilayer perceptron network to predict patient survival risk using partial likelihood as loss function. Genes that are important to the model are identified by the input permutation method. Univariate and multivariate Cox survival models are used to assess the predictive value of deep learned features in addition to clinical, mutation, and methylation factors. The prediction performance of the deep learning method was compared to other machine learning methods including the ridge, adaptive Lasso, and elastic net Cox regression models. Twenty-seven deep-learned features are extracted through deep learning to predict overall survival. The top 10 ranked genes with the highest impact on these features are related to glioblastoma stem cells, stem cell niche environment, and treatment resistance mechanisms, including POSTN, TNR, BCAN, GAD1, TMSB15B, SCG3, PLA2G2A, NNMT, CHI3L1 and ELAVL4.

Keywords: deep learning; discovery; glioblastoma; glioblastoma stem cells; survival prediction

1. Introduction

Deep learning [1,2,3,4,5,6,7] has been used to learn prognostic subtypes of glioblastoma using pan-cancer gene expression data from The Cancer Genome Atlas (TCGA) [8], to predict drug synergy from cancer cell gene expression data [9], and to predict survival from integrated multi-omics data in liver cancer [10], among other applications [11,12,13]. Explaining a deep learning model in a meaningful way is important for understanding both the model and its limitations. A typical deep learning model involves millions of parameters, which makes it difficult to interpret. We propose to use feature importance ranking within the deep learning model. While feature importance ranking is popular in machine learning [14], its use within deep learning models is rare, especially in cancer genomics, where a model usually includes thousands of features. In this paper, we extend the permutation feature importance technique to deep learning. Our goal is to study the inside of a trained deep learning model to discover prognostic gene features in glioblastoma.

Conventional machine learning approaches have been used to identify genes whose expression is prognostic of glioblastoma patient survival [15,16,17]. Glioblastoma gene expression regression modeling using a least absolute shrinkage and selection operator (Lasso)-type strategy performed better when cancer pathway genes, rather than the whole genome, were used as input variables [16]. However, these penalized regression methods often require dropping a large number of genes in order to fit the survival outcome, which hinders biological pathway interpretation and introduces random bias toward the selected gene factors. Deep learning offers the capacity to model a large number of differentially expressed genes, is less susceptible to the multicollinearity problem, and generalizes better. Deep learning based on transcriptome data has only recently been used to determine the primary effects of gene features that are prognostic of survival in glioblastoma (GBM) or other cancer types [8,18]. However, few studies have examined what was learned in these deep learning models.

Using differentially expressed genes from the GBM-specific TCGA database as inputs, we tested the hypothesis that deep learning can model the relationship between specific genes and the corresponding protein effect to predict patient survival prognosis. Like most deep learning models, our model learned a set of features at the last hidden layer, which in this case linearly modulates the survival risk of patients. We hypothesize that these features contain the key factors that determine overall survival for patients who have gone through standard of care therapy, from surgery to chemotherapy, without undergoing targeted therapy.

2. Results

2.1. Deep Learning Model

We trained and optimized the deep learning model to generate a network architecture consisting of an input layer feeding to two hidden layers (82 nodes then 27 nodes) and connected to one single output node to predict patient survival prognosis.
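The architecture described above can be sketched as a plain forward pass. This is a minimal illustration under stated assumptions, not the authors' TensorFlow implementation: it takes the 3581 input probes reported in Section 4.1, omits batch normalization, and uses randomly initialized weights; the helper names `init_layer`, `dense`, and `forward` are hypothetical.

```python
import random


def relu(values):
    """Rectified linear unit applied element-wise."""
    return [max(0.0, v) for v in values]


def dense(inputs, weights, biases):
    """Fully connected layer: one weighted sum plus bias per output node."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def init_layer(n_in, n_out, rng):
    """Random weights for illustration only (not trained values)."""
    weights = [[rng.gauss(0.0, 0.05) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def forward(expression, layers):
    """Return (last hidden layer outputs, scalar risk score)."""
    hidden = expression
    for weights, biases in layers[:-1]:
        hidden = relu(dense(hidden, weights, biases))
    out_weights, out_biases = layers[-1]
    # The output node is a linear combination of the last hidden layer.
    risk = dense(hidden, out_weights, out_biases)[0]
    return hidden, risk


rng = random.Random(0)
sizes = [3581, 82, 27, 1]  # input probes, hidden layer 1, hidden layer 2, output
layers = [init_layer(n_in, n_out, rng) for n_in, n_out in zip(sizes, sizes[1:])]
patient = [rng.gauss(0.0, 1.0) for _ in range(sizes[0])]  # synthetic expression profile
features, risk = forward(patient, layers)
print(len(features), risk)
```

The 27 values in `features` correspond to the deep-learned network nodes that are later entered into the Cox models.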

The validation concordance index is 0.69 and the corresponding training concordance index is 0.73. When each sample partition is used in turn as the validation set, the validation concordance index has a mean ± 1 SD of 0.70 ± 0.07. The out-of-sample testing concordance index is 0.63, which is within the uncertainty of the validation concordance index. The 95% confidence intervals of the testing, validation, and training concordance indexes do not include 0.5.
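For readers unfamiliar with the metric, the following is a minimal sketch of Harrell's concordance index. It assumes that a higher predicted risk should correspond to shorter survival, counts ties in predicted risk as half-concordant, and ignores tied survival times; the source does not state which variant was used, so the function `concordance_index` is illustrative only.

```python
def concordance_index(times, events, risks):
    """Fraction of comparable patient pairs ranked correctly by predicted risk.

    A pair (i, j) is comparable when patient i had an observed event
    and a shorter follow-up time than patient j.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        if not events[i]:
            continue  # a censored patient cannot anchor a comparable pair
        for j in range(n):
            if times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0
                elif risks[i] == risks[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable


times = [5, 10, 15, 20]
events = [1, 1, 0, 1]
perfect = concordance_index(times, events, [2.0, 1.5, 0.2, 0.9])
uninformative = concordance_index(times, events, [1.0, 1.0, 1.0, 1.0])
print(perfect, uninformative)
```

A value of 0.5 corresponds to random ranking, which is why the text checks whether the confidence intervals include 0.5.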

Important genes that contribute to the overall deep-learned model were identified by the input permutation method. A frequency analysis of genes occurring in the last hidden layer is listed in Table 1. The top 10 ranked genes are either known to be associated with glioblastoma survival, glioblastoma cancer cell migration, or glioblastoma cancer stem cells, or are known to be related to other types of cancer with mechanistic significance. The top 10 genes are POSTN, TNR, BCAN, GAD1, TMSB15B, SCG3, PLA2G2A, NNMT, CHI3L1, and ELAVL4. The ranked gene importance at each network node is available in Supplementary Table S1. The entire frequency analysis table is available in Supplementary Table S2.

In the deep-learned model, no proportional hazard assumption is used. Since the outputs of the last hidden layer network nodes are combined linearly with a weight vector to predict patient survival prognosis, the weight vector determines the relative risk contributed by each of the 27 network nodes. Entering these network nodes into a Cox proportional hazard model gave a model concordance of 0.71, with 14 out of 27 network nodes statistically significant (Wald test, p < 0.05); only network node 10 needed to be stratified for all network nodes to satisfy the proportional hazard assumption at p < 0.05 (Table 2).

2.2. Deep Learning Model Performance Comparison with Penalized Cox Regression Models

Using the training dataset, k-fold cross-validation training produced the ridge, adaptive Lasso, and elastic net Cox regression models. These models performed consistently well, with concordance indexes of 0.70, 0.69, and 0.70, respectively, comparable to the deep learning model. In the testing dataset, however, the ridge, adaptive Lasso, and elastic net Cox regression models had concordance indexes of 0.58, 0.56, and 0.56, respectively, with 95% confidence intervals that include 0.5. In other words, these models performed close to random and failed to predict survival in the testing dataset, which could be due to the challenge of prediction in a small dataset of 49 patients. This is in sharp contrast to the deep learning model, which still performs well in the small testing dataset.

2.3. Network Node Parameters Improved the Baseline Cox Proportional Hazard Model

A baseline Cox proportional hazard survival model constructed with clinical covariates, including age, gender, Karnofsky Performance Status (KPS), tumor subtypes, and therapy options performed at a concordance value of 0.68. Age, gender, KPS, chemoradiation, and proneural subtype all reached statistical significance using Wald test at p < 0.05, with no parameters violating the proportional hazard assumption at p < 0.05. Since most patients with the Glioma CpG island methylator phenotype (G-CIMP) mutation also harbor the IDH1 heterozygous Arg132-to-His (R132H) point mutation, the multicollinearity between these two variables (correlation coefficient = 0.845) led to large standard error on the parameter estimation, affecting statistical significance.

After adding all deep-learned network nodes, the baseline Cox model (Table 3) concordance index increased from 0.68 to 0.76 with age, KPS, chemoradiation, and proneural subtype statistically significant (Wald test, p < 0.05); four risk-increasing network nodes (8, 17, 24, and 25) and four risk-decreasing nodes (4, 10, 16, and 23) were also statistically significant (Wald test, p < 0.05), with all covariates satisfying the proportional hazard assumption.

2.4. Prognostic Significance Validation of Gene Set with External Data

A gene is much more likely to be deemed important to a small number of network nodes than to a large number of nodes, as shown in Figure 1. We used a probability threshold of p < 0.01 and selected a set of genes that are important in at least 10 out of 27 network nodes. These important genes form a 39-gene signature and are listed in Table 4. Using this gene signature, statistical significance (log-rank test, p < 1.5 × 10⁻⁶) was achieved in seven glioblastoma studies and three low-grade glioma studies in separating the low-risk and high-risk groups, with a mean ± 1 SD concordance index of 0.79 ± 0.09. The Kaplan–Meier survival curves of the low-risk and high-risk groups in these studies are shown in Figure 2. There is no notable pathway enrichment in the 39-gene signature.

In the discovery process, two genes stand out with a high hazard ratio (HR) and concordance index (CI) in the univariate Cox model: AQP1 (HR = 3.3, p < 0.05, CI = 0.711) and MEOX2 (HR = 2.5, p < 0.05, CI = 0.675), both of which are less well known to be associated with survival in gliomas. Both genes are statistically significant and independently prognostic of survival in the multivariate Cox model (MEOX2 HR = 2.2, AQP1 HR = 1.55, both p < 0.05, CI = 0.796) in the TCGA GBMLGG dataset.

3. Discussion

Prior deep learning models in cancer research [8,18,19] were derived from multiple cancer types to increase sample numbers, but fall short in identifying genes or primary mutations of mechanistic interest. A recent deep feedforward network study performed gene expression feature selection and outcome classification in TCGA breast cancer data as well as in TCGA kidney renal cell carcinoma data [20]. In that study, only features important to the model output were identified, and a number of biologically relevant features were found.

In this paper, we extend the input permutation method for feature importance ranking to deep learning networks. The input permutation method is a useful technique for feature importance ranking in machine learning [14] and is broadly applicable to various models. It is usually used to rank feature importance at the model output. Extending this method to rank features at any hidden layer within a deep learning model opens up many possibilities. It solidifies our understanding of the model and helps to explain the deep learning model, which is usually considered a black box. In the proposed GBM model, we found that the last hidden layer contains important features that are far more biologically relevant than those obtained from the output layer.

Compared with the gene signatures of previously identified GBM molecular subtypes (classical, proneural, and mesenchymal) [17,21,22], our 39-gene signature has a small overlap with the 840-gene signature in the classical (8 out of 210), proneural (6 out of 210), and mesenchymal (5 out of 210) subtypes, or 12 genes overlapping among all subtypes. The other 27 prognostic genes are likely due to shared biological mechanism(s) among tumor subtypes that are crucial to patient survival. The improvement in model concordance from 0.68 to 0.76 with the addition of deep-learned network parameters confirmed the prognostic value of these additional genes in addition to the known tumor subtypes. It is also quite remarkable that the 39-gene signature learned from one GBM dataset is able to stratify patients in several low-grade glioma datasets as well as in other GBM datasets. In addition, two lesser-known genes, AQP1 and MEOX2, were discovered through the deep learning approach to be prognostic of glioma patients' overall survival.

Our deep learning features revealed many genes of interest to glioblastoma stem cell mechanisms. For example, both glutamate decarboxylase 1 (GAD1) and chitinase 3-like 1 (CHI3L1/YKL-40) have recently been identified as targets of Notch inhibitors (alpha-secretase and gamma-secretase inhibitors) in treating glioblastoma stem cells. Notch inhibitors work via Notch binding to the YKL-40 and leukemia inhibitory factor (LIF) promoters and increased survival in a GBM stem cell orthotopic mouse model [23]. Epigenetic upregulation of GAD1 has been shown to program the aggressive features of cancer cell metabolism in the brain metastatic microenvironment [24]. CHI3L1/YKL-40 was also previously reported to be prognostic of glioma patient survival [25] and to be involved in the angiogenesis, radioresistance, and progression of glioblastoma in vivo [26]. Periostin (POSTN) has been shown to impact GBM stem cell tumorigenicity and GBM patient survival [27]. POSTN is secreted by glioblastoma stem cells to recruit tumor-associated macrophages in order to promote malignant growth [28] and regulate tumor resistance to anti-angiogenic therapy [29]. Nicotinamide N-methyltransferase (NNMT) was also reported to regulate mesenchymal glioblastoma stem cell maintenance by depleting methionine, shifting tumors towards a mesenchymal phenotype and accelerating tumor growth [30]. NNMT was reported to be a prognostic marker for glioblastoma [31], inhibiting tumor suppressor protein phosphatase 2 (PP2A) at the epigenome and proteome level and concomitantly activating prosurvival serine/threonine kinases. Receptor tyrosine-protein kinase ErbB-3 (ERBB3) is known to mediate glioblastoma cancer stem-like cell resistance to EGFR inhibition [32].

Brevican (BCAN), which is known to bind to tenascin-R (TNR) with high affinity [33], is highly expressed in the glioma-initiating cells' extracellular niche in human GBM tumors and is expressed by glioma-initiating cells in vitro [34]. Although BCAN knockdown does not affect glioblastoma-initiating cell viability in vitro [34], BCAN promotes glioma cell adhesion and migration in vitro [35], and knockdown of the gene inhibits both cell motility in vitro and tumorigenicity in vivo [35]. Tenascin-C (TNC) and tenascin-R (TNR) are two of the three members of the tenascin family of extracellular matrix glycoproteins. TNC (which occurred in four networks) has been shown to promote glioblastoma invasion [36] and is heavily involved in pro-angiogenic and anti-angiogenic signaling in glioblastoma [37], as well as having an impact on survival [38]. Strong TNR expression is linked to non-invasive brain tumors (pilocytic astrocytomas), whereas weak expression is detected in glioblastoma [39]. The exact role of TNR, particularly in the glioblastoma stem cell extracellular niche, is a subject worth exploring.

On the other hand, ELAV-like RNA binding protein 4 (ELAVL4) has been shown to modulate radiation sensitivity in vitro in non-small cell lung cancer [40]. Secretogranin III (SCG3) has been shown to be involved in anti-angiogenesis in diabetic retinopathy [41]. Secreted phospholipase A2 group IIA (PLA2G2A) was shown to induce phosphorylation of the EGFR to induce proliferation through a PKC-dependent pathway in human astrocytoma in vitro [42]. Interestingly, the thymosin beta 15B (TMSB15B) is involved in epidermal growth factor-induced migration of prostate cancer cells [43].

Deep learning models that are fully connected and have high dimensional inputs are notoriously difficult to train due to their large number of variables. In our case, the number of variables, about 300,000, is much larger than the number of cases (n = 492). Our model is able to generalize on out-of-sample patient cases and has comparable performance in validation concordance index across different data sample splits. Due to the limitation of a single gene chip platform used in this study, its out-of-sample performance on another chip platform may need evaluation.
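The figure of about 300,000 variables can be checked from the reported layer sizes. The sketch below assumes 3581 input probes (Section 4.1) feeding 82 and then 27 hidden nodes and one output node; it counts only dense weights and biases, not batch-normalization parameters, and the function name `dense_parameter_count` is hypothetical.

```python
def dense_parameter_count(layer_sizes, with_bias=True):
    """Count the weights (and optionally biases) of a fully connected network."""
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out  # one weight per input-output connection
        if with_bias:
            total += n_out     # one bias per output node
    return total


layer_sizes = [3581, 82, 27, 1]
n_weights = dense_parameter_count(layer_sizes, with_bias=False)
n_with_bias = dense_parameter_count(layer_sizes, with_bias=True)
print(n_weights, n_with_bias)  # 295883 295993
```

Almost all of the parameters (3581 × 82 = 293,642) sit between the input layer and the first hidden layer, which is roughly 600 parameters per patient for n = 492.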

There are other limitations on the performance of deep learning based on differentially expressed genes. For instance, with the 2-fold change cutoff used in this study, we may be artificially removing genes that are prognostic of survival. In addition, the relatively small number of patient samples in this study limits the depth of the model.

Finally, deep learning without explicit biological knowledge or network architecture constraints is not expected to learn biological structure within the data, so care must be taken with biological interpretation. In our case, we identify the prognostic genes as those with a disproportionate impact on the last hidden layer and a high occurrence frequency. Using deep learning to extract prognostic differentially expressed genes for survival prediction also provides a flexible way to combine gene expression data with other clinical covariates such as age, KPS, therapy options, and tumor subtypes to enable better patient survival stratification.

4. Materials and Methods

4.1. Gene Expression Data Analysis

The Cancer Genome Atlas (TCGA) is a public repository of patient-derived clinical, imaging, and genomic data that have been deidentified and contain no linkage to patient identifiers; therefore, no institutional review board or Health Insurance Portability and Accountability Act approval was required for our study. Microarray data from untreated glioblastoma patients (n = 492) were retrieved from TCGA. Gene expression level 1 data from the Affymetrix Human Genome U133A platform were used. The data were processed with scripts in R (version 3.2, https://www.R-project.org/, Vienna, Austria) using the affy [44] and affycoretools [45] packages, and quality control was conducted using the affyQCReport package. The probe-level data were normalized with the gcrma [46] package to control for batch variations, and the probe-level expressions were compared to those of the normal brain tissue group (n = 10). Statistically significant gene probe changes were selected with a threshold of p < 0.01 and at least a 2-fold biological change, adjusted for multiple comparisons using the beta uniform mixture model [47]. Genes with a Pearson correlation coefficient higher than 0.8 were represented by one gene to reduce the strong intrinsic correlation. The number of gene probes was reduced to 3581, and these were used as model inputs.
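The correlation-based reduction step can be illustrated with a small sketch. The source does not say how the representative gene was chosen within a correlated group, so the greedy rule below (keep a probe only if its Pearson correlation with every probe already kept is at most 0.8) is an assumption, and the names `pearson` and `drop_correlated` are hypothetical. The original analysis was done in R; Python is used here for illustration only.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length profiles."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    if var_a == 0.0 or var_b == 0.0:
        return 0.0  # a constant profile carries no correlation information
    return cov / math.sqrt(var_a * var_b)


def drop_correlated(profiles, threshold=0.8):
    """Greedy filter (assumed rule): keep the first probe of each correlated group."""
    kept = []
    for index, profile in enumerate(profiles):
        if all(pearson(profile, profiles[k]) <= threshold for k in kept):
            kept.append(index)
    return kept


# Toy expression profiles: probe 1 is nearly a multiple of probe 0.
profiles = [
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 4.0, 6.0, 8.1],
    [4.0, 1.0, 3.0, 2.0],
]
kept = drop_correlated(profiles)
print(kept)  # [0, 2]
```

With a greedy rule the result depends on probe order, so a real pipeline would need a documented tie-breaking criterion.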

Clinical data such as age, gender, MGMT methylation, G-CIMP, IDH1/2 mutation, cancer subtype, and therapy information were retrieved from a TCGA publication [22].

4.2. Deep Learning Model

The deep learning model was built with TensorFlow 1.3 and Python 3.6 using the deep learning survival modeling framework [48]. The gene expression dataset was randomly partitioned into ten equal partitions, with the first as the testing set, the second as the validation set, and the remaining eight as training sets. Stratified sampling was used to preserve the survival time distribution across the data partitions to fully capture the heterogeneity from multiple sites. The testing set contained samples from 11 sites, whereas the validation set contained samples from 12 sites. The network structure consists of an input layer, one or two hidden layers with rectified linear unit (ReLU) activation functions, and an output layer with a single node corresponding to the survival prognosis of each patient. The partial likelihood function was used as the loss function and an L₂ penalty was applied to all network weights to prevent overfitting [49], retaining the advantage of interpretation like Cox's proportional hazard model. Batch normalization [50] was used in each layer to improve learning stability.
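As an illustration of the loss described above, the following sketch computes the negative Cox partial log-likelihood of a set of predicted risks, plus an L₂ term. It is a simplified stand-in, not the cited framework's code: it handles tied event times naively, averages over events, and the names `neg_log_partial_likelihood` and `l2_penalty` are hypothetical.

```python
import math


def neg_log_partial_likelihood(times, events, risks):
    """Average negative Cox partial log-likelihood (naive handling of ties)."""
    log_likelihood = 0.0
    n_events = 0
    for i, (time_i, event_i) in enumerate(zip(times, events)):
        if not event_i:
            continue  # censored patients contribute only through risk sets
        # Risk set: everyone still under observation at the event time of patient i.
        risk_set = [risks[j] for j, time_j in enumerate(times) if time_j >= time_i]
        log_likelihood += risks[i] - math.log(sum(math.exp(r) for r in risk_set))
        n_events += 1
    if n_events == 0:
        raise ValueError("at least one observed event is required")
    return -log_likelihood / n_events


def l2_penalty(weights, strength):
    """L2 regularization term added to the loss."""
    return strength * sum(w * w for w in weights)


times = [5.0, 8.0, 12.0, 20.0]
events = [1, 1, 0, 1]
well_ordered = [2.0, 1.0, 0.5, -1.0]      # highest risk dies first
badly_ordered = list(reversed(well_ordered))
loss_good = neg_log_partial_likelihood(times, events, well_ordered)
loss_bad = neg_log_partial_likelihood(times, events, badly_ordered)
print(loss_good, loss_bad)
```

The loss depends only on the ordering and relative size of the predicted risks within each risk set, which is why no baseline hazard has to be estimated during training.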

Network structures, including number of hidden layers and number of hidden nodes, are varied to arrive at different models with a maximum of two hidden layers. Hyperparameter tuning on hidden layer(s), nodes and learning rate used concordance index as performance criteria in both the training and validation datasets. A maximum of 1000 epochs were allowed for computation convergence. The optimum hidden layers and nodes were determined by the maximum concordance index in the validation dataset; parameters were stored as a model for testing.

The gene expression dataset was randomly split into training, validation, and testing datasets with ratios of 80%, 10%, and 10%, respectively, while preserving the distribution of survival time to maximize the available datasets for training. The performance variability of the deep learning model was tested by rotating each sample partition as a validation dataset while using the rest for training. After the validation statistics were evaluated, the model's performance was tested on an out-of-sample dataset that was never used in the modeling process.
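One way to realize a survival-time-stratified ten-way partition with a rotating validation set is sketched below. The source does not describe the stratification procedure, so the rule used here (sort patients by survival time and deal each consecutive block of ten randomly across the partitions) is an assumption, as are the names `stratified_partitions` and `rotate_validation` and the synthetic survival times.

```python
import random


def stratified_partitions(survival_times, n_partitions=10, seed=0):
    """Deal patients into partitions so each spans the survival-time range."""
    rng = random.Random(seed)
    order = sorted(range(len(survival_times)), key=lambda i: survival_times[i])
    partitions = [[] for _ in range(n_partitions)]
    for start in range(0, len(order), n_partitions):
        block = order[start:start + n_partitions]  # patients with similar survival
        rng.shuffle(block)
        for k, patient in enumerate(block):
            partitions[k].append(patient)
    return partitions


def rotate_validation(partitions, test_index=0):
    """Yield (validation index, training patients, validation patients)."""
    for v in range(len(partitions)):
        if v == test_index:
            continue  # the held-out testing partition is never used for fitting
        train = [p for k, part in enumerate(partitions)
                 if k not in (test_index, v) for p in part]
        yield v, train, partitions[v]


rng = random.Random(1)
survival = [rng.expovariate(1 / 400.0) for _ in range(492)]  # synthetic days
partitions = stratified_partitions(survival)
rotations = list(rotate_validation(partitions))
print([len(p) for p in partitions], len(rotations))
```

Holding one partition out for testing leaves nine partitions to rotate through, which is consistent with the 9-fold cross-validation used for the penalized Cox models.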

4.3. Deep Learning Model Performance Comparison with Penalized Cox Regression Models

A comparison was made between deep learning model and penalized Cox regression models, including ridge, adaptive Lasso [51], and elastic net [52] using the glmnet package [53] (version 2.0.12).

The Cox regression method assumes a semi-parametric hazard of the form:

h_{i}(t) = h_{0}(t) e^{x_{i}^{T} β}

where h_{i}(t) is the hazard for patient i at time t, h_{0}(t) is the baseline hazard, x_{i} is the covariate vector of patient i, and β is a fixed coefficient vector of length p. In penalized Cox regression, models are fitted by maximizing the penalized partial log-likelihood function, which is given by:

\sum_{i = 1}^{m} \left[ x_{i}^{T} β - \log \sum_{j \in R_{i}} e^{x_{j}^{T} β} \right] - \sum_{j = 1}^{p} p_{α, λ}(| β_{j} |)

where the first sum runs over the m observed event times, R_{i} is the set of patients still at risk at the event time of patient i, and p_{α, λ}(| \cdot |) is the penalty function with tuning parameters λ and α.

For ridge regression, the penalty function takes the form:

p_{α, λ}(| β_{j} |) = λ β_{j}^{2}

For adaptive Lasso regression, the penalty function takes the form:

p_{α, λ}(| β_{j} |) = λ w_{j} | β_{j} |

where w_{j} = 1 / | β_{j 0} | and β_{j 0} is the initial estimate of β_{j}, which in this case is obtained by ridge regression.

For elastic net, the penalty function takes the form:

p_{α, λ}(| β_{j} |) = λ (α | β_{j} | + (1 - α) \frac{1}{2} β_{j}^{2})

where α \in (0, 1].

Model performance was evaluated using the concordance index, and the same survival-time-stratified training/validation/testing datasets were used as in the deep learning model for a fair comparison. A 9-fold cross-validation was used. The minimum-lambda (lambda.min) model was chosen because the lambda.1se models were not numerically stable. Feature importance of the Lasso, adaptive Lasso, and elastic net methods was determined by the absolute magnitude of the regression coefficients.
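The three penalty functions in this section can be written out directly. The sketch below is for illustration and is not the glmnet implementation; it sums the per-coefficient penalty over all coefficients and assumes the adaptive Lasso weight uses the absolute value of the initial estimate. All function names are hypothetical.

```python
def ridge_penalty(beta, lam):
    """Sum of lam * beta_j^2 over all coefficients."""
    return lam * sum(b * b for b in beta)


def adaptive_lasso_penalty(beta, lam, beta_init):
    """Sum of lam * w_j * |beta_j| with w_j = 1 / |beta_j0| (nonzero initial estimates)."""
    return lam * sum(abs(b) / abs(b0) for b, b0 in zip(beta, beta_init))


def elastic_net_penalty(beta, lam, alpha):
    """Sum of lam * (alpha * |beta_j| + (1 - alpha) * beta_j^2 / 2)."""
    return lam * sum(alpha * abs(b) + (1.0 - alpha) * 0.5 * b * b for b in beta)


beta = [1.0, -2.0, 0.0]
ridge = ridge_penalty(beta, lam=0.5)
adaptive = adaptive_lasso_penalty(beta, lam=0.5, beta_init=[0.5, -1.0, 2.0])
mixed = elastic_net_penalty(beta, lam=0.5, alpha=0.5)
lasso_limit = elastic_net_penalty(beta, lam=0.5, alpha=1.0)
print(ridge, adaptive, mixed, lasso_limit)  # 2.5 2.0 1.375 1.5
```

Setting α = 1 in the elastic net recovers the ordinary Lasso penalty, while coefficients with small initial ridge estimates are penalized more heavily by the adaptive Lasso.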

4.4. Impact of Deep Learning Network Features on Baseline Survival Model

The hidden nodes’ outputs are ReLU functions that are positive or zero, functioning like an on/off switch that allows effects from a previous level of interacting genes to pass through. Whether these network signatures provide distinct or complementary prognostic value to the baseline survival model was evaluated using the Cox proportional hazard model.

A baseline Cox model was constructed from clinical covariates, tumor subtypes, therapy options and genetic mutation/methylation status known to be associated with patient survival. Clinical and genetic mutation/methylation covariates include age, Karnofsky performance scale, gender, IDH1/2 mutation status, MGMT, G-CIMP methylation status, tumor subtypes (proneural, mesenchymal and classical) as well as chemotherapy/radiation/chemoradiation therapy options. To evaluate the additional prognostic value of deep learning network nodes, they were added to the baseline Cox proportional hazard model. The contribution of these deep learning features in improving survival prediction over the baseline model was evaluated using the concordance index.

4.5. Identifying Important Genes in Deep Learning Model

To identify the genes important to survival, we permuted the input genes one gene at a time to break the correlation between the input gene and the output risk [54]. An important gene that contributes significantly to the overall model when permuted across the patient group will impose a significant change to the predicted patient survival, whereas an unimportant gene will not. The process was repeated five times and the average change in patient risk was used. The high impact genes were identified as those that affect predicted patient survival outside the 95% confidence interval of the average change due to single gene permutation.
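The permutation procedure described above can be sketched as follows. This is a minimal illustration on a toy model, not the authors' code: `predict` stands for any function that maps one patient's expression profile to a scalar (the output risk or, for hidden layer analysis, the output of a single network node), the mean absolute change is one possible summary of the "average change", and the name `permutation_importance` is hypothetical.

```python
import random


def permutation_importance(predict, rows, n_repeats=5, seed=0):
    """Mean absolute change in prediction when one input column is shuffled."""
    rng = random.Random(seed)
    baseline = [predict(row) for row in rows]
    importance = []
    for gene in range(len(rows[0])):
        changes = []
        for _ in range(n_repeats):
            column = [row[gene] for row in rows]
            rng.shuffle(column)  # break the link between this gene and the output
            permuted = [row[:gene] + [value] + row[gene + 1:]
                        for row, value in zip(rows, column)]
            predictions = [predict(row) for row in permuted]
            changes.append(sum(abs(p - b) for p, b in zip(predictions, baseline))
                           / len(rows))
        importance.append(sum(changes) / n_repeats)
    return importance


def toy_model(row):
    """Toy risk model: depends on gene 0 only."""
    return 3.0 * row[0] + 0.0 * row[1]


rows = [[float(i), float(i % 3)] for i in range(20)]
importance = permutation_importance(toy_model, rows)
print(importance)
```

In the toy example only the first gene receives a nonzero score. Selecting high-impact genes would then require a threshold on these scores, such as the 95% confidence interval criterion described in the text.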

4.6. Prognostic Significance Validation of Gene Set with External Data

To assess the prognostic significance of our gene set in predicting survival in glioblastoma patients, we evaluated it in seven glioblastoma and three low-grade glioma studies (Table 5) using Cox proportional hazard analysis with the SurvExpress platform [55]. The samples were split by the median of the prognostic index to designate low-risk and high-risk groups. The top-ranked 39 genes, which correspond to p < 0.01 or equivalently any gene occurring at least 10 times in the 27 network nodes, were chosen as gene biomarkers. The gene biomarkers were evaluated for survival difference between the low-risk and high-risk groups using the log-rank test. The prognostic index is the linear component of the exponential function in the Cox model.
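The risk grouping described above amounts to computing the linear predictor of the Cox model for each patient and splitting at the cohort median. The sketch below uses made-up coefficients and expression values; assigning patients exactly at the median to the low-risk group is an assumption, and the function names are hypothetical rather than part of SurvExpress.

```python
import statistics


def prognostic_index(expression, coefficients):
    """Linear component of the Cox model: sum of coefficient * expression."""
    return sum(b * x for b, x in zip(coefficients, expression))


def split_by_median(indexes):
    """Label each patient high-risk or low-risk relative to the cohort median."""
    cutoff = statistics.median(indexes)
    return ["high-risk" if value > cutoff else "low-risk" for value in indexes]


coefficients = [0.8, -0.5]                    # illustrative Cox coefficients
cohort = [[1.0, 2.0], [2.0, 0.5], [0.5, 0.5], [3.0, 1.0]]
indexes = [prognostic_index(patient, coefficients) for patient in cohort]
groups = split_by_median(indexes)
print(groups)  # ['low-risk', 'high-risk', 'low-risk', 'high-risk']
```

The two resulting groups are what the log-rank test and the Kaplan–Meier curves compare.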

5. Conclusions

In conclusion, we discovered that the deep learning survival prediction model learned genes that are strongly related to glioblastoma stem cells and/or treatment resistance, which may be useful to inform patient therapy. Compared with traditional Cox proportional hazard survival models, deep learning networks provide non-redundant prognostic covariates for patient survival even in the presence of strong clinical predictors. Using this approach, we identified many specific genes that are potential biomarkers or therapeutic targets.

Supplementary Materials

The following are available online at https://www.mdpi.com/2072-6694/11/1/53/s1, Table S1: Ranked gene importance in prognostic network nodes, Table S2: Frequency analysis of genes occurring in 27 prognostic network nodes.

Author Contributions

Conceptualization, K.K.W., S.T.C.W.; Methodology, K.K.W., S.T.C.W.; Formal analysis, K.K.W.; Writing—original draft preparation, K.K.W.; Writing—review and editing, S.T.C.W, R.R.; Supervision, K.K.W., S.T.C.W.; Project administration, K.K.W., S.T.C.W.; Funding acquisition, K.K.W., S.T.C.W.

Funding

This research was funded by Ting Tsung and Wei Fong Chao Foundation, John S Dunn Research Foundation, NIH U01 CA188388 and NIH R01 NS091251.

Acknowledgments

The results published here are in whole or part based upon data generated by the TCGA Research Network: http://cancergenome.nih.gov/. The authors sincerely acknowledge the helpful discussions with Nan Xiang and Hong Zhao, both from the Systems Medicine and Bioengineering Department, Houston Methodist Research Institute.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Ripley, B.D. Pattern Recognition and Neural Networks; Cambridge University Press: Cambridge, UK, 1996.
2. Bishop, C.M. Neural Networks for Pattern Recognition; Oxford University Press: Oxford, UK, 1995.
3. Cheng, B.; Titterington, D.M. Neural Networks: A Review from a Statistical Perspective. Stat. Sci. 1994, 9, 2–30.
4. Kuan, C.M.; White, H. Artificial Neural Networks: An Econometric Perspective. Econom. Rev. 1994, 13, 1–91.
5. Ripley, B.D. Statistical Aspects of Neural Networks; Chapman & Hall: Boca Raton, FL, USA, 1993.
6. Schmidhuber, J. Deep learning in neural networks: An overview. Neural Netw. 2015, 61, 85–117.
7. Cherkassky, V.; Friedman, J.H.; Wechsler, H. Statistics to Neural Networks: Theory and Pattern Recognition Applications; Springer: Berlin, Germany, 1994.
8. Young, J.D.; Cai, C.; Lu, X. Unsupervised deep learning reveals prognostically relevant subtypes of glioblastoma. BMC Bioinform. 2017, 18, 381.
9. Preuer, K.; Lewis, R.P.I.; Hochreiter, S.; Bender, A.; Bulusu, K.C.; Klambauer, G. DeepSynergy: Predicting anti-cancer drug synergy with Deep Learning. Bioinformatics 2017.
10. Chaudhary, K.; Poirion, O.B.; Lu, L.; Garmire, L.X. Deep Learning based multi-omics integration robustly predicts survival in liver cancer. Clin. Cancer Res. 2017.
11. Putin, E.; Mamoshina, P.; Aliper, A.; Korzinkin, M.; Moskalev, A.; Kolosov, A.; Ostrovskiy, A.; Cantor, C.; Vijg, J.; Zhavoronkov, A. Deep biomarkers of human aging: Application of deep neural networks to biomarker development. Aging 2016, 8, 1021–1033.
12. Angermueller, C.; Lee, H.J.; Reik, W.; Stegle, O. DeepCpG: Accurate prediction of single-cell DNA methylation states using deep learning. Genome Biol. 2017, 18, 67.
13. Tan, J.; Ung, M.; Cheng, C.; Greene, C.S. Unsupervised feature construction and knowledge extraction from genome-wide assays of breast cancer with denoising autoencoders. Pac. Symp. Biocomput. 2015, 132–143.
14. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32.
15. Kim, Y.W.; Koul, D.; Kim, S.H.; Lucio-Eterovic, A.K.; Freire, P.R.; Yao, J.; Wang, J.; Almeida, J.S.; Aldape, K.; Yung, W.K. Identification of prognostic gene signatures of glioblastoma: A study based on TCGA data analysis. Neuro-Oncology 2013, 15, 829–839.
16. Kim, H.; Bredel, M. Feature selection and survival modeling in The Cancer Genome Atlas. Int. J. Nanomed. 2013, 8 (Suppl. 1), 57–62.
17. Verhaak, R.G.; Hoadley, K.A.; Purdom, E.; Wang, V.; Qi, Y.; Wilkerson, M.D.; Miller, C.R.; Ding, L.; Golub, T.; Mesirov, J.P.; et al. Integrated genomic analysis identifies clinically relevant subtypes of glioblastoma characterized by abnormalities in PDGFRA, IDH1, EGFR, and NF1. Cancer Cell 2010, 17, 98–110.
18. Yousefi, S.; Amrollahi, F.; Amgad, M.; Dong, C.; Lewis, J.E.; Song, C.; Gutman, D.A.; Halani, S.H.; Velazquez Vega, J.E.; Brat, D.J.; et al. Predicting clinical outcomes from large scale cancer genomic profiles with deep survival models. Sci. Rep. 2017, 7, 11707.
19. Martinez-Ledesma, E.; Verhaak, R.G.; Trevino, V. Identification of a multi-cancer gene expression biomarker for cancer clinical outcomes using a network-based algorithm. Sci. Rep. 2015, 5, 11966.
20. Kong, Y.; Yu, T. A graph-embedded deep feedforward network for disease outcome classification and feature selection using gene expression data. Bioinformatics 2018, 34, 3727–3737.
21. Wang, Q.; Hu, B.; Hu, X.; Kim, H.; Squatrito, M.; Scarpace, L.; deCarvalho, A.C.; Lyu, S.; Li, P.; Li, Y.; et al. Tumor Evolution of Glioma-Intrinsic Gene Expression Subtypes Associates with Immunological Changes in the Microenvironment. Cancer Cell 2017, 32, 42–56.
22. Brennan, C.W.; Verhaak, R.G.; McKenna, A.; Campos, B.; Noushmehr, H.; Salama, S.R.; Zheng, S.; Chakravarty, D.; Sanborn, J.Z.; Berman, S.H.; et al. The somatic genomic landscape of glioblastoma. Cell 2013, 155, 462–477.
23. Floyd, D.H.; Kefas, B.; Seleverstov, O.; Mykhaylyk, O.; Dominguez, C.; Comeau, L.; Plank, C.; Purow, B. Alpha-secretase inhibition reduces human glioblastoma stem cell growth in vitro and in vivo by inhibiting Notch. Neuro-Oncology 2012, 14, 1215–1226.
24. Schnepp, P.M.; Lee, D.D.; Guldner, I.H.; O’Tighearnaigh, T.K.; Howe, E.N.; Palakurthi, B.; Eckert, K.E.; Toni, T.A.; Ashfeld, B.L.; Zhang, S. GAD1 Upregulation Programs Aggressive Features of Cancer Cell Metabolism in the Brain Metastatic Microenvironment. Cancer Res. 2017, 77, 2844–2856.
25. Steponaitis, G.; Skiriute, D.; Kazlauskas, A.; Golubickaite, I.; Stakaitis, R.; Tamasauskas, A.; Vaitkiene, P. High CHI3L1 expression is associated with glioma patient survival. Diagn. Pathol. 2016, 11, 42.
26. Francescone, R.A.; Scully, S.; Faibish, M.; Taylor, S.L.; Oh, D.; Moral, L.; Yan, W.; Bentley, B.; Shao, R. Role of YKL-40 in the angiogenesis, radioresistance, and progression of glioblastoma. J. Biol. Chem. 2011, 286, 15332–15343.
27. Mikheev, A.M.; Mikheeva, S.A.; Trister, A.D.; Tokita, M.J.; Emerson, S.N.; Parada, C.A.; Born, D.E.; Carnemolla, B.; Frankel, S.; Kim, D.H.; et al. Periostin is a novel therapeutic target that predicts and regulates glioma malignancy. Neuro-Oncology 2015, 17, 372–382.
Zhou, W.; Ke, S.Q.; Huang, Z.; Flavahan, W.; Fang, X.; Paul, J.; Wu, L.; Sloan, A.E.; McLendon, R.E.; Li, X.; et al. Periostin secreted by glioblastoma stem cells recruits M2 tumour-associated macrophages and promotes malignant growth. Nat. Cell Biol. 2015, 17, 170–182. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Park, S.Y.; Piao, Y.; Jeong, K.J.; Dong, J.; de Groot, J.F. Periostin (POSTN) Regulates Tumor Resistance to Antiangiogenic Therapy in Glioma Models. Mol. Cancer Ther. 2016, 15, 2187–2197. [Google Scholar] [CrossRef]
Jung, J.; Kim, L.J.Y.; Wang, X.; Wu, Q.; Sanvoranart, T.; Hubert, C.G.; Prager, B.C.; Wallace, L.C.; Jin, X.; Mack, S.C.; et al. Nicotinamide metabolism regulates glioblastoma stem cell maintenance. JCI Insight 2017, 2. [Google Scholar] [CrossRef] [Green Version]
Palanichamy, K.; Kanji, S.; Gordon, N.; Thirumoorthy, K.; Jacob, J.R.; Litzenberg, K.T.; Patel, D.; Chakravarti, A. NNMT Silencing Activates Tumor Suppressor PP2A, Inactivates Oncogenic STKs, and Inhibits Tumor Forming Ability. Clin. Cancer Res. 2017, 23, 2325–2334.
Clark, P.A.; Iida, M.; Treisman, D.M.; Kalluri, H.; Ezhilan, S.; Zorniak, M.; Wheeler, D.L.; Kuo, J.S. Activation of multiple ERBB family receptors mediates glioblastoma cancer stem-like cell resistance to EGFR-targeted inhibition. Neoplasia 2012, 14, 420–428.
Anlar, B.; Gunel-Ozcan, A. Tenascin-R: Role in the central nervous system. Int. J. Biochem. Cell Biol. 2012, 44, 1385–1389.
Dwyer, C.A.; Bi, W.L.; Viapiano, M.S.; Matthews, R.T. Brevican knockdown reduces late-stage glioma tumor aggressiveness. J. Neurooncol. 2014, 120, 63–72.
Lu, R.; Wu, C.; Guo, L.; Liu, Y.; Mo, W.; Wang, H.; Ding, J.; Wong, E.T.; Yu, M. The role of brevican in glioma: Promoting tumor cell motility in vitro and in vivo. BMC Cancer 2012, 12, 607.
Xia, S.; Lal, B.; Tung, B.; Wang, S.; Goodwin, C.R.; Laterra, J. Tumor microenvironment tenascin-C promotes glioblastoma invasion and negatively regulates tumor proliferation. Neuro-Oncology 2016, 18, 507–517.
Rupp, T.; Langlois, B.; Koczorowska, M.M.; Radwanska, A.; Sun, Z.; Hussenet, T.; Lefebvre, O.; Murdamoothoo, D.; Arnold, C.; Klein, A.; et al. Tenascin-C Orchestrates Glioblastoma Angiogenesis by Modulation of Pro- and Anti-angiogenic Signaling. Cell Rep. 2016, 17, 2607–2619.
Midwood, K.S.; Hussenet, T.; Langlois, B.; Orend, G. Advances in tenascin-C biology. Cell. Mol. Life Sci. 2011, 68, 3175–3199.
El Ayachi, I.; Baeza, N.; Fernandez, C.; Colin, C.; Scavarda, D.; Pesheva, P.; Figarella-Branger, D. KIAA0510, the 3′-untranslated region of the tenascin-R gene, and tenascin-R are overexpressed in pilocytic astrocytomas. Neuropathol. Appl. Neurobiol. 2010, 36, 399–410.
Choi, K.J.; Lee, J.H.; Kim, K.S.; Kang, S.; Lee, Y.S.; Bae, S. Identification of ELAVL4 as a modulator of radiation sensitivity in A549 non-small cell lung cancer cells. Oncol. Rep. 2011, 26, 55–63.
LeBlanc, M.E.; Wang, W.; Chen, X.; Caberoy, N.B.; Guo, F.; Shen, C.; Ji, Y.; Tian, H.; Wang, H.; Chen, R.; et al. Secretogranin III as a disease-associated ligand for antiangiogenic therapy of diabetic retinopathy. J. Exp. Med. 2017, 214, 1029–1047.
Hernandez, M.; Martin, R.; Garcia-Cubillas, M.D.; Maeso-Hernandez, P.; Nieto, M.L. Secreted PLA2 induces proliferation in astrocytoma through the EGF receptor: Another inflammation-cancer link. Neuro-Oncology 2010, 12, 1014–1023.
Banyard, J.; Barrows, C.; Zetter, B.R. Differential regulation of human thymosin beta 15 isoforms by transforming growth factor beta 1. Genes Chromosomes Cancer 2009, 48, 502–509.
Gautier, L.; Cope, L.; Bolstad, B.M.; Irizarry, R.A. Analysis of Affymetrix GeneChip data at the probe level. Bioinformatics 2004, 20, 307–315.
MacDonald, J.W. Affycoretools: Functions Useful for Those Doing Repetitive Analyses with Affymetrix GeneChips. Available online: https://bioconductor.org/packages/release/bioc/html/affycoretools.html (accessed on 23 September 2017).
Wu, Z.J.; Irizarry, R.A.; Gentleman, R.; Martinez-Murillo, F.; Spencer, F. A model-based background adjustment for oligonucleotide expression arrays. J. Am. Stat. Assoc. 2004, 99, 909–917.
Pounds, S.; Morris, S.W. Estimating the occurrence of false positives and false negatives in microarray studies by approximating and partitioning the empirical distribution of p-values. Bioinformatics 2003, 19, 1236–1242.
Hallam, A. TensorFlow-Survival-Analysis. Available online: https://github.com/alexhallam/TensorFlow-Survival-Analysis (accessed on 23 September 2017).
Faraggi, D.; Simon, R. A neural network model for survival data. Stat. Med. 1995, 14, 73–82.
Ioffe, S.; Szegedy, C. Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift. In Proceedings of the International Conference on Machine Learning, Lille, France, 6–11 July 2015; pp. 448–456.
Wang, L.; Shen, J.; Thall, P.F. A Modified Adaptive Lasso for Identifying Interactions in the Cox Model with the Heredity Constraint. Stat. Probab. Lett. 2014, 93, 126–133.
Suchting, R.; Hebert, E.T.; Ma, P.; Kendzor, D.E.; Businelle, M.S. Using Elastic Net Penalized Cox Proportional Hazards Regression to Identify Predictors of Imminent Smoking Lapse. Nicotine Tob. Res. 2017.
Simon, N.; Friedman, J.; Hastie, T.; Tibshirani, R. Regularization Paths for Cox’s Proportional Hazards Model via Coordinate Descent. J. Stat. Softw. 2011, 39, 1–13.
Giam, X.; Olden, J.D. A new R2-based metric to shed greater insight on variable importance in artificial neural networks. Ecol. Model. 2015, 313, 307–313.
Aguirre-Gamboa, R.; Gomez-Rueda, H.; Martinez-Ledesma, E.; Martinez-Torteya, A.; Chacolla-Huaringa, R.; Rodriguez-Barrientos, A.; Tamez-Pena, J.G.; Trevino, V. SurvExpress: An online biomarker validation tool and database for cancer gene expression data using survival analysis. PLoS ONE 2013, 8, e74250.
Lee, Y.; Scheck, A.C.; Cloughesy, T.F.; Lai, A.; Dong, J.; Farooqi, H.K.; Liau, L.M.; Horvath, S.; Mischel, P.S.; Nelson, S.F. Gene expression analysis of glioblastomas identifies the major molecular basis for the prognostic benefit of younger age. BMC Med. Genom. 2008, 1, 52.
Freije, W.A.; Castro-Vargas, F.E.; Fang, Z.; Horvath, S.; Cloughesy, T.; Liau, L.M.; Mischel, P.S.; Nelson, S.F. Gene expression profiling of gliomas strongly predicts survival. Cancer Res. 2004, 64, 6503–6510.
Gravendeel, L.A.; Kouwenhoven, M.C.; Gevaert, O.; de Rooi, J.J.; Stubbs, A.P.; Duijm, J.E.; Daemen, A.; Bleeker, F.E.; Bralten, L.B.; Kloosterhof, N.K.; et al. Intrinsic gene expression profiles of gliomas are a better predictor of survival than histology. Cancer Res. 2009, 69, 9065–9072.
Nutt, C.L.; Mani, D.R.; Betensky, R.A.; Tamayo, P.; Cairncross, J.G.; Ladd, C.; Pohl, U.; Hartmann, C.; McLaughlin, M.E.; Batchelor, T.T.; et al. Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. Cancer Res. 2003, 63, 1602–1607.
Murat, A.; Migliavacca, E.; Gorlia, T.; Lambiv, W.L.; Shay, T.; Hamou, M.F.; de Tribolet, N.; Regli, L.; Wick, W.; Kouwenhoven, M.C.; et al. Stem cell-related “self-renewal” signature and high epidermal growth factor receptor expression associated with resistance to concomitant chemoradiotherapy in glioblastoma. J. Clin. Oncol. 2008, 26, 3015–3024.
Joo, K.M.; Kim, J.; Jin, J.; Kim, M.; Seol, H.J.; Muradov, J.; Yang, H.; Choi, Y.L.; Park, W.Y.; Kong, D.S.; et al. Patient-specific orthotopic glioblastoma xenograft models recapitulate the histopathology and biology of human glioblastomas in situ. Cell Rep. 2013, 3, 260–273.
Phillips, H.S.; Kharbanda, S.; Chen, R.; Forrest, W.F.; Soriano, R.H.; Wu, T.D.; Misra, A.; Nigro, J.M.; Colman, H.; Soroceanu, L.; et al. Molecular subclasses of high-grade glioma predict prognosis, delineate a pattern of disease progression, and resemble stages in neurogenesis. Cancer Cell 2006, 9, 157–173.

Figure 1. Probability distribution of the number of network nodes in which an important gene occurs. It is very rare for an important gene to occur in many nodes that are prognostic of survival. Using a threshold of p < 0.01, only genes that occur in at least 10 of the 27 network nodes meet the criterion and are included in the 39-gene signature.

Figure 2. Kaplan–Meier curves (survival fraction versus survival time in months) show that the low-risk (green) and high-risk (red) groups are well separated by the top 39 genes across nine different datasets, including data from seven glioblastoma and three low-grade glioma studies.
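
For readers unfamiliar with the curves in Figure 2, the following is a minimal sketch of the Kaplan–Meier product-limit estimator in plain Python. The function name `kaplan_meier` and the follow-up times are hypothetical and chosen only for illustration; they are not data from this study, and the sketch does not reproduce the software used to draw the figure.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct observed event time.

    times  : follow-up durations (e.g., months)
    events : 1 if death was observed at that time, 0 if censored
    """
    survival = 1.0
    curve = []
    # Only times with at least one observed death change the estimate.
    for t in sorted({t for t, e in zip(times, events) if e}):
        at_risk = sum(1 for u in times if u >= t)
        deaths = sum(1 for u, e in zip(times, events) if u == t and e)
        survival *= 1.0 - deaths / at_risk
        curve.append((t, survival))
    return curve


# Hypothetical follow-up times in months (illustrative, not study data).
low_risk = kaplan_meier([6, 14, 20, 20, 31], [1, 0, 1, 1, 0])
high_risk = kaplan_meier([3, 5, 5, 9, 12], [1, 1, 1, 0, 1])

for label, curve in (("low risk", low_risk), ("high risk", high_risk)):
    print(label, [(t, round(s, 3)) for t, s in curve])
```

Censored patients leave the risk set without lowering the survival fraction, which is why the estimate only steps down at observed deaths.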

Table 1. Frequency analysis of important genes in the 27 deep-learned network nodes at the top hidden layer. Only the 100 most frequently occurring genes are listed for brevity.

Gene	Frequency	Gene	Frequency	Gene	Frequency	Gene	Frequency
TNR	17	DPYSL4	10	MEG3	9	GRB10	8
GAD1	16	EGFR	10	NES	9	KDELR3	8
TMSB15B	15	F13A1	10	NPTX2	9	KIF1A	8
POSTN	15	FBN2	10	NRXN1	9	LSAMP	8
SCG3	15	NEFM	10	NTSR2	9	LYPD1	8
PLA2G2A	14	PTGDS	10	PEG3	9	MMP9	8
NNMT	13	RAB6B	10	PROM1	9	MYT1L	8
CHI3L1	13	RAPGEF4	10	SH3GL3	9	NMNAT2	8
ELAVL4	13	RUNDC3A	10	SOX11	9	NNAT	8
TF	13	SERPINA3	10	SPOCK1	9	NOL4	8
UGT8	13	SH3GL2	10	TMEM35	9	NSG1	8
AQP1	12	SNAP25	10	C4B	8	PLBD1	8
COL6A3	12	TCEAL2	10	SLC16A3	8	RGS1	8
ERBB3	12	TIMP4	10	SOD2	8	RGS17	8
KCNQ2	12	LOC101060835	9	AIM1	8	RGS4	8
LTF	12	ADAM22	9	ANXA1	8	RTN1	8
MEOX2	12	BCAN	9	APOD	8	S100A2	8
PCDH9	12	C1orf61	9	ATP2B2	8	SLC17A7	8
STMN2	12	DDX25	9	ATP6V1G2	8	SRD5A1	8
FCGR2B	11	ETNPPL	9	CFI	8	STC1	8
FGFR3	11	FAM107A	9	DSP	8	STEAP3	8
SLC1A2	11	GABRB1	9	ENPP2	8	STK32B	8
CA10	10	GDF15	9	FCGBP	8	TAC1	8
CXCL14	10	GNAO1	9	FUT9	8	VSNL1	8
CXorf57	10	LGI1	9	FZD6	8	WIF1	8

Table 2. The prognostic value of the network node outputs at the top hidden layer, evaluated using a Cox proportional hazards model. The hazard ratios (HR) and 95% confidence intervals (95% CI) are listed with the corresponding p-values. Thirteen network nodes are statistically significant in overall survival prognosis.

Cox Model with Deep Learning Features	HR (95% CI)	p-Value
Network Node 0	1.26 (0.98–1.62)	0.0718
Network Node 1	1.13 (0.94–1.36)	0.1996
Network Node 2	1.03 (0.81–1.32)	0.7931
Network Node 3	1.15 (0.94–1.41)	0.1681
Network Node 4	0.73 (0.59–0.89)	0.0022
Network Node 5	0.95 (0.77–1.16)	0.5935
Network Node 6	1.13 (0.88–1.44)	0.341
Network Node 7	1.19 (0.97–1.46)	0.0929
Network Node 8	1.71 (1.40–2.08)	<0.0001
Network Node 9	1.02 (0.81–1.29)	0.8505
Network Node 10
  ≥1.6	0.45 (0.25–0.81)	0.0076
  <1.6 (reference)	1
Network Node 11	0.80 (0.66–0.96)	0.0197
Network Node 12	1.36 (1.11–1.68)	0.0034
Network Node 13	0.93 (0.77–1.14)	0.4994
Network Node 14	1.12 (0.88–1.42)	0.3495
Network Node 15	0.86 (0.67–1.10)	0.2324
Network Node 16	0.57 (0.45–0.72)	<0.0001
Network Node 17	1.35 (1.09–1.67)	0.0056
Network Node 18	0.80 (0.64–1.00)	0.0478
Network Node 19	0.78 (0.64–0.95)	0.0132
Network Node 20	0.91 (0.73–1.15)	0.4437
Network Node 21	1.34 (1.10–1.62)	0.0035
Network Node 22	1.16 (0.95–1.42)	0.1575
Network Node 23	0.77 (0.62–0.97)	0.0281
Network Node 24	1.87 (1.54–2.27)	<0.0001
Network Node 25	1.41 (1.10–1.80)	0.0063
Network Node 26	0.80 (0.65–1.00)	0.0476
Overall Model		<0.0001
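
As a reading aid for the HR and CI columns, the sketch below back-calculates the log-hazard coefficient, its standard error, and a two-sided p-value from one reported row. It assumes the intervals are Wald intervals, i.e., exp(β ± 1.96·SE), which the source does not state explicitly; the helper name `wald_summary` is hypothetical. Because the inputs are rounded to two decimals, recovered p-values are only approximate.

```python
import math


def wald_summary(hr, ci_low, ci_high, z_crit=1.96):
    """Recover beta, SE, z and a two-sided p-value from HR and its 95% CI.

    Assumes a Wald interval: CI = exp(beta +/- z_crit * SE).
    """
    beta = math.log(hr)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * z_crit)
    z = beta / se
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided normal tail probability
    return beta, se, z, p


# Network Node 8 from Table 2: HR 1.71 (1.40–2.08), reported p < 0.0001.
beta, se, z, p = wald_summary(1.71, 1.40, 2.08)

# Under the Wald assumption the HR is the geometric mean of the CI bounds.
hr_midpoint = math.sqrt(1.40 * 2.08)

print(f"beta={beta:.3f} se={se:.3f} z={z:.2f} p={p:.1e} midpoint={hr_midpoint:.2f}")
```

An HR above 1 (as for Node 8) means a higher node output is associated with higher hazard; an HR below 1 (as for Node 16) means it is associated with lower hazard.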

Table 3. Combined multivariate Cox proportional hazards model including clinical covariates and deep learning network node covariates to predict overall survival (overall model p-value < 0.0001). The hazard ratios (HR) and 95% confidence intervals (95% CI) are listed with the corresponding p-values.

Cox Model with Clinical Covariates and Deep Learning Features	HR (95% CI)	p-Value
Age
  ≥54 years old	1.50 (1.10–2.03)	0.0098
  <54 years old (reference)	1
Gender
  Male	1.25 (0.92–1.68)	0.1542
  Female (reference)	1
KPS
  ≥60	0.35 (0.17–0.72)	0.0042
  <60 (reference)	1
Therapy
  Chemoradiation	0.27 (0.12–0.62)	0.0018
  Chemotherapy	1.06 (0.35–3.17)	0.9193
  Radiation	0.51 (0.22–1.17)	0.1122
Subtype
  Proneural	1.70 (1.01–2.87)	0.0464
  Classical	1.26 (0.79–2.01)	0.3311
  Mesenchymal	1.41 (0.89–2.25)	0.1462
MGMT Methylated	1.18 (0.85–1.62)	0.3181
G-CIMP Methylated	1.03 (0.35–3.06)	0.9553
IDH1 R132C/R132G/R132H Mutation	1.08 (0.35–3.31)	0.8986
Network Node 0	1.09 (0.80–1.48)	0.5888
Network Node 1	1.10 (0.87–1.39)	0.4348
Network Node 2	1.15 (0.86–1.55)	0.3407
Network Node 3	1.11 (0.86–1.44)	0.4264
Network Node 4	0.77 (0.60–0.99)	0.0387
Network Node 5	0.85 (0.64–1.12)	0.2558
Network Node 6	1.12 (0.80–1.55)	0.514
Network Node 7	1.12 (0.86–1.44)	0.4041
Network Node 8	1.73 (1.36–2.21)	<0.0001
Network Node 9	1.07 (0.81–1.42)	0.6384
Network Node 10
  ≥1.6	0.44 (0.20–0.95)	0.0363
Network Node 11	0.86 (0.67–1.10)	0.2336
Network Node 12	1.27 (0.99–1.65)	0.0645
Network Node 13	1.04 (0.82–1.32)	0.7484
Network Node 14	1.10 (0.82–1.48)	0.5313
Network Node 15	0.79 (0.57–1.11)	0.1711
Network Node 16	0.64 (0.48–0.86)	0.0029
Network Node 17	1.55 (1.16–2.07)	0.0027
Network Node 18	0.79 (0.60–1.05)	0.1049
Network Node 19	0.86 (0.68–1.08)	0.2001
Network Node 20	0.74 (0.54–1.00)	0.0528
Network Node 21
  ≥1.6	0.96 (0.55–1.69)	0.8937
Network Node 22	1.22 (0.94–1.58)	0.1327
Network Node 23	0.75 (0.57–1.00)	0.048
Network Node 24	1.66 (1.30–2.12)	<0.0001
Network Node 25	1.49 (1.10–2.01)	0.0101
Network Node 26	0.83 (0.64–1.07)	0.1555
Overall Model		<0.0001

Table 4. The 39-gene signature, comprising the genes that occur in at least 10 of the 27 deep-learned network nodes at the top hidden layer (p < 0.01).

39-Gene Signature
TNR	UGT8	FGFR3	PTGDS
GAD1	AQP1	SLC1A2	RAB6B
TMSB15B	COL6A3	CA10	RAPGEF4
POSTN	ERBB3	CXCL14	RUNDC3A
SCG3	KCNQ2	CXorf57	SERPINA3
PLA2G2A	LTF	DPYSL4	SH3GL2
NNMT	MEOX2	EGFR	SNAP25
CHI3L1	PCDH9	F13A1	TCEAL2
ELAVL4	STMN2	FBN2	TIMP4
TF	FCGR2B	NEFM
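
The selection of Table 4 from the counts in Table 1 can be sketched as a simple frequency threshold. The counts below are copied from Table 1; the variable names (`gene_frequency`, `MIN_NODES`, `signature`) are illustrative, and the sketch only applies the cutoff of 10 nodes stated in the Figure 1 caption rather than recomputing the p < 0.01 threshold itself.

```python
# Occurrence counts copied from Table 1: every gene found in >= 10 of the
# 27 top-layer nodes, plus a few lower-frequency genes to show what the
# threshold excludes.
gene_frequency = {
    "TNR": 17, "GAD1": 16, "TMSB15B": 15, "POSTN": 15, "SCG3": 15,
    "PLA2G2A": 14, "NNMT": 13, "CHI3L1": 13, "ELAVL4": 13, "TF": 13,
    "UGT8": 13, "AQP1": 12, "COL6A3": 12, "ERBB3": 12, "KCNQ2": 12,
    "LTF": 12, "MEOX2": 12, "PCDH9": 12, "STMN2": 12, "FCGR2B": 11,
    "FGFR3": 11, "SLC1A2": 11, "CA10": 10, "CXCL14": 10, "CXorf57": 10,
    "DPYSL4": 10, "EGFR": 10, "F13A1": 10, "FBN2": 10, "NEFM": 10,
    "PTGDS": 10, "RAB6B": 10, "RAPGEF4": 10, "RUNDC3A": 10,
    "SERPINA3": 10, "SH3GL2": 10, "SNAP25": 10, "TCEAL2": 10, "TIMP4": 10,
    # Below the cutoff (a subset of the remaining Table 1 entries):
    "BCAN": 9, "NES": 9, "PROM1": 9, "MMP9": 8, "WIF1": 8,
}

MIN_NODES = 10  # cutoff from the Figure 1 caption (10 of 27 nodes)

# Keep genes at or above the cutoff, ordered by descending frequency.
signature = sorted(
    (gene for gene, count in gene_frequency.items() if count >= MIN_NODES),
    key=lambda gene: (-gene_frequency[gene], gene),
)

print(len(signature))  # 39
print(signature[:5])
```

Genes such as BCAN, with 9 occurrences, fall just below the cutoff and are therefore absent from Table 4 even though they appear in Table 1.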

Table 5. List of glioblastoma and low-grade glioma datasets used to validate the survival prognosis of the gene set discovered by deep learning.

Study Datasets	Samples	Source
Lee Nelson Glioblastoma GSE13041 GPL96	218	Lee [56]
Freije Nelson Glioblastoma GSE4412 GPL96	85	Freije [57]
Gravendeel French Glioblastoma GSE16011	284	Gravendeel [58]
Nutt Louis Glioblastoma BROAD	50	Nutt [59]
Murat Hegi Glioblastoma GSE7696	84	Murat [60]
Joo Kim Jin Kim Seol Nam Glioblastoma GSE42669	58	Joo [61]
Phillips Aldape Astrocytoma GSE4271 GPL96	100	Phillips [62]
Brain Low Grade Glioma TCGA 2016	110	TCGA
GBM-TCGA June 2016	148	TCGA
LGG-TCGA-Low Grade Gliomas June 2016	512	TCGA

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

