TY - GEN
T1 - On the Use of Neural Networks with Censored Time-to-Event Data
AU - Roblin, Elvire
AU - Cournede, Paul Henry
AU - Michiels, Stefan
N1 - Publisher Copyright:
© 2020, Springer Nature Switzerland AG.
PY - 2020/1/1
Y1 - 2020/1/1
N2 - Motivation: The objective of this work is to confront artificial neural network models with time-to-event data, using specific ways to handle censored observations such as pseudo-observations and tailored loss functions. Methods: Different neural network models were compared. Cox-CC (Kvamme et al., 2019) uses a loss function based on a case-control approximation. DeepHit (Lee et al., 2019) is a model that estimates the probability mass function and combines log-likelihood with a ranking loss. DNNSurv (Zhao et al., 2019) circumvents the problem of censoring by using pseudo-observations. We also proposed other ways of computing pseudo-observations. We investigated the prediction ability of these models using data simulated from an Accelerated Failure Time model (Friedman et al., 2001), with different censoring rates. We simulated 100 data sets of 4,000 samples and 20 variables each, with pairwise interactions and non-linear effects of random subsets of these variables. Models were compared using the concordance index and integrated Brier score. We applied the methods to the METABRIC breast cancer data set, including 1,960 patients, 6 clinical covariates and the expression of 863 genes. Results: In the simulation study, we obtained the highest c-indices and lower integrated Brier score with CoxTime for low censoring and pseudo-discrete with high censoring. On the METABRIC data, the neural networks obtained comparable 5-year and 10-year discrimination performances with slightly higher values for the models based on optimised pseudo-observations. Availability: https://github.com/eroblin/NN_Pseudobs
AB - Motivation: The objective of this work is to confront artificial neural network models with time-to-event data, using specific ways to handle censored observations such as pseudo-observations and tailored loss functions. Methods: Different neural network models were compared. Cox-CC (Kvamme et al., 2019) uses a loss function based on a case-control approximation. DeepHit (Lee et al., 2019) is a model that estimates the probability mass function and combines log-likelihood with a ranking loss. DNNSurv (Zhao et al., 2019) circumvents the problem of censoring by using pseudo-observations. We also proposed other ways of computing pseudo-observations. We investigated the prediction ability of these models using data simulated from an Accelerated Failure Time model (Friedman et al., 2001), with different censoring rates. We simulated 100 data sets of 4,000 samples and 20 variables each, with pairwise interactions and non-linear effects of random subsets of these variables. Models were compared using the concordance index and integrated Brier score. We applied the methods to the METABRIC breast cancer data set, including 1,960 patients, 6 clinical covariates and the expression of 863 genes. Results: In the simulation study, we obtained the highest c-indices and lower integrated Brier score with CoxTime for low censoring and pseudo-discrete with high censoring. On the METABRIC data, the neural networks obtained comparable 5-year and 10-year discrimination performances with slightly higher values for the models based on optimised pseudo-observations. Availability: https://github.com/eroblin/NN_Pseudobs
KW - Neural networks
KW - Pseudo-observations
KW - Survival data
UR - http://www.scopus.com/inward/record.url?scp=85097806438&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-64511-3_6
DO - 10.1007/978-3-030-64511-3_6
M3 - Conference contribution
AN - SCOPUS:85097806438
SN - 9783030645106
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 56
EP - 67
BT - Mathematical and Computational Oncology - Second International Symposium, ISMCO 2020, 2020, Proceedings
A2 - Bebis, George
A2 - Alekseyev, Max
A2 - Cho, Heyrim
A2 - Gevertz, Jana
A2 - Rodriguez Martinez, Maria
PB - Springer Science and Business Media Deutschland GmbH
T2 - 2nd International Symposium on Mathematical and Computational Oncology, ISMCO 2020
Y2 - 8 October 2020 through 10 October 2020
ER -