Prediction of breast cancer treatment–induced fatigue by machine learning using genome-wide association data

Sangkyu Lee, Joseph O. Deasy, Jung Hun Oh, Antonio Di Meglio, Agnes Dumas, Gwenn Menvielle, Cecile Charles, Sandrine Boyault, Marina Rousseau, Celine Besse, Emilie Thomas, Anne Boland, Paul Cottu, Olivier Tredan, Christelle Levy, Anne Laure Martin, Sibille Everhard, Patricia A. Ganz, Ann H. Partridge, Stefan MichielsJean François Deleuze, Fabrice Andre, Ines Vaz-Luis

    Résultats de recherche: Contribution à un journalArticleRevue par des pairs

    16 Citations (Scopus)

    Résumé

    Background: We aimed at predicting fatigue after breast cancer treatment using machine learning on clinical covariates and germline genome-wide data. Methods: We accessed germline genome-wide data of 2799 early-stage breast cancer patients from the Cancer Toxicity study (NCT01993498). The primary endpoint was defined as scoring zero at diagnosis and higher than quartile 3 at 1 year after primary treatment completion on European Organization for Research and Treatment of Cancer quality-of-life questionnaires for Overall Fatigue and on the multidimensional questionnaire for Physical, Emotional, and Cognitive fatigue. First, we tested univariate associations of each endpoint with clinical variables and genome-wide variants. Then, using preselected clinical (false discovery rate < 0.05) and genomic (P < .001) variables, a multivariable preconditioned random-forest regression model was built and validated on a hold-out subset to predict fatigue. Gene set enrichment analysis identified key biological correlates (MetaCore). All statistical tests were 2-sided. Results: Statistically significant clinical associations were found only with Emotional and Cognitive Fatigue, including receipt of chemotherapy, anxiety, and pain. Some single nucleotide polymorphisms had some degree of association (P < .001) with the different fatigue endpoints, although there were no genome-wide statistically significant (P < 5.00 × 10-8) associations. Only for Cognitive Fatigue, the predictive ability of the genomic multivariable model was statistically significantly better than random (area under the curve ¼ 0.59, P ¼ .01) and marginally improved with clinical variables (area under the curve ¼ 0.60, P ¼ .005). Single nucleotide polymorphisms found to be associated (P < .001) with Cognitive Fatigue belonged to genes linked to inflammation (false discovery rate adjusted P ¼ .03), cognitive disorders (P ¼ 1.51 × 10-12), and synaptic transmission (P ¼ 6.28 × 10-8). Conclusions: Genomic analyses in this large cohort of breast cancer survivors suggest a possible genetic role for severe Cognitive Fatigue that warrants further exploration.

    langue originaleAnglais
    Numéro d'articlePKAA039
    journalJNCI Cancer Spectrum
    Volume4
    Numéro de publication5
    Les DOIs
    étatPublié - 1 janv. 2020

    Contient cette citation