TY - JOUR
T1 - Modeling multi-level survival data in multi-center epidemiological cohort studies
T2 - Applications from the ELAPSE project
AU - Samoli, Evangelia
AU - Rodopoulou, Sophia
AU - Hvidtfeldt, Ulla A.
AU - Wolf, Kathrin
AU - Stafoggia, Massimo
AU - Brunekreef, Bert
AU - Strak, Maciej
AU - Chen, Jie
AU - Andersen, Zorana J.
AU - Atkinson, Richard
AU - Bauwelinck, Mariska
AU - Bellander, Tom
AU - Brandt, Jørgen
AU - Cesaroni, Giulia
AU - Forastiere, Francesco
AU - Fecht, Daniela
AU - Gulliver, John
AU - Hertel, Ole
AU - Hoffmann, Barbara
AU - de Hoogh, Kees
AU - Janssen, Nicole A.H.
AU - Ketzel, Matthias
AU - Klompmaker, Jochem O.
AU - Liu, Shuo
AU - Ljungman, Petter
AU - Nagel, Gabriele
AU - Oftedal, Bente
AU - Pershagen, Göran
AU - Peters, Annette
AU - Raaschou-Nielsen, Ole
AU - Renzi, Matteo
AU - Kristoffersen, Doris T.
AU - Severi, Gianluca
AU - Sigsgaard, Torben
AU - Vienneau, Danielle
AU - Weinmayr, Gudrun
AU - Hoek, Gerard
AU - Katsouyanni, Klea
N1 - Publisher Copyright:
© 2021 The Authors
PY - 2021/2/1
Y1 - 2021/2/1
N2 - Background: We evaluated methods for the analysis of multi-level survival data using a pooled dataset of 14 cohorts participating in the ELAPSE project investigating associations between residential exposure to low levels of air pollution (PM2.5 and NO2) and health (natural-cause mortality and cerebrovascular, coronary and lung cancer incidence). Methods: We applied five approaches in a multivariable Cox model to account for the first level of clustering corresponding to cohort specification: (1) not accounting for the cohort or using (2) indicator variables, (3) strata, (4) a frailty term in frailty Cox models, (5) a random intercept under a mixed Cox, for cohort identification. We accounted for the second level of clustering due to common characteristics in the residential area by (1) a random intercept per small area or (2) applying variance correction. We assessed the stratified, frailty and mixed Cox approach through simulations under different scenarios for heterogeneity in the underlying hazards and the air pollution effects. Results: Effect estimates were stable under approaches used to adjust for cohort but substantially differed when no adjustment was applied. Further adjustment for the small area grouping increased the effect estimates’ standard errors. Simulations confirmed identical results between the stratified and frailty models. In ELAPSE we selected a stratified multivariable Cox model to account for between-cohort heterogeneity without adjustment for small area level, due to the small number of subjects and events in the latter. Conclusions: Our study supports the need to account for between-cohort heterogeneity in multi-center collaborations using pooled individual level data.
AB - Background: We evaluated methods for the analysis of multi-level survival data using a pooled dataset of 14 cohorts participating in the ELAPSE project investigating associations between residential exposure to low levels of air pollution (PM2.5 and NO2) and health (natural-cause mortality and cerebrovascular, coronary and lung cancer incidence). Methods: We applied five approaches in a multivariable Cox model to account for the first level of clustering corresponding to cohort specification: (1) not accounting for the cohort or using (2) indicator variables, (3) strata, (4) a frailty term in frailty Cox models, (5) a random intercept under a mixed Cox, for cohort identification. We accounted for the second level of clustering due to common characteristics in the residential area by (1) a random intercept per small area or (2) applying variance correction. We assessed the stratified, frailty and mixed Cox approach through simulations under different scenarios for heterogeneity in the underlying hazards and the air pollution effects. Results: Effect estimates were stable under approaches used to adjust for cohort but substantially differed when no adjustment was applied. Further adjustment for the small area grouping increased the effect estimates’ standard errors. Simulations confirmed identical results between the stratified and frailty models. In ELAPSE we selected a stratified multivariable Cox model to account for between-cohort heterogeneity without adjustment for small area level, due to the small number of subjects and events in the latter. Conclusions: Our study supports the need to account for between-cohort heterogeneity in multi-center collaborations using pooled individual level data.
KW - Air pollution
KW - Cox model
KW - Frailty models
KW - Health effects
KW - Mixed models
KW - Multi-level analysis
UR - http://www.scopus.com/inward/record.url?scp=85098956014&partnerID=8YFLogxK
U2 - 10.1016/j.envint.2020.106371
DO - 10.1016/j.envint.2020.106371
M3 - Article
C2 - 33422970
AN - SCOPUS:85098956014
SN - 0160-4120
VL - 147
JO - Environment International
JF - Environment International
M1 - 106371
ER -