TY - JOUR
T1 - Assessing risk prediction models using individual participant data from multiple studies
AU - The Emerging Risk Factors Collaboration
AU - Pennells, Lisa
AU - Kaptoge, Stephen
AU - White, Ian R.
AU - Thompson, Simon G.
AU - Wood, Angela M.
AU - Tipping, Robert W.
AU - Folsom, Aaron R.
AU - Couper, David J.
AU - Ballantyne, Christie M.
AU - Coresh, Josef
AU - Goya Wannamethee, S.
AU - Morris, Richard W.
AU - Kiechl, Stefan
AU - Willeit, Johann
AU - Willeit, Peter
AU - Schett, Georg
AU - Ebrahim, Shah
AU - Lawlor, Debbie A.
AU - Yarnell, John W.
AU - Gallacher, John
AU - Cushman, Mary
AU - Psaty, Bruce M.
AU - Tracy, Russ
AU - Tybjærg-Hansen, Anne
AU - Frikke-Schmidt, Ruth
AU - Benn, Marianne
AU - Nordestgaard, Børge G.
AU - Price, Jackie F.
AU - Lee, Amanda J.
AU - McLachlan, Stela
AU - Khaw, Kay Tee
AU - Wareham, Nicholas J.
AU - Brenner, Hermann
AU - Schöttker, Ben
AU - Müller, Heiko
AU - Rothenbacher, Dietrich
AU - Jansson, Jan Håkan
AU - Wennberg, Patrik
AU - Salomaa, Veikko
AU - Harald, Kennet
AU - Jousilahti, Pekka
AU - Vartiainen, Erkki
AU - Woodward, Mark
AU - D'Agostino, Ralph B.
AU - Wolf, Philip A.
AU - Vasan, Ramachandran S.
AU - Benjamin, Emelia J.
AU - Bladbjerg, Else Marie
AU - Jørgensen, Torben
AU - Ninomiya, Toshiharu
N1 - Funding Information:
This work was supported the United Kingdom Medical Research Council (grant G0701619 and Unit Programme U105260558). The Emerging Risk Factors Collaboration Coordinating Centre was supported by the British Heart Foundation (grant RG/08/014), the Medical Research Council, the United Kingdom National Institute of Health Research Cambridge Biomedical Research Centre, a specific grant from the Bupa Foundation, and an unrestricted educational grant from GlaxoSmithKline. Various sources have supported recruitment, follow-up, and laboratory measurements in the cohorts contributing to the Emerging Risk Factors Collaboration. Investigators in several of these studies have contributed to a list of relevant funding sources (http://ceu.phpc.cam.ac.uk/research/erfc/studies/).
Publisher Copyright:
© The Author 2013.
PY - 2014/3/1
Y1 - 2014/3/1
N2 - Individual participant time-to-event data from multiple prospective epidemiologic studies enable detailed investigation into the predictive ability of risk models. Here we address the challenges in appropriately combining such information across studies. Methods are exemplified by analyses of log C-reactive protein and conventional risk factors for coronary heart disease in the Emerging Risk Factors Collaboration, a collation of individual data from multiple prospective studies with an average follow-up duration of 9.8 years (dates varied).We derive risk prediction models using Cox proportional hazards regression analysis stratified by study and obtain estimates of risk discrimination, Harrell’s concordance index, and Royston’s discrimination measure within each study; we then combine the estimates across studies using aweighted meta-analysis. Various weighting approaches are compared and lead us to recommend using the number of events in each study. We also discuss the calculation of measures of reclassification for multiple studies. We further show that comparison of differences in predictive ability across subgroups should be based only on within-study information and that combining measures of risk discrimination from casecontrol studies and prospective studies is problematic. The concordance index and discrimination measure gave qualitatively similar results throughout. While the concordance index was very heterogeneous between studies, principally because of differing age ranges, the increments in the concordance index from adding log C-reactive protein to conventional risk factors were more homogeneous.
AB - Individual participant time-to-event data from multiple prospective epidemiologic studies enable detailed investigation into the predictive ability of risk models. Here we address the challenges in appropriately combining such information across studies. Methods are exemplified by analyses of log C-reactive protein and conventional risk factors for coronary heart disease in the Emerging Risk Factors Collaboration, a collation of individual data from multiple prospective studies with an average follow-up duration of 9.8 years (dates varied).We derive risk prediction models using Cox proportional hazards regression analysis stratified by study and obtain estimates of risk discrimination, Harrell’s concordance index, and Royston’s discrimination measure within each study; we then combine the estimates across studies using aweighted meta-analysis. Various weighting approaches are compared and lead us to recommend using the number of events in each study. We also discuss the calculation of measures of reclassification for multiple studies. We further show that comparison of differences in predictive ability across subgroups should be based only on within-study information and that combining measures of risk discrimination from casecontrol studies and prospective studies is problematic. The concordance index and discrimination measure gave qualitatively similar results throughout. While the concordance index was very heterogeneous between studies, principally because of differing age ranges, the increments in the concordance index from adding log C-reactive protein to conventional risk factors were more homogeneous.
UR - http://www.scopus.com/inward/record.url?scp=84896692640&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84896692640&partnerID=8YFLogxK
U2 - 10.1093/aje/kwt298
DO - 10.1093/aje/kwt298
M3 - Article
C2 - 24366051
AN - SCOPUS:84896692640
SN - 0002-9262
VL - 179
SP - 621
EP - 632
JO - American journal of epidemiology
JF - American journal of epidemiology
IS - 5
ER -