Prediction models in multicenter studies: methodological aspects and current state of the art

Wynants, Laure; Van Huffel, Sabine; Van Calster, Ben

doi:10.1186/2049-3258-73-S1-O3

Volume 73 Supplement 1

Methods in Epidemiology Symposium

Oral presentation
Open access
Published: 17 September 2015

Prediction models in multicenter studies: methodological aspects and current state of the art

Laure Wynants¹,
Sabine Van Huffel¹ &
Ben Van Calster¹

Archives of Public Health volume 73, Article number: O3 (2015) Cite this article

1232 Accesses
Metrics details

Increasingly, multicenter datasets are being used to develop or evaluate clinical risk prediction models. Such models estimate an individual's probability that a certain disease or condition is present (diagnostic model) or that an event will occur in the future (prognostic model). Although multicenter studies enhance the generalizability of the model, the clustered nature of the data poses several methodological challenges. We will provide an up to date overview of good practices to overcome these challenges.

When determining the required sample size, the number of events per candidate variable (EPV) is crucial to prevent overfitting when building a prediction model. We extend the EPV guidelines to multicenter studies, acknowledging the clustered nature of the data. During data collection, measurements of variables may differ between centers due to various reasons, such as subjectivity of measurements, differences in equipment and differences in patient populations. We show how the residual intraclass correlation can be used to quantify the intercenter variability. When building a prediction model, the clustered nature of the data should be taken into account during the data analysis, e.g. by using mixed effect models and variables at the center level. Only mixed effect regression can result in a model that is simultaneously calibrated (i.e. gives accurate predicted probabilities) at the center level and the population level. We give the example of the ADNEX model that was built to distinguish between several types of adnexal masses. In the end, the performance of models may differ between centers. We present how to evaluate the predictive performance of models in clustered data and show extensions to existing techniques to evaluate discrimination, calibration and clinical utility, among others by the use of meta-analytic techniques.

Author information

Authors and Affiliations

KU Leuven, Leuven, Belgium
Laure Wynants, Sabine Van Huffel & Ben Van Calster

Authors

Laure Wynants
View author publications
You can also search for this author in PubMed Google Scholar
Sabine Van Huffel
View author publications
You can also search for this author in PubMed Google Scholar
Ben Van Calster
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Wynants, L., Van Huffel, S. & Van Calster, B. Prediction models in multicenter studies: methodological aspects and current state of the art. Arch Public Health 73 (Suppl 1), O3 (2015). https://doi.org/10.1186/2049-3258-73-S1-O3

Download citation

Published: 17 September 2015
DOI: https://doi.org/10.1186/2049-3258-73-S1-O3

Methods in Epidemiology Symposium

Prediction models in multicenter studies: methodological aspects and current state of the art

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Keywords

Archives of Public Health

Contact us

Methods in Epidemiology Symposium

Prediction models in multicenter studies: methodological aspects and current state of the art

Author information

Authors and Affiliations

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Archives of Public Health

Contact us