TY - JOUR
T1 - Simulation study to evaluate when Plasmode simulation is superior to parametric simulation in comparing classification methods on high-dimensional data
AU - Topic group “High-dimensional data” (TG9) of the STRATOS Initiative
AU - Stolte, Marieke
AU - Schreck, Nicholas
AU - Slynko, Alla
AU - Saadati, Maral
AU - Benner, Axel
AU - Rahnenführer, Jörg
AU - Bommert, Andrea
AU - Benner, Axel
AU - Binder, Harald
AU - Boulesteix, Anne Laure
AU - Dobbin, Kevin
AU - Hornung, Roman
AU - Lusa, Lara
AU - Michiels, Stefan
AU - Migliavacca, Eugenia
AU - Rahnenführer, Jörg
AU - Sauerbrei, Willi
AU - Ambrogi, Federico
AU - De Bin, Riccardo
AU - McShane, Lisa
N1 - Publisher Copyright:
© 2025 Stolte et al.
PY - 2025/6/1
Y1 - 2025/6/1
N2 - Simulation studies, especially neutral comparison studies, are crucial for evaluating and comparing statistical methods as they investigate whether methods work as intended and can guide an appropriate method choice. Typically, the term simulation refers to parametric simulation, i.e. computer experiments using pseudo-random numbers. For these, the full data-generating process (DGP) and outcome-generating model (OGM) are known within the simulation. However, the specification of realistic DGPs might be difficult in practice leading to oversimplified assumptions. The problem is more severe for higher-dimensional data as the number of parameters to specify typically increases with the number of variables in the data. Plasmode simulation, which is a combination of resampling covariates from a real-life dataset from the DGP of interest together with a specified OGM is often claimed to solve this problem since no explicit specification of the DGP is necessary. However, this claim is not well supported by empirical results. Here, parametric and Plasmode simulations are compared in the context of a method comparison study for binary classification methods. We focus on studies conducted with some specific data type or application in mind whose true, unknown data-generating mechanism is mimicked. The performance of Plasmode and parametric comparison studies for estimating classifier performance is compared as well as their ability to reproduce the true method ranking. The influence of misspecifications of the DGP on the results of parametric simulation and of misspecifications of the OGM on the results of parametric and Plasmode simulation are investigated. Moreover, different resampling strategies are compared for Plasmode comparison studies. The study finds that misspecifications of the DGP and OGM negatively influence the ability of the comparison studies to estimate the classification performances and method rankings. The best choice of the resampling strategy in Plasmode simulation depends on the concrete scenario.
AB - Simulation studies, especially neutral comparison studies, are crucial for evaluating and comparing statistical methods as they investigate whether methods work as intended and can guide an appropriate method choice. Typically, the term simulation refers to parametric simulation, i.e. computer experiments using pseudo-random numbers. For these, the full data-generating process (DGP) and outcome-generating model (OGM) are known within the simulation. However, the specification of realistic DGPs might be difficult in practice leading to oversimplified assumptions. The problem is more severe for higher-dimensional data as the number of parameters to specify typically increases with the number of variables in the data. Plasmode simulation, which is a combination of resampling covariates from a real-life dataset from the DGP of interest together with a specified OGM is often claimed to solve this problem since no explicit specification of the DGP is necessary. However, this claim is not well supported by empirical results. Here, parametric and Plasmode simulations are compared in the context of a method comparison study for binary classification methods. We focus on studies conducted with some specific data type or application in mind whose true, unknown data-generating mechanism is mimicked. The performance of Plasmode and parametric comparison studies for estimating classifier performance is compared as well as their ability to reproduce the true method ranking. The influence of misspecifications of the DGP on the results of parametric simulation and of misspecifications of the OGM on the results of parametric and Plasmode simulation are investigated. Moreover, different resampling strategies are compared for Plasmode comparison studies. The study finds that misspecifications of the DGP and OGM negatively influence the ability of the comparison studies to estimate the classification performances and method rankings. The best choice of the resampling strategy in Plasmode simulation depends on the concrete scenario.
UR - http://www.scopus.com/inward/record.url?scp=105007094594&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0322887
DO - 10.1371/journal.pone.0322887
M3 - Article
AN - SCOPUS:105007094594
SN - 1932-6203
VL - 20
JO - PLoS ONE
JF - PLoS ONE
IS - 6 June
M1 - e0322887
ER -