Which econometric method should you use for health policy causal inference?
Jason Shafrin
Senior Managing Director, Center for Healthcare Economics & Policy at FTI Consulting; Adjunct Professor, University of Southern California
TL;DR
A paper by Ress and Wild (2024) provide the following recommendations in answering this question.
How did they arrive at these recommendations? To find out, read on.
Description of plasmode simulation on study methodology
To answer the question "
", one has to make a number of research decisions.?
First, one must decide whether to simulate the effect of a policy intervention or incorporate real-world data into the simulation.? The advantage of the former approach is that we know the truth and can create any data generating scenario we want; because we (the researcher) have ourselves constructed the data generating process, we have a gold standard to compare against and can test out various data generating processes.? The problem with this approach, is its hypothetical nature.? Specifically, Ress and Wild (2024) write:
Many simulation studies…are characterized by relatively simple confounding structures with few variables, leading to varying results depending on the data structure modeled and the methods under consideration...Because the optimal choice for an estimation strategy depends on the research question, data features, population characteristics and method assumptions, simulation results are only applicable to the specific simulation setting.
Instead, the authors opt for a plasmode simulation. What is a plasmode simulation?
In a plasmode simulation, the covariates from a real dataset are used without alteration, while the values for the outcome variables are simulated based on the estimated associations between covariates and outcomes from the original data, ensuring that the true effect size is known. The advantage of this approach is that it preserves the high‐dimensional and complex covariate structure of the source data, providing a simulation environment that closely resembles real‐world conditions.
In short, while the underlying covariates are not changed, researchers can test the robustness of different estimation methods through controlled modifications to the real dataset, such as artificially inserting or removing certain relationships, introducing or removing biases, adding noise, or altering specific variables. This allows for the controlled examination of how statistical methods perform under different known conditions.
A second research decision is to determine which estimation methods should be evaluated. Ress and Wild (2024) consider the following approaches:
领英推荐
Third, the authors must consider how to estimate nuisance parameters. The key nuisance parameters are the propensity score and the outcome model. Estimation of the nuisance parameters was performed using the superlearner package.
...we used the superlearner algorithm implemented in the SuperLearner [R] package (Polley et al., 2021), which allowed us to incorporate non‐parametric approaches. We included the following five algorithms as baselearners: generalized linear model with penalized maximum likelihood (glmnet function) (Friedman et al., 2010), random forest (ranger function) (Wright & Ziegler, 2017), gradient boosting (xgboost function) (Chen et al., 2015), support vector machines (svm function) (...Karatzoglou et al., 2006), and multivariate adaptive regression splines (earth function) (Friedman, 1991).
Fourth, one must consider a specific intervention to evaluate and how to simulate the data. The intervention the authors considered was an German initiative aiming to improve health care in a socially deprived urban area. Specifically, the intervention included (i) cross-sectoral network of health, social and community care providers and (ii) a community health advice and navigation service. (for more details see Rees and Wild 2023). To simulate the plasmode data for this intervention, Ress and Wild (2024) use the following procedure:
Fifth, one must determine the set of performance metrics to use to evaluate the study. The performance metrics considered included:
Based on this approach, the authors find that there is no clear winner:.
We found that TMLE combined with the superlearner performed best in terms of bias and SE, but exhibited shortcomings in terms of CI coverage. When considering all performance measures and outcomes, the combination of matching and subsequent DiD analysis in conjunction with regression for nuisance parameter estimation performed best.
What are the takeaways from this research? The authors nicely lay this out at the end of their article:
You can read the full article here. What do you think of the use of plasmode simulations?
Originally posted at Healthcare Economist.?
The views expressed herein are those of the author and not necessarily the views of?FTI Consulting, Inc., its management, its subsidiaries, its affiliates, or its other professionals.
CEO @ Gencomm.ai - Disruptive AI & ML-powered pricing product suite
1 个月Hey Jason, nice post. A few things to think about: 1) The ensemble "superlearner" should have the property that it will fit at least as well as any base learner it uses, so for low-dimensional data, why not always use the super learner and include base learners that do well these sorts of data? I don't understand why we wouldn't always use the most powerful approach to estimating nuisance parameters, rather than leave it to a researcher decision to switch between regression and approaches that should be guaranteed to beat it. 2) I see they used the R package for the ensemble learner. I would encourage interested folks to look at other automated ML algorithms that follow this base approach (H20, Autogluon, etc.) -- they all handle the encoding of data differently and when you have dates and strings in your data, this will really matter.
Empleado administrativo en Honorable Cámara de Diputados de la Nación
1 个月Hello. Where do you see the next pandemic going to start? What are the traits? What do you have in mind to solve it? How much do vaccines cost financially? Thanks for sharing. Hug from Argentina.