Skip to Main Content
2,082
Views
49
CrossRef citations to date
Altmetric

Applications and Case Studies

Stable Weights that Balance Covariates for Estimation With Incomplete Outcome Data

Pages 910-922
Received 01 Mar 2014
Accepted author version posted online: 01 Apr 2015
Published online:07 Nov 2015
 
Translator disclaimer

Weighting methods that adjust for observed covariates, such as inverse probability weighting, are widely used for causal inference and estimation with incomplete outcome data. Part of the appeal of such methods is that one set of weights can be used to estimate a range of treatment effects based on different outcomes, or a variety of population means for several variables. However, this appeal can be diminished in practice by the instability of the estimated weights and by the difficulty of adequately adjusting for observed covariates in some settings. To address these limitations, this article presents a new weighting method that finds the weights of minimum variance that adjust or balance the empirical distribution of the observed covariates up to levels prespecified by the researcher. This method allows the researcher to balance very precisely the means of the observed covariates and other features of their marginal and joint distributions, such as variances and correlations and also, for example, the quantiles of interactions of pairs and triples of observed covariates, thus, balancing entire two- and three-way marginals. Since the weighting method is based on a well-defined convex optimization problem, duality theory provides insight into the behavior of the variance of the optimal weights in relation to the level of covariate balance adjustment, answering the question, how much does tightening a balance constraint increases the variance of the weights? Also, the weighting method runs in polynomial time so relatively large datasets can be handled quickly. An implementation of the method is provided in the new package sbw for R. This article shows some theoretical properties of the resulting weights and illustrates their use by analyzing both a dataset from the 2010 Chilean earthquake and a simulated example.

Additional information

Notes on contributors

José R. Zubizarreta

José R. Zubizarreta, Division of Decision, Risk and Operations, and Statistics Department, Columbia University, 3022 Broadway, 417 Uris Hall, New York, NY 10027 (E-mail: zubizarreta@columbia.edu). The author acknowledges very helpful comments from three referees, an associate editor, and Joseph Ibrahim. The author also thanks Andrew Gelman, Kosuke Imai, Marshall Joffe, Arian Maleki, Luke Miratrix, Marc Ratkovic, María de los Ángeles Resa, James Robins, Paul Rosenbaum, Richard Samworth, Rajen Shah, Chris Skinner, and Dylan Small for valuable help at different stages of this article. This work was supported by a grant from the Alfred P. Sloan Foundation.

Login options

Purchase * Save for later
Online

Article Purchase 24 hours to view or download: USD 44.00 Add to cart

Issue Purchase 30 days to view or download: USD 268.00 Add to cart

* Local tax will be added as applicable