Augmenting Pre-Analysis Plans with Machine Learning
Jens Ludwig, Sendhil Mullainathan, and Jann Spiess
AEA Papers and Proceedings.
May 2019, Vol. 109, No. :
Pages 71-76
Augmenting Pre-Analysis Plans with Machine Learning†
JensLudwig1, SendhilMullainathan2 and JannSpiess3
1University of Chicago, 1155 East 60th Street, Chicago, IL 60637, and NBER (email: [email protected])
2Chicago Booth School of Business, 5807 South Woodlawn Avenue, Chicago, IL 60637, and NBER (email: [email protected])
3Microsoft Research New England, 1 Memorial Drive, Cambridge, MA 02142 (email: [email protected])
Abstract
Concerns about the dissemination of spurious results have led to calls for pre-analysis plans (PAPs) to avoid ex-post “p-hacking.” But often the conceptual hypotheses being tested do not imply the level of specificity required for a PAP. In this paper we suggest a framework for PAPs that capitalize on the availability of causal machine-learning (ML) techniques, in which researchers combine specific aspects of the analysis with ML for the flexible estimation of unspecific remainders. A “cheap-lunch” result shows that the inclusion of ML produces limited worst-case costs in power, while offering a substantial upside from systematic specification searches.