Quant Market Econ (2006) 4:267–287
DOI 10.1007/s11129-005-9000-y

Data pruning in consumer choice models

Elaine L. Zanutto · Eric T. Bradlow

Received: 15 April 2004 / Accepted: 2 August 2005 / Published online: 8 August 2006
© Springer Science + Business Media, LLC 2006

Abstract Common, if not ubiquitous, Marketing practice when estimating models for scanner panel data is to: (a) observe the data, (b) prune the data to a "manageable" number of brands or SKUs, and (c) fit models to the remaining data. We demonstrate that such pruning practice can lead to significantly different (and potentially biased) elasticities, and hence different managerial/practical outcomes, especially in the context of model misspecification. We first justify our claims theoretically by writing the general problem in a classic missing-data framework and demonstrate that commonly used pruning mechanisms (gleaned from current academic Marketing literature) can lead to a nonignorable missing data mechanism. Secondly, we summarize an extensive set of simulations that were run to understand the driving factors of that bias. The results indicate much greater pruning bias in those cases where model fit is poor (small $R^2$), random utility errors are correlated with the covariates, or the model is misspecified (e.g., a homogeneous logit is specified when a mixed-logit is true). Empirically, we also demonstrate our findings on the well-cited and highly utilized fabric softener data of Fader and Hardie (1996). Our empirical findings suggest a number of estimates that vary according to the way in which the data is pruned, including the magnitude of market mix and attribute elasticities, and purchase probabilities, but that the pruning effect is smaller for better fitting models.

Keywords Nonignorable missing data mechanism · Probabilistic sampling · Scanner panel data · Multinomial logit models · Mixed-logit models

E. L. Zanutto (✉)
Statistics Department, The Wharton School of the University of Pennsylvania, 466 Jon M. Huntsman Hall, 3730 Walnut Street, Philadelphia, PA 19104, USA
e-mail: zanutto@wharton.upenn.edu

E. T. Bradlow
Marketing Department, The Wharton School of the University of Pennsylvania, 700 Jon M. Huntsman Hall, 3730 Walnut Street, Philadelphia, PA 19104, USA
e-mail: ebradlow@wharton.upenn.edu
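
As a rough illustration of steps (a) to (c) in the abstract, the following Python sketch simulates brand choices and then prunes the data to the largest-share brands. It is only a toy under stated assumptions, not the authors' simulation design: the homogeneous logit, the intercepts, the price range, the price coefficient, and the "keep the top k brands by share" rule are all hypothetical choices made for illustration, as are the helper names `simulate_choices` and `prune_top_k`. The point it tries to make concrete is that, under this rule, whether a purchase occasion is dropped is a function of the observed outcome (the chosen brand), which is the kind of dependence that can make a missing-data mechanism nonignorable.

```python
import math
import random


def simulate_choices(n_obs, intercepts, price_coef, rng):
    """Draw brand choices from a homogeneous multinomial logit (assumed toy model)."""
    n_brands = len(intercepts)
    brands = list(range(n_brands))
    data = []
    for _ in range(n_obs):
        # Hypothetical covariate: one price per brand on each purchase occasion.
        prices = [rng.uniform(1.0, 3.0) for _ in brands]
        utils = [a + price_coef * p for a, p in zip(intercepts, prices)]
        top = max(utils)  # subtract the max for numerical stability
        weights = [math.exp(u - top) for u in utils]
        choice = rng.choices(brands, weights=weights, k=1)[0]
        data.append((prices, choice))
    return data


def prune_top_k(data, k):
    """Keep occasions whose chosen brand is among the k largest-share brands.

    This share-based rule is an assumed example of a pruning mechanism.
    """
    counts = {}
    for _, choice in data:
        counts[choice] = counts.get(choice, 0) + 1
    kept_brands = set(sorted(counts, key=counts.get, reverse=True)[:k])
    retained = [
        # Restrict the choice set to the kept brands as well.
        ({b: prices[b] for b in kept_brands}, choice)
        for prices, choice in data
        if choice in kept_brands
    ]
    return kept_brands, retained


rng = random.Random(0)  # fixed seed so the sketch is reproducible
intercepts = [1.0, 0.5, 0.0, -0.5, -1.0]  # hypothetical brand intercepts
data = simulate_choices(2000, intercepts, price_coef=-1.0, rng=rng)
kept_brands, retained = prune_top_k(data, k=3)
dropped = len(data) - len(retained)

print("kept brands:", sorted(kept_brands))
print("share of purchase occasions dropped:", dropped / len(data))
```

Fitting a choice model to `retained` instead of `data` would correspond to step (c); comparing the two sets of estimates is one way to gauge how sensitive elasticities are to the pruning rule.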