2450 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 57, NO. 10, OCTOBER 2012 Choice Prediction With Semideﬁnite Optimization When Utilities are Correlated Vinit Kumar Mishra, Karthik Natarajan, Member, IEEE, Hua Tao, and Chung-Piaw Teo Abstract—We consider the problem of making choice prediction by optimizing the expectation of maximum utility in discrete choice situations, and propose a discrete choice model which generates choice probabilities through a semideﬁnite program. The choice model, termed as the cross moment model (CMM) is parsimonious in that it uses only the mean and covariance information of the util- ities. It is encouraging that CMM generates reasonable choice es- timates when utilities are correlated even though no distributional assumptions on random utilities are made. We present a few examples in route choice setting and random walk to test the quality of choice prediction using CMM. By capturing correlations among utilities, CMM avoids some of the common behavioral limitations, such as the Independence of Irrel- evant Alternatives and the Invariate Proportion of Substitution, present in several discrete choice models. Being a convex opti- mization problem, it obviates the use of exhaustive simulation in computing choice probabilities for which the multinomial probit (MNP) model is often criticized. CMM can be easily used in design problems such as product-line selection and assortment planning. We exploit CMM to solve a ﬂexible packaging design problem faced by online retailers and warehouses. We use the data provided by a local service part supplier for this design problem and compare the results with Multinomial Logit and MNP models. We ﬁnd that CMM not only captures utility correlations in this problem and provides good designs but also has computational advantages over MNP which uses simulation. Index Terms—Cross moment model (CMM), multinomial probit (MNP). I. INTRODUCTION C ONSIDER a set of alternatives and a set of consumers where each consumer is indexed with at- tributes . The utility that the consumer associates with alterna- tive is . From the point of view of a researcher, this utility is random. Under the standard additive random utility model, the utility of alternative is speciﬁed as Manuscript received April 01, 2010; revised February 07, 2011; accepted September 19, 2011. Date of publication August 02, 2012; date of current ver- sion September 21, 2012. Recommended by Associate Editor C.-H. Chen. V. K. Mishra, H. Tao, and C.-P. Teo are with the Department of Deci- sion Sciences, NUS Business School, National University of Singapore, Singapore, 117591 (e-mail: vinit_mishra@nus.edu.sg; bizteocp@nus.edu.sg; jenny.sendero@gmail.com). K. Natarajan is with the Department of Engineering Systems and De- sign, Singapore University of Technology and Design, Singapore (e-mail: natarajan_karthik@sutd.edu.sg). Color versions of one or more of the ﬁgures in this paper are available online at http://ieeexplore.ieee.org. Digital Object Identiﬁer 10.1109/TAC.2012.2211175 The deterministic component of the utility is the part which the researcher can describe through various observable alternative attributes such as price, cost, etc., and consumer at- tributes such as age, income, etc. All the unobserved factors (id- iosyncracies) are captured through the random component of the utility . Each consumer chooses the alternative with the highest utility with the researcher interested in estimating the probability of a consumer choosing an alternative. The prob- ability that a consumer with attributes chooses alternative (choice probability) is Based on the assumptions on the joint distribution of the random error terms, varying choice models can be generated that are consistent with the utility maximizing behavior of consumers. For ease of exposition, we focus on customers with ﬁxed attributes and suppress the dependence of the results on . When are i.i.d. type-1 extreme value distributed random variables, the choice model is the Multinomial Logit (MNL) model, popularized by Luce [20] and McFadden [22], among others. Under MNL, the cumulative distribution function of each error term is assumed to be with the choice probability equal to This choice model however suffers from the Independence of Irrelevant Alternatives (IIA) property, namely, the ratio of choice probabilities for any two alternatives is unaffected by the presence of other alternatives. When alternatives share many common attributes, MNL tends to exaggerate the market shares. There are plenty of examples in practice where the utility evaluation is not independent across alternatives. In transportation choice problems such as airline networks (Bront et al. [5]), slight variation in features are used to distinguish alternatives. In general, in these problem settings, different resources are combined to provide for the conﬁguration of different alternatives (e.g., each resource corresponds to a single-leg ﬂight, and an alternative is deﬁned as an itinerary and fare-class combination). The sharing of common resources can result in high correlations in utilities among the alternatives. In these circumstances, a model which ignores the correlation among the utilities associated with the alternatives can give inaccurate choice probabilities. 0018-9286/$31.00 © 2012 IEEE