9 Additive Models, Trees, and Related Methods
In this chapter we begin our discussion of some specific methods for supervised learning. These techniques each assume a (different) structured form
for the unknown regression function, and by doing so they finesse the curse
of dimensionality. Of course, they pay the possible price of misspecifying
the model, and so in each case there is a tradeoff that has to be made. They
take off where Chapters 3–6 left off. We describe five related techniques:
generalized additive models, trees, multivariate adaptive regression splines,
the patient rule induction method, and hierarchical mixtures of experts.
9.1 Generalized Additive Models
Regression models play an important role in many data analyses, providing prediction and classification rules, and data analytic tools for understanding the importance of different inputs.
Although attractively simple, the traditional linear model often fails in
these situations: in real life, effects are often not linear. In earlier chapters
we described techniques that used predefined basis functions to achieve
nonlinearities. This section describes more automatic flexible statistical
methods that may be used to identify and characterize nonlinear regression
effects. These methods are called “generalized additive models.”
In the regression setting, a generalized additive model has the form
\[
\mathrm{E}(Y \mid X_1, X_2, \ldots, X_p) = \alpha + f_1(X_1) + f_2(X_2) + \cdots + f_p(X_p). \tag{9.1}
\]
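To make the additive form (9.1) concrete, here is a minimal sketch that fits such a model by backfitting: cycle over the predictors, smoothing the partial residuals against each \(X_j\) in turn and centering each \(\hat{f}_j\) for identifiability. The crude running-mean smoother and the simulated functions \(f_1(x) = \sin(2\pi x)\), \(f_2(x) = x^2\) are illustrative choices, not part of the text; the book's own development uses spline smoothers.

```python
import numpy as np

def running_mean_smoother(x, r, window=21):
    # Smooth residuals r against x with a simple running mean
    # (an illustrative stand-in for a spline smoother; the zero
    # padding of np.convolve shrinks the fit near the edges).
    order = np.argsort(x)
    kernel = np.ones(window) / window
    smoothed = np.convolve(r[order], kernel, mode="same")
    fit = np.empty_like(r)
    fit[order] = smoothed
    return fit

def backfit_additive_model(X, y, n_iter=20):
    # Fit E(Y|X) = alpha + sum_j f_j(X_j) by backfitting:
    # smooth the partial residuals against each predictor in turn.
    n, p = X.shape
    alpha = y.mean()
    f = np.zeros((n, p))
    for _ in range(n_iter):
        for j in range(p):
            others = [k for k in range(p) if k != j]
            partial = y - alpha - f[:, others].sum(axis=1)
            f[:, j] = running_mean_smoother(X[:, j], partial)
            f[:, j] -= f[:, j].mean()  # each f_j averages to zero
    return alpha, f

# Simulated example: y = 2 + sin(2*pi*x1) + x2^2 + noise
rng = np.random.default_rng(0)
n = 500
X = rng.uniform(-1, 1, size=(n, 2))
y = 2 + np.sin(2 * np.pi * X[:, 0]) + X[:, 1] ** 2 + 0.1 * rng.normal(size=n)
alpha, f = backfit_additive_model(X, y)
```

Each recovered column of `f` can be plotted against its predictor to visualize the estimated component function, which is the chief interpretive appeal of the additive form.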
© Springer Science + Business Media, LLC 2009
T. Hastie et al., The Elements of Statistical Learning, Second Edition, DOI: 10.1007/b94608_9