Research Article

Received 12 August 2011, Accepted 24 March 2012
Published online in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/sim.5417

Testing goodness of fit in regression: a general approach for specified alternatives

Aldo Solari,a*† Saskia le Cessie b,c and Jelle J. Goeman c

When fitting generalized linear models or the Cox proportional hazards model, it is important to have tools to test for lack of fit. Because lack of fit comes in all shapes and sizes, distinguishing among different types of lack of fit is of practical importance. We argue that an adequate diagnosis of lack of fit requires a specified alternative model. Such a specification identifies the type of lack of fit the test is directed against, so that if we reject the null hypothesis, we know the direction of the departure from the model. The goodness-of-fit approach of this paper allows different types of lack of fit to be treated within a unified general framework and many existing tests to be considered as special cases. Connections with penalized likelihood and random effects are discussed, and the application of the proposed approach is illustrated with medical examples. Tailored functions for goodness-of-fit testing have been implemented in the R package globaltest. Copyright © 2012 John Wiley & Sons, Ltd.

Keywords: goodness of fit; logistic regression; generalized linear models; Cox proportional hazards model

1. Introduction

A goodness-of-fit test addresses the question: 'Is there evidence of inconsistency of data with a statistical model?' However, a departure from the model may happen in different directions. A linear model, for example, may fail because transformations of covariates are required, or because interaction effects have been missed, or both. Distinguishing the different types of lack of fit is of practical importance: if we find evidence against the model, we generally also want to know why the model does not fit.
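To make the idea of departures in different directions concrete, the following is a minimal sketch in Python, assuming an ordinary linear model and simulated data. It fits a main-effects null model and compares it with two specified alternatives (an added quadratic term and an added interaction term) using the classical nested-model F statistic. The function names, sample size, and coefficient values are illustrative assumptions and do not come from the paper; the sketch is not the goodness-of-fit test proposed here.

```python
import numpy as np


def rss(X, y):
    """Residual sum of squares from an ordinary least-squares fit of y on X."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    return float(resid @ resid)


def nested_f_statistic(X_null, X_alt, y):
    """Classical F statistic for the null design against a larger nested design."""
    n = len(y)
    q = X_alt.shape[1] - X_null.shape[1]  # number of extra terms in the alternative
    df_resid = n - X_alt.shape[1]         # residual degrees of freedom under the alternative
    rss0, rss1 = rss(X_null, y), rss(X_alt, y)
    return ((rss0 - rss1) / q) / (rss1 / df_resid)


# Simulated data (assumption): the true mean is quadratic in x1, with no interaction.
rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)
y = 1.0 + 0.5 * x1 + 0.5 * x2 + 0.8 * x1**2 + rng.normal(size=n)

X_null = np.column_stack([np.ones(n), x1, x2])  # main effects only
X_quad = np.column_stack([X_null, x1**2])       # alternative 1: transformation of x1
X_inter = np.column_stack([X_null, x1 * x2])    # alternative 2: interaction effect

f_quad = nested_f_statistic(X_null, X_quad, y)
f_inter = nested_f_statistic(X_null, X_inter, y)
print(f"F against quadratic alternative:   {f_quad:.2f}")
print(f"F against interaction alternative: {f_inter:.2f}")
```

In this simulated setting the statistic directed against the quadratic alternative should be much larger than the one directed against the interaction alternative, which indicates not only that the main-effects model fails but also in which direction.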
We argue that an adequate diagnosis of lack of fit requires the specification of the alternative model. There are two aspects to this. Firstly, the alternative represents the type of lack of fit of interest and will result in a test statistic sensitive to it, because general criteria such as the Neyman–Pearson theory are applicable. Secondly, the alternative model can be fitted and interpreted, giving some guide as to the type of lack of fit that may be present. A goodness-of-fit test should, therefore, be specific about the type of lack of fit it is directed against. However, most of the goodness-of-fit tests in routine use and provided in standard software either leave the alternative model unspecified or formulate a very particular alternative. An example of the former is the Hosmer–Lemeshow [1] test for logistic regression: it does not specify the type of lack of fit to which it is especially sensitive, and there is no alternative model to fit. An example of a test with a very particular alternative is the F test for the simple linear model against quadratic regression. This very specific alternative has the drawback that more complex relationships between the response and the covariate may be overlooked.

a Department of Statistics, University of Milano-Bicocca, via Bicocca degli Arcimboldi 8, 20126 Milan, Italy
b Department of Clinical Epidemiology, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
c Department of Medical Statistics and Bioinformatics, Leiden University Medical Center, PO Box 9600, 2300 RC Leiden, The Netherlands
* Correspondence to: Aldo Solari, Department of Statistics, University of Milano-Bicocca, via Bicocca degli Arcimboldi 8, 20126 Milan, Italy.
† E-mail: aldo.solari@unimib.it

Statist. Med. 2012