PSYCHOMETRIKA, VOL. 76, NO. 4, 670–690
OCTOBER 2011
DOI: 10.1007/S11336-011-9224-6

BIASES AND STANDARD ERRORS OF STANDARDIZED REGRESSION COEFFICIENTS

KE-HAI YUAN
UNIVERSITY OF NOTRE DAME

WAI CHAN
THE CHINESE UNIVERSITY OF HONG KONG

The paper obtains consistent standard errors (SE) and biases of order O(1/n) for the sample standardized regression coefficients with both random and given predictors. Analytical results indicate that the formulas for SEs given in popular textbooks are consistent only when the population value of the regression coefficient is zero. The sample standardized regression coefficients are also biased in general, although the bias should not be a concern in practice when the sample size is not too small. Monte Carlo results imply that, for both standardized and unstandardized sample regression coefficients, SE estimates based on asymptotics tend to under-predict the empirical ones at smaller sample sizes.

Key words: asymptotics, bias, consistency, Monte Carlo.

1. Introduction

Linear regression is typically the first model introduced in statistical textbooks. It is also one of the most widely used statistical methods across all disciplines. Statistical inference for regression models includes obtaining a confidence interval for each regression coefficient and performing an F-test for the overall significance of all the explanatory variables. There also exist various diagnostic tools for properly applying regression models.

In addition to the regression coefficients, the standardized regression coefficients are also of substantial interest in practice (Kelley and Maxwell, 2003). This is especially true in the social sciences, where variables are typically measured in different and arbitrary units. In the psychometric literature, standardized regression coefficients are called Beta-coefficients while the conventional regression coefficients are called B-coefficients.
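The relation between the two kinds of coefficients can be illustrated with a minimal sketch. It is not the authors' code: the helper names (`solve`, `ols_slopes`, `zscore`), the simulated data, and the population values used to generate it are all assumptions made for illustration. The sketch relies on the standard identity that a Beta-coefficient equals the corresponding B-coefficient multiplied by the sample standard deviation of its predictor and divided by the sample standard deviation of the response, and it checks that identity against a direct regression on z-scored variables.

```python
import random
import statistics


def solve(a, rhs):
    """Solve a x = rhs by Gauss-Jordan elimination with partial pivoting."""
    n = len(rhs)
    m = [row[:] + [r] for row, r in zip(a, rhs)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [v - f * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def center(values):
    """Subtract the sample mean from every value."""
    mean = statistics.mean(values)
    return [v - mean for v in values]


def zscore(values):
    """Center and scale by the sample standard deviation (n - 1 divisor)."""
    sd = statistics.stdev(values)
    return [v / sd for v in center(values)]


def ols_slopes(columns, y):
    """Least-squares slopes of y on the predictor columns.

    Centering every variable absorbs the intercept, so the slopes solve the
    normal equations built from centered sums of squares and cross-products.
    """
    xc = [center(col) for col in columns]
    yc = center(y)
    sxx = [[sum(u * v for u, v in zip(ci, cj)) for cj in xc] for ci in xc]
    sxy = [sum(u * v for u, v in zip(ci, yc)) for ci in xc]
    return solve(sxx, sxy)


# Simulated data (illustrative assumption): two predictors on very different scales.
rng = random.Random(0)
n = 200
x1 = [rng.gauss(0, 1) for _ in range(n)]
x2 = [rng.gauss(0, 10) for _ in range(n)]
y = [1.0 + 0.5 * a + 0.05 * c + rng.gauss(0, 1) for a, c in zip(x1, x2)]

# B-coefficients: slopes in the original units of the variables.
b = ols_slopes([x1, x2], y)

# Beta-coefficients: rescale each B-coefficient by s_x / s_y.
s_y = statistics.stdev(y)
beta = [bj * statistics.stdev(col) / s_y for bj, col in zip(b, [x1, x2])]

# Equivalent route: regress z-scored y on z-scored predictors.
beta_direct = ols_slopes([zscore(x1), zscore(x2)], zscore(y))

print("B-coefficients:   ", b)
print("Beta-coefficients:", beta)
print("Beta (z-scores):  ", beta_direct)
```

Because the rescaling factor s_x / s_y is itself estimated from the sample, the sampling variability of a Beta-coefficient is not simply that of the B-coefficient times a constant, which is the kind of issue the paper addresses.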
The importance of standardized regression coefficients is attested by their coverage in popular statistical textbooks (e.g., Cohen, Cohen, West, & Aiken, 2003; Harris, 2001; Hays, 1994) as well as in the most widely used statistical software (e.g., SAS, SPSS). However, our knowledge of standardized regression coefficients is very limited. For example, there does not exist a formula to properly estimate the standard errors (SEs) of the sample standardized regression coefficients. We also do not know whether the sample standardized coefficients are biased or not. The purpose of this paper is to study SEs and biases of the sample standardized regression coefficients.

This research was supported by Grants DA00017 and DA01070 from the National Institute on Drug Abuse. Requests for reprints should be sent to Ke-Hai Yuan, University of Notre Dame, Notre Dame, IN 46556, USA. E-mail: kyuan@nd.edu

© 2011 The Psychometric Society

In regression analysis, the most widely used statistic is probably the t-statistic for testing the null hypothesis that a regression coefficient is zero. Under the assumptions of given predictors and normally distributed errors, the t-statistic follows a Student t-distribution. However, data in the social sciences are seldom normally distributed (see Micceri, 1989) and the predictors are typically not controllable. Nevertheless, linear regression is frequently used in social science research when studying the relationship of a response with potential predictors. Thus, random predictors