PSYCHOMETRIKAVOL. 76, NO. 4, 670–690 OCTOBER 2011 DOI : 10.1007/ S11336-011-9224-6 BIASES AND STANDARD ERRORS OF STANDARDIZED REGRESSION COEFFICIENTS KE-HAI YUAN UNIVERSITY OF NOTRE DAME WAI CHAN THE CHINESE UNIVERSITY OF HONG KONG The paper obtains consistent standard errors (SE) and biases of order O(1/n) for the sample stan- dardized regression coefficients with both random and given predictors. Analytical results indicate that the formulas for SEs given in popular text books are consistent only when the population value of the regression coefficient is zero. The sample standardized regression coefficients are also biased in general, although it should not be a concern in practice when the sample size is not too small. Monte Carlo results imply that, for both standardized and unstandardized sample regression coefficients, SE estimates based on asymptotics tend to under-predict the empirical ones at smaller sample sizes. Key words: asymptotics, bias, consistency, Monte Carlo. 1. Introduction Linear regression is typically the first model introduced in statistical text books. It is also one of the most widely used statistical methods across all disciplines. Statistical inference for regression models includes obtaining a confidence interval for each regression coefficient and performing an F -test for the overall significance of all the explanatory variables. There also exist various diagnostic tools for properly applying regression models. In addition to the regression coefficients, the standardized regression coefficients are also of substantial interest in practice (Kelley and Maxwell, 2003). This is especially true in social sci- ences where variables are typically measured in different and arbitrary units. In the psychometric literature, standardized regression coefficients are called Beta-coefficients while the conventional regression coefficients are called B-coefficients. The importance of the standardized regression coefficients can be testified by its coverage in popular statistical textbooks (e.g., Cohen, Cohen, West, & Aiken, 2003; Harris, 2001; Hays, 1994) as well as most widely used statistical software (e.g., SAS, SPSS). However, our knowledge of standardized regression coefficients is very lim- ited. For example, there does not exist a formula to properly estimate the standard errors (SEs) of the sample standardized regression coefficients. We also do not know whether the sample stan- dardized coefficients are biased or not. The purpose of this paper is to study SEs and biases of the sample standardized regression coefficients. In regression analysis, the most widely used statistic is probably the t -statistic for testing the null hypothesis that a regression coefficient is zero. Under the assumptions of given predictors and normally distributed errors, the t -statistic follows a Student t -distribution. However, data in social sciences are seldom normally distributed (see Micceri, 1989) and the predictors are typi- cally not controllable. Nevertheless, linear regression is frequently used in social science research when studying the relationship of a response with potential predictors. Thus, random predictors This research was supported by Grants DA00017 and DA01070 from the National Institute on Drug Abuse. Requests for reprints should be sent to Ke-Hai Yuan, University of Notre Dame, Notre Dame, IN 46556, USA. E-mail: kyuan@nd.edu © 2011 The Psychometric Society 670