Robustness properties of a robust PLS regression method K. Vanden Branden and M. Hubert December 10, 2003 Abstract The presence of multicollinearity in regression data is no exception in real life ex- amples. Instead of applying ordinary regression methods, biased regression techniques such as Principal Component Regression and Ridge Regression have been developed to cope with such data sets. In this paper we consider Partial Least Squares (PLS) regres- sion by means of the SIMPLS algorithm. Because the SIMPLS algorithm is based on the empirical variance-covariance matrix of the data and on least squares regression, outliers have a damaging effect on the estimates. To reduce this pernicious effect of outliers, we propose to replace the empirical variance-covariance matrix in SIMPLS by a robust covariance estimator. We derive the influence function of the resulting PLS weight vectors and the regression estimates, and conclude that they will be bounded if the robust covariance estimator has a bounded influence function. Also the breakdown value is inherited from the robust estimator. We illustrate the results using the MCD estimator and the reweighted MCD estimator (RMCD) for low-dimensional data sets. Also some empirical properties are provided for a high-dimensional data set. Key words: Partial Least Squares Regression, SIMPLS, Influence Function, Minimum Covariance Determinant, Robustness. * Assistant, Department of Mathematics, Katholieke Universiteit Leuven, W. de Croylaan 54, B-3001 Leuven, Belgium, Karlien.VandenBranden@wis.kuleuven.ac.be. Assistant Professor, Department of Mathematics, Katholieke Universiteit Leuven, W. de Croylaan 54, B-3001 Leuven, Belgium, Mia.Hubert@wis.kuleuven.ac.be 1