Journal of Cereal Science 14 (1991) 95-104 Principal Component Analysis - an Efficient Tool for Selection of Wheat Samples with Wide Variation in Properties E. WESTERLUND, R. ANDERSSON, M. HAMALAINEN and P. AMAN Swedish University of Agricultural Sciences, Department of Chemistry, Box 7015, S-750 07 Uppsala, Sweden Received 10 July 1990 Twenty-three variables were studied that described the chemical composition and baking properties of 100 samples of spring and winter wheat. Sixteen of these samples were selected with the aid of principal component analysis (PCA) in such a manner that much of the variation in all the parameters was retained. The selection procedure preserved a larger part of the variation in the original material than selection by random sampling. This was deduced from comparison of ranges recovered with respect to each variable and selection procedure. The principles of PCA are general and the procedure developed can therefore be recommended as one means of solving sample selection problems. Introduction Selection of a number of subsampies from a larger sample set in a way that retains as much as possible of the variability present in the original sample set constitutes an important problem in applied science. One way to select samples is to use random sampling\ but this may produce a high error in subsequent analysis due to unfortunate selection of non-representative samples. Performing the selection in a systematic manner is generally more attractive, therefore, depending on the particular aim of the investigation 1 . When the number of samples and the number of sample variables are large, however, systematic selection based on visual comparison of sample values may become difficult. The baking properties of 100 wheat samples are being studied in this laboratory by evaluating a multitude of chemical and physical parameters. A detailed characterization of soluble fibre components, which are considered to have important baking properties2, became necessary. As a consequence, the original number of wheat samples had to be reduced by selection because of the extensive analytical work that would otherwise be required. A suitable selection procedure was developed using principal component analysis (peA), which is a mathematical multivariate method 3 4 Multivariate methods have been used previously to select samples for multivariate calibration of near-infrared spectrophotometers·- 7 The aim of the present paper is to show that peA can be used to reduce the number 0753-5210/91/040095 + 10 $03.00/0 © 1991 Academic Press Limited