Field Crops Research 119 (2010) 277–289
Contents lists available at ScienceDirect
Field Crops Research
journal homepage: www.elsevier.com/locate/fcr
Establishing a soybean germplasm core collection
Marcelo F. Oliveira
a,∗
, Randall L. Nelson
b
, Isaias O. Geraldi
c
, Cosme D. Cruz
d
,
José Francisco F. de Toledo
a
a
Embrapa Soybean, P.O. Box 231, Londrina, PR 86001-970, Brazil
b
USDA-Agricultural Research Service, Soybean/Maize Germplasm, Pathology, and Genetics Research Unit, Dept. of Crop Sciences, 1101 W. Peabody Dr., Univ. of Illinois, Urbana, IL
61801, United States
c
University of São Paulo-Faculdade de Agronomia-ESALQ, Dept. of Genetics, P.O. Box 83, Piracicaba, SP 13400-970, Brazil
d
Federal University of Vic ¸ osa, Faculdade de Agronomia, Avenida Peter Henry Rolfs, s/n Campus Universitário, Vic ¸ osa, MG 36570-000, Brazil
article info
Article history:
Received 9 March 2010
Received in revised form 23 July 2010
Accepted 25 July 2010
Keywords:
Glycine max
Genetic diversity
Germplasm bank
Sampling strategies
abstract
Core collections are of strategic importance as they allow the use of a small part of a germplasm collection
that is representative of the total collection. The objective of this study was to develop a soybean core
collection of the USDA Soybean Germplasm Collection by comparing the results of random, proportional,
logarithmic, multivariate proportional and multivariate logarithmic sampling strategies. All but the ran-
dom sampling strategy used stratification of the entire collection based on passport data and maturity
group classification. The multivariate proportional and multivariate logarithmic strategies made further
use of qualitative and quantitative trait data to select diverse accessions within each stratum. The 18
quantitative trait data distribution parameters were calculated for each core and for the entire collection
for pairwise comparison to validate the sampling strategies. All strategies were adequate for assembling
a core collection. The random core collection best represented the entire collection in statistical terms.
Proportional and logarithmic strategies did not maximize statistical representation but were better in
selecting maximum variability. Multivariate proportional and multivariate logarithmic strategies pro-
duced the best core collections as measured by maximum variability conservation. The soybean core
collection was established using the multivariate proportional selection strategy.
© 2010 Elsevier B.V. All rights reserved.
1. Introduction
Access to genetic variability is critical for plant breeding.
Germplasm conservation centers were created to preserve the
available genetic variability before it is lost due to the widespread
use of modern, improved cultivars (Brown, 1989b). In the 1980s,
the International Board for Plant Genetic Resources (IBPGR) pro-
vided substantial financial support for germplasm preservation
that resulted in an increase in the number of collections. The
emphasis placed on conservation led to the establishment and
preservation of large collections, which were not totally or even
partially evaluated or characterized. Although large germplasm
collections are desirable from the perspective of genetic variabil-
ity preservation (Frankel and Bennett, 1970), their usefulness and
accessibility can be inversely related to their size (Frankel and
Soulé, 1981). The increase in the number of accessions that are not
adequately evaluated can diminish the effectiveness of collections
(Holden, 1984; Marshall, 1989).
∗
Corresponding author. Tel.: +55 43 3371 6263; fax: +55 43 33716001.
E-mail address: embrapamfo@gmail.com (M.F. Oliveira).
China has the largest Glycine collection, with approximately
26,000 accessions of Glycine max and 6200 accessions of Glycine
soja, located in the Institute of Crop Germplasm Resources of the
Chinese Academy of Agricultural Science in Beijing (Chang et al.,
1999; Carter et al., 2004) and has developed a core collection of this
accessions (Zhang et al., 2003). A core collection of the perennial
Glycine species has also been established (Brown et al., 1987).
The soybean germplasm collection of the United States Depart-
ment of Agriculture (USDA) is the second largest in the world
with 16,999 accessions of introduced G. max, 1116 accessions of
G. soja and 919 accessions of perennial Glycine species. Detailed
origin data are available for most entries in the USDA Soybean
Germplasm Collection as are data for many descriptive, agronomic,
and seed composition traits. For most germplasm collections, there
is a gap between the germplasm availability and its use (Peeters
and Galwey, 1988) due to the collection size and financial limi-
tations. In general, germplasm conservation programs have been
more successful in ensuring long-term preservation then in facili-
tating germplasm use.
The establishment of core collections, as proposed by Frankel
and Brown (1984), is an effective strategy to optimize human, mate-
rial, and financial resources by providing greater efficiency in the
use of germplasm collections (Spagnoletti-Zeuli and Qualset, 1993;
0378-4290/$ – see front matter © 2010 Elsevier B.V. All rights reserved.
doi:10.1016/j.fcr.2010.07.021