A high-throughput apple SNP genotyping platform using the GoldenGateassay M. Awais Khan a , Yuepeng Han b , Youfu Frank Zhao c , Schuyler S. Korban a, a Department of Natural Resources & Environmental Sciences, University of Illinois, Urbana, IL 61801, USA b Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Moshan, Wuhan, 430074, People's Republic of China c Department of Crop Sciences, University of Illinois, Urbana, IL 61801, USA abstract article info Article history: Accepted 1 December 2011 Available online 24 December 2011 Keywords: Expressed sequence tags (ESTs) Malus Genotyping Single Nucleotide Polymorphisms (SNPs) EST data generated from 14 apple genotypes were downloaded from NCBI and mapped against a reference EST assembly to identify Single Nucleotide Polymorphisms (SNPs). Mapping of these SNPs was undertaken using 90% of sequence similarity and minimum coverage of four reads at each SNP position. In total, 37,807 SNPs were identied with an average of one SNP every 187 bp from a total of 6888 unique EST contigs. Identied SNPs were checked for anking sequences of 60 bp along both sides of SNP alleles for reliable de- sign of a custom high-throughput genotyping assay. A total of 12,299 SNPs, representing 6525 contigs, t the selected criterion of 60 bp sequences anking a SNP position. Of these, 1411 SNPs were validated using four apple genotypes. Based on genotyping assays, it was estimated that 60% of SNPs were valid SNPs, while 26% of SNPs might be derived from paralogous regions. © 2011 Elsevier B.V. All rights reserved. 1. Introduction In recent years, crop improvement efforts by capitalizing on iden- tication and utilization of genetic diversity have made major strides with the aid of molecular marker technology. Molecular markers are highly reliable in identifying and selecting parents carrying traits of in- terest, advancing breeding schemes, and ultimate crop improvement via either marker-assisted selection/breeding and/or map-based cloning (Han and Korban, 2010). Molecular markers are routinely used to detect and link variations in traits of interest through Quantitative Trait Loci (QTL) mapping. Once DNA markers linked to a trait of interest are identied, and the linkage is close, these markers can be used to select for seedlings possessing the desirable charac- ter(s) in a breeding population using a particular breeding scheme, such as backcrossing (Collard and Mackill, 2008; Moose and Mumm, 2008). Unlike phenotypic evaluation, selection based on DNA markers is not inuenced by environmental factors. Moreover, marker-assisted selection can be carried out at the seedling stage, and therefore would alleviate requirements for land space, eld evaluations, plant maintenance, along with associated costs, when compared to phenotypic selection (Collard and Mackill, 2008). Among various available molecular marker systems, Microsatellites or Simple Se- quence Repeats (SSRs) have become the marker of choice as these are characterized by 1 to 5 bp repeat motifs, and are highly polymor- phic between and among species (Korban and Tartarini, 2009). More- over, they are multi-allelic, co-dominant, reproducible, and transferable across different species. SSR markers are preferred for QTL mapping studies, marker-assisted selection, and comparative ge- netic analysis (Liebhard et al., 2003). Initial costs for development of SSR markers from genomic regions are usually very high compared to other marker systems. However, for crops wherein expressed se- quence tags (ESTs) are available, SSR development based on EST sequences is easy and cheap. Therefore, EST-SSRs have been recently developed in a wide range of plant species, and are being used for constructing genetic maps and for pursuing phylogenetic studies. Moreover, molecular markers developed from ESTs represent gene- coding regions, and thus are useful tools for bridging functional and structural genomics (Chagné et al., 2008; Korban and Tartarini, 2009; Varshney et al., 2005). To date, DNA sequences for many plant species, particularly those of ESTs, are publically available in large numbers in the NCBI database. Apple genetic maps have been generally constructed using a backbone of genomic SSR markers and populated with RAPDs, RFLPs, and AFLPs for QTL identication (Liebhard et al., 2003; Silfverberg-Dilworth et al., 2006). With avail- ability of apple EST sequences, EST-SSRs have been identied in silico and used for constructing new genetic maps (Han et al., 2011; Naik et al., 2006). Although both genomic SSRs and EST-SSRs offer advan- tages over other marker systems, there are some concerns over their throughput efcacy, identication, genotyping costs, as well as Gene 494 (2012) 196201 Abbreviations: ESTs, Expressed sequence tags; SNPs, Single Nucleotide Polymor- phisms; QTL, Quantitative Trait Loci; RAPDs, Random Amplied Polymorphic DNAs; RFLPs, Restriction Fragment Length Polymorphisms; AFLPs, Amplied Fragment Length Polymorphisms; SSRs, Microsatellites or Simple Sequence Repeats; HRM, High- resolution Melting; ADT, Assay Design Tool; GC, GenCall; LD, Linkage Disequilibrium. Corresponding author. Tel./fax: + 1 217 333 8928. E-mail address: korban@illinois.edu (S.S. Korban). 0378-1119/$ see front matter © 2011 Elsevier B.V. All rights reserved. doi:10.1016/j.gene.2011.12.001 Contents lists available at SciVerse ScienceDirect Gene journal homepage: www.elsevier.com/locate/gene