Analysis of Association at Single Nucleotide Polymorphisms in the APOE Region Eden R. Martin,* , † ,1 John R. Gilbert,* , † Eric H. Lai,‡ John Riley,§ Allison R. Rogala,* , † Brandon D. Slotterbeck,* , † Catherine A. Sipe,* , † Janet M. Grubber,* , † Liling L. Warren,* , † P. Michael Conneally, ¶ Ann M. Saunders,* ,  Donald E. Schmechel,* ,  Ian Purvis,§ Margaret A. Pericak-Vance,* , † Allen D. Roses,* , ‡ and Jeffery M. Vance* * Department of Medicine, †Center for Human Genetics, and Bryan Alzheimer Disease Research Center, Duke University Medical Center, Durham, North Carolina 27710; ‡Glaxo Wellcome, Inc., Five Moore Drive, Research Triangle Park, North Carolina 27709; §Glaxo Wellcome, Inc., Greenford Road, Greenford, Middlesex UB6 OH3, London, United Kingdom; and ¶ Department of Medicine and Molecular Genetics, Indiana University School of Medicine, Indianapolis, Indiana 46202 Received August 30, 1999; accepted November 12, 1999 The discussion of the prospects of using a dense map of single nucleotide polymorphisms (SNPs) to identify disease genes with association analysis has been ex- tensive. However, there is little empiric evidence to support this strategy. To begin to examine the practi- cal issues surrounding this methodology, we identiﬁed 10 SNPs in the region immediately surrounding the apolipoprotein E locus (APOE), an established suscep- tibility gene for Alzheimer disease. Our goal was to examine patterns of allelic association to begin to in- vestigate the question of whether APOE could have been identiﬁed using SNPs. Our strongest evidence of association was at the 2 SNPs immediately ﬂanking APOE. © 2000 Academic Press INTRODUCTION One of the major goals of the Human Genome Project (HGP) has been to identify a set of easily typed mark- ers spanning the genome (Collins and Galas, 1993). As a result, thousands of microsatellites have been mapped and are in widespread use in genetic studies. With the achievement of this aim, the newest endeavor of the HGP is to identify hundreds of thousands of single nucleotide polymorphisms (SNPs) across the ge- nome. It is anticipated that, over the next 3 years, at least 100,000 –200,000 SNPs will be identiﬁed by the HGP (Collins et al., 1998). Through a separate initia- tive, the SNP Consortium (TSC) has been formed with the goal of generating 300,000 SNPs in the next 2 years, with at least half of the SNPs mapped by mid- year 2001 (Marshall, 1999). Although SNPs will have generally lower heterozy- gosity and, therefore, be potentially less informative than microsatellite markers, they have several advan- tages. SNPs are more abundant than microsatellites; they are estimated to occur, on average, every 750 – 1000 bp (Kwok et al., 1996;Wang et al., 1998). Analysis with SNPs may also be advantageous since they gen- erally have a lower mutation rate than microsatellites. These properties make SNPs ideal for gene identiﬁca- tion through association analysis since they increase the chance of ﬁnding a marker polymorphism in asso- ciation with a disease allele. Association analysis at- tempts to identify disease genes by looking for associ- ations between genetic marker alleles and a disease phenotype in case-control or family samples. Associa- tion studies can be more powerful for localizing genes than linkage analysis, particularly when the contribu- tion of these genes to the disease is small, as would be expected for complex diseases (Risch and Merikangas, 1996). For association analysis to be useful, however, a dense map of markers will be required since associa- tions can generally be found over only small distances, say no larger than 1 cM and often much smaller (Bod- mer, 1986; Jorde et al., 1993). Although genome-wide association studies using SNPs have been proposed (Risch and Merikangas, 1996), there are many difﬁculties to be addressed be- fore this strategy becomes feasible. A more immediate use of SNPs will be for association analysis in candi- date genes or regions. SNPs may be used to examine candidate genes that have been implicated through their biological function or because they lie in a region identiﬁed by linkage analysis. Even when no obvious candidate gene exists in a region of linkage, SNPs may be useful for narrowing the region of interest. Regions identiﬁed by linkage analysis are often quite large, especially for complex disorders, easily spanning 10 –20 cM or greater (Hauser and Boehnke, 1997). Since associations are found generally over much smaller distances, one strategy for identifying the dis- 1 To whom correspondence should be addressed at Duke Univer- sity Medical Center, Box 3468, Durham, NC 27710. Telephone: (919) 402-2553. Fax: (919) 401-0166. E-mail: emartin@chg.mc.duke.edu. Genomics 63, 7–12 (2000) doi:10.1006/geno.1999.6057, available online at http://www.idealibrary.com on 7 0888-7543/00 $35.00 Copyright © 2000 by Academic Press All rights of reproduction in any form reserved.