Long-Loop GQuadruplexes Are Misfolded Population Minorities with Fast Transition Kinetics in Human Telomeric Sequences Deepak Koirala, § Chiran Ghimire, § Christopher Bohrer, # Yuta Sannohe, Hiroshi Sugiyama,* ,, and Hanbin Mao* ,§ § Department of Chemistry & Biochemistry and # Department of Physics, Kent State University, Kent, Ohio 44242, United States Department of Chemistry, Graduate School of Science, and Institute for Integrated Cell Material Sciences (iCeMS), Kyoto University, Sakyo-ku, Kyoto, Japan * S Supporting Information ABSTRACT: Single-stranded guanine (G)-rich sequences at the 3end of human telomeres provide ample opportunities for physiologically relevant structures, such as G-quadruplexes, to form and interconvert. Population equilibrium in this long sequence is expected to be intricate and beyond the resolution of ensemble-average techniques, such as circular dichroism, NMR, or X-ray crystallography. By combining a force-jump method at the single-molecular level and a statistical population deconvolution at the sub-nanometer resolution, we reveal a complex population network with unprecedented transition dynamics in human telomeric sequences that contain four to eight TTAGGG repeats. Our kinetic data rmly establish that G-triplexes are intermediates to G-quadruplexes while long-loop G-quadruplexes are misfolded population minorities whose formation and disassembly are faster than G-triplexes or regular G-quadruplexes. The existence of misfolded DNA supports the emerging view that structural and kinetic complexities of DNA can rival those of RNA or proteins. While G-quadruplexes are the most prevalent species in all the sequences studied, the abundance of a misfolded G-quadruplex in a particular telomeric sequence decreases with an increase in the loop length or the number of long-loops in the structure. These population patterns support the prediction that in the full-length 3overhang of human telomeres, G-quadruplexes with shortest TTA loops would be the most dominant species, which justies the modeling role of regular G-quadruplexes in the investigation of telomeric structures. INTRODUCTION In human cells, telomeres at the end of chromosomes consist of single-stranded 3overhang of 200 nucleotides with a consensus guanine (G)-rich repeat sequence, 5-TTAGGG. 1-4 Four such G-rich repeats are known to form a stable DNA secondary structure, G-quadruplex (GQ). 5,6 A G-quadruplex is composed of a stack of G-quartets, each of which is held together by four guanines through Hoogsteen hydrogen bonds and further stabilized by intercalating cations such as K + or Na + . 7,8 Biological investigations suggest that these telomeric DNA secondary structures can regulate the length of telomere either by interfering with telomerase activity or by participating in events such as uncapping of telosomes. 9-11 Since telomere length is closely associated with cellular processes that lead to senescence or cancer, telomeric G-quadruplexes become an attractive target for cancer treatment. 12,13 Despite their simple repeating sequence, human telomeric G- quadruplexes exhibit a stunning structural polymorphism. At least nine conformations of telomeric G-quadruplex have been revealed in dierent buers or in DNA templates that contain four G-rich repeats with varying anking sequences. 14-21 The observation of partially folded structures either as intermediates to G-quadruplexes or as terminally folded species 22-24 added another level of structural complexity. One rationale for this structural polymorphism is that it presents a exible regulatory mechanism for cellular processes. Particular biological functions may be regulated by prevailing structures in a population equilibrium that is dependent on cellular conditions such as pH or proteins. However, ensemble-average techniques, such as circular dichroism (CD), NMR, or X-ray crystallography, have diculties to deconvolute individual species, especially those with insignicant population fractions or short lifetimes, formed in the same biological molecule. To resolve a structure in such a population mixture, mutations in the biological molecule are often required to selectively populate the species of interest. 20,25,26 Recently, such a practice surprisingly revealed G-quadruplex conformations that harbor (TTAGGGTTA) n in one of the loops (long-loop GQs) in human telomeric DNA fragments with more than four TTAGGG repeats. 25 This procedure, however, changes population equilibrium and distorts the transition between dierent species. Due to these diculties, it is yet to clarify the population equilibrium and the Received: October 3, 2012 Published: January 17, 2013 Article pubs.acs.org/JACS © 2013 American Chemical Society 2235 dx.doi.org/10.1021/ja309668t | J. Am. Chem. Soc. 2013, 135, 2235-2241