Role of T7 RNA Polymerase His784 in Start Site Selection and Initial Transcription Luis G. Brieba, Robert Padilla, and Rui Sousa* Department of Biochemistry, UniVersity of Texas Health Sciences Center, 7703 Floyd Curl DriVe, San Antonio, Texas 78284-7760 ReceiVed December 18, 2001; ReVised Manuscript ReceiVed February 12, 2002 ABSTRACT: The role of steric constraints vs sequence preference in start site selection by T7 RNA polymerase was investigated by using a series of synthetic promoters in which the preferred template strand ‘CC’ initiation sequence was moved away from its normal position relative to the -17 to -6 element of the T7 promoter. It was found that the CC sequence directs efficient initiation if placed 1 or 2 nt downstream of its normal position, but not if placed upstream, or more than 2 nt downstream, of +1. Mutagenesis revealed that part of the bias to initiate with GTP is due to an interaction between histidine 784 and the 2-amino group of a guanosine bound in the initiating triphosphate position. This interaction is also important for holding short transcripts within the transcription complex during initial transcription. The 23 base pair T7 promoter has a tripartite structure, with distinct elements important for specific binding of T7 RNAP, 1 for promoter opening, and for facilitating the initiation and extension of transcripts to a length which allows formation of a stable elongation complex (EC). The sequence from -17 to -6 is critical for sequence-specific binding of the polymerase (1-9). Mutations in this region reduce promoter-polymerase affinity, and the crystal struc- ture of an RNAP-promoter complex reveals numerous interactions with bases in this region (10, 11). Mutations in the -4 to -1 ‘TATA’ sequence reduce promoter activity, but can be compensated for by negative supercoiling (3, 4), suggesting that the role of this element is to facilitate promoter opening. In fact, while the -17 to -6 sequence must be duplex to direct specific binding of the polymerase, the promoter can be single-stranded downstream of -5 and still be fully active (12). Crystal structures (10, 11) and permanganate reactivity (13, 14) confirm that the -1 to -4 sequence is melted during initiation, but that the upstream sequence remains duplex. The third part of the promoter is formed by the sequence immediately downstream of the transcription start site (+1 to +6; the initially transcribed sequence or ITS). The nontemplate (NT) strand ITS of the most active T7 promoters is ‘GGGAGA’ (15). Changes to this sequence reduce initiation rates, increase apparent NTP K s values, and increase dissociation of short transcripts from the transcription complex during initial transcription, with changes at +1 or +2 having the most severe effects (6, 16- 19). The ITS also appears to be important in specifying the transcription start site. Though start site selection is deter- mined primarily by steric constraintsstranscription usually begins 6 nt downstream of the -17 to -6 elementsthere is a strong preference for initiating with ‘GG’. In fact, if the T strand base at +1 is changed to G, A, or T, but C occurs at +2, then transcription will start from +2(6, 20). The effects of changes to the ITS are similar both in fully duplex promoters and in those lacking an NT strand downstream of -5, indicating that the NT strand is unlikely to play a major role in ITS function. The role of the ITS in directing efficient initiation and in stabilizing the association of short transcripts with the initial transcription complex (ITC) may involve multiple mechanisms. When starting with the canonical ‘GGGAGA’ sequence, purine-purine stacking and G-C hydrogen bonding may enhance binding of the initiating NTPs, as well as transcript -template and transcript- NTP interactions. However, such mechanisms should operate with any RNAP, but not all RNAPs display a strong preference for initiating with guanosines. Therefore, it is likely that T7 RNAP active site side chains also make base-specific interactions with either the T strand or the NTPs. To identify such interactions, we examined the structure of a T7 RNAP ITC and identified two side chains likely to make base-specific interactions important either for start site selection or for binding the transcript during initial transcription. Mutation of one of these (R425) severely reduces RNAP activity, and mutations at this position were not further characterized. However, the effects of mutations of H784 reveal that this residue makes an H-bond to the 2-amino group of the initiating guanosine triphosphate and the 3-guanosine of the transcript. Thus, this interaction is important both for directing initiation with guanosines and for holding G-terminated short RNAs within the ITC during initial transcription. MATERIALS AND METHODS Mutant construction, enzyme expression, and purification were as described (21). Modified NTPs were from Trilink Supported by the Welch Foundation, by NIH Grant GM 52522 (to R.S.), and by a Fulbright-CONACYT Fellowship (to L.G.B.). * To whom correspondence should be addressed. Phone: 210-567- 8782. Fax: 210-567-8778. E-mail: sousa@biochem.uthscsa.edu. 1 Abbreviations: RNAP, RNA polymerase; EC, elongation complex; ITS, initially transcribed sequence; NT, nontemplate; T, template; ITC, initial transcription complex; wt, wild type; TBE, Tris-borate-EDTA; nt, nucleotide(s). 5144 Biochemistry 2002, 41, 5144-5149 10.1021/bi016057v CCC: $22.00 © 2002 American Chemical Society Published on Web 03/23/2002