Transcription Start Site Sequence and Spacing between the 10
Region and the Start Site Affect Reiterative Transcription-Mediated
Regulation of Gene Expression in Escherichia coli
Xiaosi Han,* Charles L. Turnbough, Jr.
Department of Microbiology, University of Alabama at Birmingham, Birmingham, Alabama, USA
Reiterative transcription is a reaction catalyzed by RNA polymerase, in which nucleotides are repetitively added to the 3= end of a
nascent transcript due to upstream slippage of the transcript without movement of the DNA template. In Escherichia coli, the
expression of several operons is regulated through mechanisms in which high intracellular levels of UTP promote reiterative
transcription that adds extra U residues to the 3= end of a nascent transcript during transcription initiation. Immediately follow-
ing the addition of one or more extra U residues, the nascent transcripts are released from the transcription initiation complex,
thereby reducing the level of gene expression. Therefore, gene expression can be regulated by internal UTP levels, which reflect
the availability of external pyrimidine sources. The magnitude of gene regulation by these mechanisms varies considerably, even
when control mechanisms are analogous. These variations apparently are due to differences in promoter sequences. One of the
operons regulated (in part) by UTP-sensitive reiterative transcription in E. coli is the carAB operon, which encodes the first en-
zyme in the pyrimidine nucleotide biosynthetic pathway. In this study, we used the carAB operon to examine the effects of nucle-
otide sequence at and near the transcription start site and spacing between the start site and 10 region of the promoter on reit-
erative transcription and gene regulation. Our results indicate that these variables are important determinants in establishing
the extent of reiterative transcription, levels of productive transcription, and range of gene regulation.
U
sually during transcription, the nascent RNA transcript and
the template strand of DNA move in tandem as an RNA-
DNA hybrid. However, during transcription of a homopolymeric
tract in the DNA template, the nascent transcript can slip (typi-
cally) one base upstream without movement of the DNA template
within the active site of RNA polymerase (RNAP) (1, 2). This
repositioning allows the same template base to specify an addi-
tional nucleotide in the transcript, and when transcript slippage
occurs repetitively, the same template nucleotide can specify mul-
tiple extra residues. This reaction is called reiterative transcription
(also known as RNAP stuttering, transcription slippage, and pseu-
dotemplated transcription) and appears to be catalyzed by all
RNAPs (2–5). Reiterative transcription can involve the repetitive
addition of any of the four nucleoside triphosphate substrates and
occurs during transcription initiation, elongation, and termina-
tion (2, 3, 6, 7). During initiation, when the length of the RNA-
DNA hybrid can be shorter than that of the 8- to 9-bp hybrid that
forms during elongation (8), a homopolymeric tract as short as
three residues can enable reiterative transcription (9, 10). In con-
trast, longer homopolymeric tracts usually are required for reiter-
ative transcription during elongation and termination (6, 7).
The physiological significance of reiterative transcription is
that it plays a central role in regulating the expression of numerous
prokaryotic, viral, and eukaryotic genes through an assortment of
different mechanisms (3, 11). In Escherichia coli, expression of at
least five operons is regulated by mechanisms involving the reit-
erative addition of U residues during transcription initiation (4,
12–15). Although these mechanisms can differ fundamentally
(11), three examples include analogous mechanisms that regulate,
in part, the expression of the pyrBI and carAB pyrimidine nucle-
otide biosynthetic operons and the galETKM (gal) galactose cata-
bolic operon (12–14). The promoters of each of these operons (at
least one when there are multiple promoters) include initially
transcribed regions that contain a tract of three T·A base pairs
located one or two bases downstream from the transcription start
site (Fig. 1). These homopolymeric tracts, here referred to as (non-
template strand) T tracts, enable a fraction of the nascent tran-
scripts to enter the reiterative pathway. Entry into this pathway is
enhanced by high intracellular concentrations of UTP (i.e., the
reiterative substrate), resulting in a larger fraction of transcripts
containing extra U residues. In each case, the addition of extra U
residues causes the release of the nascent transcript from the tran-
scription initiation complex, thereby repressing operon expres-
sion. This repression allows the cell to use UTP levels to control
pyrBI, carAB, and gal expression and synthesize the encoded en-
zymes at optimal levels for growth. Although these control mech-
anisms are basically the same, the range of regulation afforded by
them varies considerably, specifically, 2-fold, 3-fold, and 7-fold
for the gal, carAB, and pyrBI operons, respectively (12–14). These
different ranges presumably reflect important differences in pro-
moter sequences, which could include different sequences at or
near the transcription start site or the spacing between the start site
and the 10 region (Fig. 1).
In this study, we constructed mutant variants of the carAB P1
promoter, one of two carAB promoters and the only one that
participates in reiterative transcription-mediated gene regulation
Received 9 April 2014 Accepted 27 May 2014
Published ahead of print 2 June 2014
Address correspondence to Charles L. Turnbough, Jr., chuckt@uab.edu.
* Present address: Xiaosi Han, Department of Neurology, University of Alabama at
Birmingham, Birmingham, Alabama, USA.
Copyright © 2014, American Society for Microbiology. All Rights Reserved.
doi:10.1128/JB.01753-14
2912 jb.asm.org Journal of Bacteriology p. 2912–2920 August 2014 Volume 196 Number 16