NEWS AND VIEWS
learning whether these chemical leads can be
translated into new therapies in areas of
unmet medical need.
1. Peterson R.T. et al. Nat. Biotechnol. 22, 595–599
(2004).
2. Mayer, T.U. et al. Science 286, 971–974 (1999).
3. Stern, H.M. et al. Patent application WO03052106
(2004).
4. Peterson, R.T. et al. Proc. Natl. Acad. Sci. USA 97,
12965–12969 (2000).
5. Clader, J.W. J. Med. Chem. 47, 1–9 (2004).
6. Schafer, G. Diabetes Metab. 9, 148–163 (1983).
7. Kothari, S.A. et al. J. Thromb. Thrombolysis 15,
217–225 (2003).
8. Service, R.F. Science 303, 1798 (2004).
9. Margolis, J. & Duyk, G. Nat. Biotechnol. 16, 311
(1998).
10. Fischer, A. & Gessler, M. Trends Cardiovasc. Med. 6,
221–226 (2003).
Analysis of the T. thermophilus HB27 gen-
ome sequence reveals a variety of interesting
pathways and enzymes, including metabolic
pathways for vitamin B12 and carotenoid
biosynthesis, as well as various proteases,
glycoside hydrolases, phosphatases, oxidore-
ductases and DNA-modifying enzymes.
Certain T. thermophilus enzymes are inter-
esting candidates for biotechnological app-
lications
5
, such as molecular biological
techniques for research and diagnostic use
(DNA and RNA processing enzymes), in the
starch industry (starch-converting enzymes),
or in the food, detergent or leather industries
(proteases). Turning interesting enzymes
identified from genome information into
marketable enzyme is a long and arduous
process, however. Success will depend not
only on the catalytic and stability properties
of a particular enzyme of interest but also on
its performance under the often-stringent
conditions of existing or newly developed
industrial processes.
One of the advantages of using ther-
mophilic eubacteria rather than archaea as a
source of industrial enzyme-encoding seq-
uences is that eubacterial genes are often
more readily expressed in Escherichia coli or
other standard bacterial cloning hosts than
archaeal genes. In fact, initial expression
experiments often work with a recombinant
genes’ own expression signals, which can
greatly simplify cloning and expression
screening procedures. Efficient expression in
standard host bacteria is of further benefit
when embarking on large-scale production of
an enzyme of interest.
Even with these advantages, however, sev-
eral potential obstacles may also complicate
recombinant expression of T. thermophilus
proteins. The first problem in obtaining high-
level expression may be the unusual composi-
tion of T. thermophilus genes. The genus
Thermus belongs to a deeply branching line-
age in the bacterial evolutionary tree, with
DNA with a high (69.4 %) G+C content.
Although more practical experience is needed
to show how well heterologous expression of
T. thermophilus genes of interest work in stan-
dard expression systems, several T. aquaticus
and T. thermophilus genes have already been
successfully expressed in E. coli. And even if
high-level expression of a particularly inter-
esting open reading frame (ORF) were to fail,
resynthesis of the entire gene, taking into
account the codon usage of the host organ-
ism, could provide a solution.
A second problem facing recombinant
expression is folding. It is not clear whether
recombinant expression in a mesophilic bac-
terium will deleteriously affect the folding of
524 VOLUME 22 NUMBER 5 MAY 2004 NATURE BIOTECHNOLOGY
The genome sequences of at least 14 extre-
mely thermophilic microorganisms (with
growth optima above 70 °C) and hyperther-
mophilic microorganisms (optimum growth
temperature at least 80 °C) have now been
determined. The latest to be completed—
reported in this issue by Henne et al.
1
—is the
genome of Thermus thermophilus, an aerobic,
obligately heterotrophic eubacterium capable
of growth at temperatures up to ∼85 °C.
T. thermophilus is one of only four extremely
thermophilic eubacteria sequenced to date.
With the genome sequence in hand and its
relative amenability to genetic manipulation,
T. thermophilus is now positioned as an inter-
esting model organism for extremely ther-
mophilic eubacteria. In the longer term, the
availability of its genome sequence, which
consists of a 1.9-Mbp chromosome and a
0.23-Mbp megaplasmid, should facilitate the
identification of novel proteins and pathways
of potential biotechnological interest.
A survey of prokaryotic genome data avail-
able over the web
2
shows that, thus far, many
more eubacterial genomes (140) than arch-
aeal genomes (18) have been sequenced (a
ratio of about 8:1)
2
. Of the 14 genomes of
extreme thermophiles and hyperthermo-
philes, 10 are from archaea and only 4 from
eubacteria. The three eubacteria previously
sequenced—the chemolithoautotrophic hyp-
erthermophile Aquifex aeolicus, the chemo-
organotrophic hyperthermophile Thermo-
toga maritima and the chemoorganotrophic
extreme thermophile Thermoanaerobacter
tengcongensis—have now been joined by
Thermus thermophilus
1
. The abundance of
genomes from archaea reflects the high repre-
sentation of these microorganisms in extreme
environments as compared with eubacteria.
Studies on the biochemical, biophysical
and structural characterization of enzymes
from extreme thermophiles (particularly
hyperthermophilic bacteria and archaea, such
as Thermotoga or Pyrococcus sp.) have led to a
wealth of information on thermostable bio-
catalysts and other basic aspects of protein
stabilization
3,4
. Because of their inherently
robust nature, their insensitivity toward
denaturing conditions (not only heat, but also
chemical denaturants and organic solvents)
and their reduced conformational flexibility
at ambient temperature, proteins from
extreme thermophiles are often the preferred
choices as substrates for X-ray crystallogra-
phy studies or templates for protein engineer-
ing. The above properties are also desirable
for biotechnological applications, where not
are only biocatalytic characteristics (e.g., reac-
tion specificity or conversion rates) impor-
tant, but also ease of handling, shelf-life and
stability may be critical.
Tth (T. thermophilus) and Taq (Thermus
aquaticus) DNA polymerases are already
familiar to many molecular biologists as com-
mercialized enzymes routinely employed in
nucleic acid amplification procedures (e.g.,
polymerase chain reaction (PCR) amplifica-
tion and reverse transcriptase (RT)-PCR).
But with the T. thermophilus genome now in
hand, we can expect to uncover much more
about the unique and exotic enzymatic
machinery of this exceptional organism.
Wolfgang Liebl is at the Institut für
Mikrobiologie und Genetik at the Georg-
August-Universität, Grisebachstrasse 8,
D-37077 Göttingen, Germany.
e-mail: wliebl@gwdg.de
Genomics taken to the extreme
Wolfgang Liebl
The genome sequence of Thermus thermophilus may provide new insights
into heat-tolerant enzymes and metabolic pathways of biotechnological
potential.
© 2004 Nature Publishing Group http://www.nature.com/naturebiotechnology