Overview
DNA barcoding demystified
Andrew Mitchell*
NSW Department of Primary Industries, Wagga WaggaAgricultural Institute, Private Mail Bag, Wagga Wagga, NSW
2650, Australia.
Key words barcode, DNA taxonomy, molecular diagnostics.
INTRODUCTION
Only 10% of the earth’s biota has been described despite
250 years of taxonomic research (Wilson 2000). This is in large
part a reflection of the extent and complexity of biological
diversity, but it is also true that traditional taxonomic techniques
are labourious and highly specialised, and taxonomic expertise
is very thinly spread across the myriad groups of life (Scotland
et al. 2003). As a result, obtaining species identifications, even
for a modest sample of commonly encountered insects in a
biologically diverse country, such as Australia, is at best time-
consuming and costly and all too often impossible. This is a
result of the difficulty that morphological taxonomic techniques
encounter in dealing with phenotypic differences between
immature and mature stages of a species, and among individu-
als of variable species, the lack of phenotypic differences
among cryptic species, shifts in geographical distribution as a
result of invasions or range expansions and the overwhelming
diversity of life. Our reliance on morphological approaches,
thus, places serious limitations on the diagnosis of biological
diversity at a time when we are facing a biodiversity crisis of
unprecedented magnitude. It has long been clear that new
approaches to biodiversity assessment, description and diagno-
sis are needed, and indeed, the 1990s saw the emergence of
many novel taxonomic initiatives, leading E.O. Wilson to con-
clude that ‘the 19th century culture of taxonomy has begun to be
replaced’ (Wilson 2003). The emerging discipline of DNA
barcoding (Floyd et al. 2002; Hebert et al. 2003) represents
another advance and part of the solution to these problems. This
paper provides a brief overview of DNA barcoding, and con-
siders what barcoding offers researchers and industry alike.
WHAT ARE DNA BARCODES AND ARE
THEY NOVEL?
DNA barcodes are short DNA sequences (500 base pairs)
that can be used to identify species. The ‘barcode’ analogy
refers, of course to the universal product codes (UPC) affixed
to merchandise in retail stores. The analogy suggests that DNA
barcodes similarly provide a universal system of readily scan-
nable, unique tags for products (species), speeding and auto-
mating transactions (identifications) and facilitating novel
applications, such as automated stock management (biodiver-
sity assessment) and so on. Unfortunately, the barcode analogy
also has been used by the microbial diagnostics community to
refer to the banding pattern of rep-polymerase chain reaction
(PCR) products (a technique similar in principle to Random
Amplification of Polymorphic DNA (RAPD)-PCR) visualised
on a gel, which bears some resemblance to UPCs (Sutton &
Cundell 2004). Love it or hate it, the term ‘DNA barcoding’
has caught the public imagination, and is likely here to stay.
DNA sequence data from the mitochondrial genome have
long been used to resolve difficult taxonomic questions and
diagnose species in cases where morphology alone has proved
inadequate (e.g. Brown et al. 1999; Mitchell & Samways
2005), so is DNA barcoding just molecular systematics and
molecular diagnostics repackaged or is it a truly novel enter-
prise? The novelty in DNA barcoding is its combination of
four main factors: standardisation on a particular gene region,
the large scale of operation, compulsory vouchering of speci-
mens and active curation of the resulting databases.
First, standardisation on a particular gene region for all
species in a particular branch of life leads to universality of the
system. For animals the standard is the upstream half of the
mitochondrial gene cytochrome c oxidase subunit I (COI or
cox1). Without standardisation, molecular systematic data sets
lack interoperability, building ‘Towers of Babel’ (Caterino
et al. 2000). The same is true of molecular diagnostics – the
literature contains a plethora of highly specialised methods for
identifying particular organisms, and this leads to some circu-
larity in that one has to have a well-developed idea of an
organism’s identity (e.g. which genus) to even know which
diagnostic test to apply. Furthermore, non-sequence-based
diagnostic tests are incapable of uniquely identifying novel
isolates, and must be redeveloped if novel isolates are discov-
ered. Standardisation allows for the generalisation of the pro-
tocols and skills needed to implement diagnostics so that
identifications can be performed by technicians with generic
molecular biology skills.
Second, the use of high-throughput techniques adapted from
genomics allows small molecular biology laboratories to
process hundreds of samples per day (thousands if they use *andrew.mitchell@dpi.nsw.gov.au
Australian Journal of Entomology (2008) 47, 169–173
© 2008 The Author
Journal compilation © 2008 Australian Entomological Society doi:10.1111/j.1440-6055.2008.00645.x