TECHNICAL ADVANCE Open Access
Biomarker discovery: quantification of
microRNAs and other small non-coding
RNAs using next generation sequencing
Juan Pablo Lopez
1,2
, Alpha Diallo
1
, Cristiana Cruceanu
1,2
, Laura M. Fiori
1
, Sylvie Laboissiere
3
, Isabelle Guillet
3
,
Joelle Fontaine
3
, Jiannis Ragoussis
2,3
, Vladimir Benes
4
, Gustavo Turecki
1,2
and Carl Ernst
1,2*
Abstract
Background: Small ncRNAs (sncRNAs) offer great hope as biomarkers of disease and response to treatment.
This has been highlighted in the context of several medical conditions such as cancer, liver disease, cardiovascular
disease, and central nervous system disorders, among many others. Here we assessed several steps involved in the
development of an ncRNA biomarker discovery pipeline, ranging from sample preparation to bioinformatic
processing of small RNA sequencing data.
Methods: A total of 45 biological samples were included in the present study. All libraries were prepared using the
Illumina TruSeq Small RNA protocol and sequenced using the HiSeq2500 or MiSeq Illumina sequencers. Small RNA
sequencing data was validated using qRT-PCR. At each stage, we evaluated the pros and cons of different techniques
that may be suitable for different experimental designs. Evaluation methods included quality of data output in relation
to hands-on laboratory time, cost, and efficiency of processing.
Results: Our results show that good quality sequencing libraries can be prepared from small amounts of total RNA and
that varying degradation levels in the samples do not have a significant effect on the overall quantification of sncRNAs
via NGS. In addition, we describe the strengths and limitations of three commercially available library preparation
methods: (1) Novex TBE PAGE gel; (2) Pippin Prep automated gel system; and (3) AMPure XP beads. We describe
our bioinformatics pipeline, provide recommendations for sequencing coverage, and describe in detail the expression
and distribution of all sncRNAs in four human tissues: whole-blood, brain, heart and liver.
Conclusions: Ultimately this study provides tools and outcome metrics that will aid researchers and clinicians in
choosing an appropriate and effective high-throughput sequencing quantification method for various study designs,
and overall generating valuable information that can contribute to our understanding of small ncRNAs as potential
biomarkers and mediators of biological functions and disease.
Keywords: Biomarker, microRNA, Small non-coding RNA, Next-generation sequencing, Small RNA sequencing,
Whole-blood, Brain, Heart, Liver, Clinical samples
* Correspondence: carl.ernst@mcgill.ca
1
McGill Group for Suicide Studies (MGSS), Douglas Mental Health University
Institute, McGill University, Frank B Common Pavilion, Room F-2101.2, 6875
LaSalle Boulevard, Montreal, QC H4H 1R3, Canada
2
Department of Human Genetics, McGill University, Montreal, QC, Canada
Full list of author information is available at the end of the article
© 2015 Lopez et al. This is an Open Access article distributed under the terms of the Creative Commons Attribution License
(http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium,
provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://
creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
Lopez et al. BMC Medical Genomics (2015) 8:35
DOI 10.1186/s12920-015-0109-x