476 | VOL.6 NO.7 | JULY 2009 | NATURE METHODS CORRESPONDENCE profiling and qPCR recovered a nonuniform distribution of microRNAs (Fig. 2a); we observed up to four orders of magnitude dif- ference between the most and least frequently detected microRNAs. Only 61% (SREK-SOLiD) and 52% (modban-Solexa) of the microRNAs varied within a single order of magnitude (Fig. 2a). These results showed the inherent quantification biases of both DGE profiling and qPCR based on microRNA sequence, complicating comparison of microRNA amounts in a sample. Correction of the biological dataset with the frequency bias obtained using the synthetic RNA pool did not improve the correlation between the library-preparation methods (data not shown). We therefore used the synthetic small RNA dataset to explore the potential basis of systematic biases. Although we found clear effects of certain terminal mono- and dinucleotides ( Supplementary Fig. 4), we could not identify a satisfactory correction model based on primary (RNA sequence) and secondary (for example, folding characteristics) parameters (Supplementary Fig. 5 and Supplementary Note). This might be explained by our observation that even single nucleotide differences influenced the read frequencies (Supplementary Fig. 6). RNA ligase preferences 7 may contribute to the observed different terminal nucleotides over the read frequency spectrum. In addition, the reverse-transcriptase reaction as well as the PCR could be a contributor to the bias 8 . To determine whether DGE profiling allows for differential expression analysis, we sequenced small RNA libraries from rat spleen and liver (SREK-SOLiD). In parallel, we analyzed the input RNA by qPCR. Similar to our previous results, qPCR data differed substantially from the read frequencies within a sample (Fig. 2b). However, differential expression results between samples obtained by qPCR and DGE profiling were strongly correlated (Fig. 2b), showing that the systematic biases do not prohibit the comparison of relative microRNA amounts between samples. Despite the limitations described here, small RNA profiling by DGE is the method of choice for studying small RNA expres- sion. In contrast to most other existing methods, DGE profiling is hybridization-independent, accurate in discriminating microRNA family members that differ by only a single nucleotide, capable of detecting 5and 3end variability (for example, isoMirs), and as the approach does not require a priori information, it can be used to simultaneously detect known and discover new biomolecules. Note: Supplementary information is available on the Nature Methods website. ACKNOWLEDGMENTS This work was supported by the Cancer Genomics Center and Netherlands Bioinformatic Center through Netherlands Genomics Initiative (to E.C.), Fred Hutchinson Cancer Research Center and Canary Foundation funding (New Development funds to M.T.), Core Center of Excellence in Hematology Pilot Grant P30 DK56465 (to M.T.), Pacific Northwest Prostate Cancer Specialized Program of Research Excellence Grant P50 CA97186 (to M.T.) and a Rosetta Inpharmatics Fellowship in Molecular Profiling (to S.K.W.). We thank P. Toonen for animal care. COMPETING INTERESTS STATEMENT The authors declare competing financial interests: details accompany the full- text HTML version of the paper at http://www.nature.com/naturemethods/. Sam E V Linsen 1,7 , Elzo de Wit 1,7 , Georges Janssens 1 , Sheila Heater 2 , Laura Chapman 2 , Rachael K Parkin 3 , Brian Fritz 3,6 , Stacia K Wyman 3 , Ewart de Bruijn 1 , Emile E Voest 4 , Scott Kuersten 2 , Muneesh Tewari 3,5 & Edwin Cuppen 1 1 Hubrecht Institute and University Medical Center Utrecht, Cancer Genomics Center, Utrecht, The Netherlands. 2 Life Technologies, Austin, Texas, USA. 3 Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA. 4 Department of Medical Oncology, University Medical Center Utrecht, Utrecht, The Netherlands. 5 Clinical Research Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA. 6 Present address: Illumina, Inc., San Diego, California, USA. 7 These authors contributed equally to this work. e-mail: e.cuppen@niob.knaw.nl 1. Berezikov, E., et al. Genome Res. 16, 1289–1298 (2006). 2. Ruby, J.G. et al. Cell 127, 1193–1207 (2006). 3. Kuchenbauer, F. et al. Genome Res. 18, 1787–1797 (2008). 4. Berezikov, E. et al. Nat. Genet. 38, 1375–1377 (2006). 5. Lau, N.C., Lim, L.P., Weinstein, E.G. & Bartel, D.P. Science 294, 858–862 (2001). 6. Griffiths-Jones, S., Grocock, R.J., van Dongen, S., Bateman, A. & Enright, A.J. Nucleic Acids Res. 34, D140–D144 (2006). 7. Romaniuk, E., McLaughlin, L.W., Neilson, T. & Romaniuk, P.J. Eur. J. Biochem. 125, 639–643 (1982). 8. Taube, R., Loya, S., Avidan, O., Perach, M. & Hizi, A. Biochem. J. 329, 579–587 (1998). RNAiCut: automated detection of significant genes from functional genomic screens To the Editor: RNA interference (RNAi) is a popular functional genomic technology for identifying genes involved in a biological process. Although higher scores for genes in an RNAi screen suggest more central roles in the pathway, estimating the score threshold separating pathway- or process-relevant hits from noise remains dif- ficult (Supplementary Table 1) and is typically done manually. To overcome this subjective approach, we built a fully auto- mated system, RNAiCut, that objectively and robustly identifies score thresholds from functional genomic data by introducing the use of the connectivity of subgraphs of protein-protein interac- tion (PPI) networks 1,2 . Unlike some previous work 3 , our method does not overlap RNAi and PPI data to find interacting regulators. Instead, its guiding hypothesis is that true positive hits in an RNAi experiment are densely interconnected in the PPI network. For the k highest-scoring genes (k = 1, 2, 3…), RNAiCut computes the edge count of the induced subgraph and estimates the P-value of finding a PPI subgraph of at least this size that is induced by k randomly cho- sen nodes that have the same degrees as these genes (Supplementary Methods and Supplementary Results). The plot of these P-values as a function of k is typically V-shaped, and we take the global min- imum as the score threshold (Fig. 1). We used RNAiCut to com- pute thresholds for several Drosophila melanogaster RNAi screens 4 (Supplementary Figs. 110 and Supplementary Tables 23). RNAiCut chose successful thresholds, as measured by Gene Ontology (GO) 5 enrichment: the gene lists with above-threshold scores were enriched for functions relevant to the screen, compared to the rest (Supplementary Table 4). When the manual screener’s threshold was later in the ranked list of hits than the RNAiCut threshold, choosing RNAiCut’s threshold may reduce the poten- tially high number of false positives. When RNAiCut’s threshold was later, the GO enrichment for RNAiCut’s cutoff was at least as good as for the manually determined cutoff, revealing additional pathway-relevant genes (Supplementary Results). Although some of the additional hits identified by RNAiCut may be false positives, analyzing them may be useful given their apparent connectivity to © 2009 Nature America, Inc. All rights reserved.