Draft Genome Sequence of Pseudomonas aeruginosa Strain ATCC 27853 Xiangqun Fang, a Zhiwei Fang, b Jiao Zhao, b Yuanqiang Zou, b Tianzhi Li, a Junfeng Wang, a Yinghua Guo, a De Chang, a Longxiang Su, a Peixiang Ni, b and Changting Liu a Nanlou Respiratory Diseases Department, Chinese PLA General Hospital, Beijing, China, a and BGI-Shenzhen, Shenzhen, People’s Republic of China b Pseudomonas aeruginosa is a common bacterium that can cause disease. The versatility of Pseudomonas aeruginosa enables the organism to infect damaged tissues or those with reduced immunity which cause inﬂammation and sepsis. Here we report the genome sequence of the strain ATCC 27853. P seudomonas aeruginosa is a common bacterium that can cause disease in animals and humans. It is found in soil, water, skin ﬂora, and most man-made environments throughout the world. It is an opportunistic pathogen for both humans and plants (2). Pseu- domonas aeruginosa ATCC 27853 is usually used to test antimicrobial activity (6). Its genome sequencing will help us to understand the pathogenesis of this pathogen. Pseudomonas aeruginosa ATCC 27853 was obtained from the China General Microbiological Culture Col- lection Center (CGMCC) as CGMCC 1.2387. A total of 850 million base pairs of reads were generated using Illumina Genome Analyzer II at BGI-Shenzhen (BGI; Shenzhen, China), and 296 contigs in 124 scaffolds were assembled with the SOAPdenovo program (4) based on paired-end reads. The ge- nome sequence of the strain ATCC 27853 was estimated to be 6.46 Mb in size based on 15Kmer analysis, while a total of 6,887,913 bp (scaffold length) were assembled and the G+C content was deter- mined to be 66.15%. All reads provided about 111-fold coverage of the genome. Fifty-four copies of tRNA genes were predicted by the tRNAscan-SE server (5). The prediction of rRNA copies was not accurate because the short reads (average length of 100 bp) generated by Illumina technology are not suitable for its predic- tion. In addition, 5S, 16S, and 23S rRNAs were identiﬁed using RNAmmer (3), and the total lengths were 114 bp, 1,523 bp, and 2,888 bp, respectively. In total, 6,474 putative open reading frames were identiﬁed using Glimmer v.3.0 (1), with the length ranging from 114 bp to 13,029 bp, and the average length was 946 bp, giving a coding intensity of 88.68%. A total of 3,240 protein-cod- ing sequences (CDSs) were located in one strand, and the other 3,243 were located in the other strand. All CDSs were translated into amino acid sequences and searched against the COG, KEGG, Swiss-Prot, TrEMBL, and NR databases. Based on the searching results, 3,391 CDSs were classiﬁed into 22 functional COG groups, and 5,403, 2,959, 6,196, and 3,806 CDSs were assigned to the NR, Swiss-Prot, TrEMBL, and KEGG databases, respectively. Two hundred thirty-two CDSs have no annotation information in these databases at all. Comparative genomic analyses were performed using the genome sequence of Pseudomonas aeruginosa PAO1 (GenBank accession number NC_002516.2) as a reference. All reads were aligned against strain PAO1 using SOAPaligner, and the genome coverage was deter- mined to be 95% and the genome depth was 120. Accordingly, 95% of the total gene length of PAO1 was covered by our reads. All four known plasmids (GenBank accession numbers NC_008357, NC_009739, NC_010722, and NC_007100) found in Pseudomo- nas aeruginosa were also aligned with the reads, using SOAPaligner to determine the possible plasmid in our strain. Unfortunately, the genome coverage was less than 10% for all four plasmids. A plasmid (GenBank accession number NC_010891) derived from Pseudomo- nas sp. CT14 was covered more than 31%, and the covered region was composed of two fragments and the coverage was 120, indicating that the two fragments might have originated from this plasmid and been integrated into the genome. Nucleotide sequence accession numbers. This whole-genome shotgun project has been deposited at DDBJ/EMBL/GenBank un- der the accession number AJKG00000000. The version described in this paper is the ﬁrst version, AJKG01000000. ACKNOWLEDGMENTS This work was supported by the Key Pre-Research Foundation of Military Equipment of China (grant no. 9140A26040312JB1001), the opening foundation of the State Key Laboratory of Space Medicine Fundamentals and Application, Chinese Astronaut Research and Training Center (no. SMFA11K02), the Special Financial Grant from the China Postdoctoral Science Foundation (no. 201104776), and the National Natural Science Foundation of China (no. 81000018). REFERENCES 1. Delcher AL, Bratke KA, Powers EC, Salzberg SL. 2007. Identifying bac- terial genes and endosymbiont DNA with Glimmer. Bioinformatics 23: 673– 679. 2. Iglewski BH. 1996. Pseudomonas, ch 27. In Baron S (ed), Medical microbiol- ogy, 4th ed. University of Texas Medical Branch, Galveston, TX. 3. Lagesen K, et al. 2007. RNAmmer: consistent and rapid annotation of ribosomal RNA genes. Nucleic Acids Res. 35:3100 –3108. 4. Li R, et al. 2010. De novo assembly of human genomes with massively parallel short read sequencing. Genome Res. 20:265–272. 5. Lowe TM, Eddy SR. 1997. tRNAscan-SE: a program for improved detec- tion of transfer RNA genes in genomic sequence. Nucleic Acids Res. 25: 955–964. 6. Matzneller P, Manaﬁ M, Zeitlinger M. 2011. Antimicrobial effect of statins: organic solvents might falsify microbiological testing results. Int. J. Clin. Pharmacol. Ther. 49:666 – 671. Received 24 April 2012 Accepted 30 April 2012 Address correspondence to Peixiang Ni, nipx@genomics.cn, or Changting Liu, liuchangt@gmail.com. X.F. and Z.F. contributed equally to this work. Copyright © 2012, American Society for Microbiology. All Rights Reserved. doi:10.1128/JB.00690-12 GENOME ANNOUNCEMENT July 2012 Volume 194 Number 14 Journal of Bacteriology p. 3755 jb.asm.org 3755 on February 26, 2020 by guest http://jb.asm.org/ Downloaded from