About OpenKnowledge@NAU | For NAU Authors

MetaGeniE: Characterizing human clinical samples using deep metagenomic sequencing

Rawat, Arun and Engelthaler, David M. and Driebe, Elizabeth M. and Keim, Paul and Foster, Jeffrey T. (2014) MetaGeniE: Characterizing human clinical samples using deep metagenomic sequencing. PLoS ONE, 9 (11). e11091. ISSN 1932-6203

Rawat_A_etal_2014_MetaGeniE_Characterizing_Human.pdf - Published Version
Available under License Creative Commons Attribution.

Download (1MB) | Preview
Publisher’s or external URL: http://dx.doi.org/10.1371/journal.pone.0110915


With the decreasing cost of next-generation sequencing, deep sequencing of clinical samples provides unique opportunities to understand host-associated microbial communities. Among the primary challenges of clinical metagenomic sequencing is the rapid filtering of human reads to survey for pathogens with high specificity and sensitivity. Metagenomes are inherently variable due to different microbes in the samples and their relative abundance, the size and architecture of genomes, and factors such as target DNA amounts in tissue samples (i.e. human DNA versus pathogen DNA concentration). This variation in metagenomes typically manifests in sequencing datasets as low pathogen abundance, a high number of host reads, and the presence of close relatives and complex microbial communities. In addition to these challenges posed by the composition of metagenomes, high numbers of reads generated from high-throughput deep sequencing pose immense computational challenges. Accurate identification of pathogens is confounded by individual reads mapping to multiple different reference genomes due to gene similarity in different taxa present in the community or close relatives in the reference database. Available global and local sequence aligners also vary in sensitivity, specificity, and speed of detection. The efficiency of detection of pathogens in clinical samples is largely dependent on the desired taxonomic resolution of the organisms. We have developed an efficient strategy that identifies “all against all” relationships between sequencing reads and reference genomes. Our approach allows for scaling to large reference databases and then genome reconstruction by aggregating global and local alignments, thus allowing genetic characterization of pathogens at higher taxonomic resolution. These results were consistent with strain level SNP genotyping and bacterial identification from laboratory culture.

Item Type: Article
ID number or DOI: 10.1371/journal.pone.0110915
Keywords: Bacterial genomics; Biology and life sciences; Communities; Computational biology; cystic-fibrosis; datasets; DNA sequencing; Ecology; Genetics; Genome analysis; Genomic databases; Genomics; Metagenomics; Microbes; microbial ecology; Microbial genomics; microbiome; Microorganisms; pathogen identification; platforms; Research Article; short read alignment; Single nucleotide polymorphisms; specimens; tool
Subjects: Q Science > QH Natural history > QH301 Biology
Q Science > QH Natural history > QH426 Genetics
NAU Depositing Author Academic Status: Faculty/Staff
Department/Unit: Research Centers > Center for Microbial Genetics and Genomics
Date Deposited: 16 Oct 2015 17:52
URI: http://openknowledge.nau.edu/id/eprint/1700

Actions (login required)

IR Staff Record View IR Staff Record View


Downloads per month over past year