Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data

Jason W. Sahl, James M. Schupp, David A. Rasko, Rebecca E. Colman, Jeffrey T. Foster, Paul Keim

Research output: Contribution to journalArticle

19 Scopus citations

Abstract

We describe an approach for genotyping bacterial strains from low coverage genome datasets, including metagenomic data from complex samples. Sequence reads from unknown samples are aligned to a reference genome where the allele states of known SNPs are determined. The Whole Genome Focused Array SNP Typing (WG-FAST) pipeline can identify unknown strains with much less read data than is needed for genome assembly. To test WG-FAST, we resampled SNPs from real samples to understand the relationship between low coverage metagenomic data and accurate phylogenetic placement. WG-FAST can be downloaded from https://github.com/jasonsahl/wgfast.

Original languageEnglish (US)
Article number52
JournalGenome Medicine
Volume7
Issue number1
DOIs
StatePublished - Jun 9 2015

ASJC Scopus subject areas

  • Molecular Medicine
  • Molecular Biology
  • Genetics
  • Genetics(clinical)

Fingerprint Dive into the research topics of 'Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data'. Together they form a unique fingerprint.

  • Cite this