Phylogenetically typing bacterial strains from partial SNP genotypes observed from direct sequencing of clinical specimen metagenomic data

Jason W. Sahl, James M. Schupp, David A. Rasko, Rebecca E. Colman, Jeffrey T Foster, Paul S Keim

Research output: Contribution to journal › Article

19 Scopus citations


We describe an approach for genotyping bacterial strains from low coverage genome datasets, including metagenomic data from complex samples. Sequence reads from unknown samples are aligned to a reference genome where the allele states of known SNPs are determined. The Whole Genome Focused Array SNP Typing (WG-FAST) pipeline can identify unknown strains with much less read data than is needed for genome assembly. To test WG-FAST, we resampled SNPs from real samples to understand the relationship between low coverage metagenomic data and accurate phylogenetic placement. WG-FAST can be downloaded from
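
The genotyping idea in the abstract (determine allele states at known SNP positions from sparse reads, then relate the partial genotype to known strains) can be illustrated with a minimal sketch. This is not the WG-FAST implementation. The function names, the `min_depth` and `min_fraction` thresholds, the toy SNP panel, and the mismatch-fraction ranking (a simple stand-in for true phylogenetic placement) are all assumptions made for illustration.

```python
from collections import Counter

MISSING = "N"  # placeholder for SNP positions with no confident call


def call_alleles(pileup, snp_positions, min_depth=1, min_fraction=0.9):
    """Call one allele per known SNP position from observed read bases.

    pileup maps a reference position to the list of bases observed there.
    Positions with too few reads, or without a clear majority base, are
    reported as MISSING; this is what makes the genotype "partial".
    The thresholds are hypothetical defaults, not values from the paper.
    """
    calls = {}
    for pos in snp_positions:
        bases = pileup.get(pos, [])
        if len(bases) < min_depth:
            calls[pos] = MISSING
            continue
        base, count = Counter(bases).most_common(1)[0]
        calls[pos] = base if count / len(bases) >= min_fraction else MISSING
    return calls


def closest_strains(calls, panel):
    """Rank reference strains by mismatch fraction over shared called SNPs.

    panel maps a strain name to its {position: allele} genotype.
    Returns (strain, (mismatch_fraction, n_compared)) pairs, best first.
    """
    scores = {}
    for strain, genotype in panel.items():
        compared = mismatches = 0
        for pos, allele in calls.items():
            ref_allele = genotype.get(pos, MISSING)
            if allele == MISSING or ref_allele == MISSING:
                continue  # skip positions missing on either side
            compared += 1
            mismatches += allele != ref_allele
        if compared:
            scores[strain] = (mismatches / compared, compared)
    # Fewest mismatches first; ties broken by more compared positions.
    return sorted(scores.items(), key=lambda kv: (kv[1][0], -kv[1][1]))


# Toy data (entirely made up): four known SNPs and three reference strains.
snp_positions = [101, 250, 377, 512]
panel = {
    "strain_A": {101: "A", 250: "C", 377: "G", 512: "T"},
    "strain_B": {101: "A", 250: "T", 377: "G", 512: "C"},
    "strain_C": {101: "G", 250: "T", 377: "A", 512: "C"},
}
# Low-coverage sample: position 377 is uncovered, 512 is ambiguous.
pileup = {101: ["A", "A"], 250: ["T"], 512: ["C", "C", "T"]}

calls = call_alleles(pileup, snp_positions)
ranked = closest_strains(calls, panel)
print(calls)
print(ranked)
```

In this toy case only two of the four SNPs receive a call, yet they are enough to separate `strain_B` from the other two references. The number of compared positions is returned with each score so that a match resting on very few SNPs can be treated with caution.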

Original language: English (US)
Article number: 52
Journal: Genome Medicine
Issue number: 1
State: Published - Jun 9 2015


ASJC Scopus subject areas

  • Genetics (clinical)
  • Genetics
  • Molecular Biology
  • Molecular Medicine
