Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample

James G Caporaso, Christian L. Lauber, William A. Walters, Donna Berg-Lyons, Catherine A. Lozupone, Peter J. Turnbaugh, Noah Fierer, Rob Knight

Research output: Contribution to journalArticle

2373 Citations (Scopus)

Abstract

The ongoing revolution in high-throughput sequencing continues to democratize the ability of small groups of investigators to map the microbial component of the biosphere. In particular, the coevolution of new sequencing platforms and new software tools allows data acquisition and analysis on an unprecedented scale. Here we report the next stage in this coevolutionary arms race, using the Illumina GAIIx platform to sequence a diverse array of 25 environmental samples and three known "mock communities" at a depth averaging 3.1 million reads per sample. We demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of meta-analysis of many studies from the literature (notably, the saline/nonsaline split in environmental samples and the split between host-associated and free-living communities). We also demonstrate that 2,000 Illumina single-end reads are sufficient to recapture the same relationships among samples that we observe with the full dataset. The results thus open up the possibility of conducting large-scale studies analyzing thousands of samples simultaneously to survey microbial communities at an unprecedented spatial and temporal resolution.

Original languageEnglish (US)
Pages (from-to)4516-4522
Number of pages7
JournalProceedings of the National Academy of Sciences of the United States of America
Volume108
Issue numberSUPPL. 1
DOIs
StatePublished - Mar 15 2011
Externally publishedYes

Fingerprint

Meta-Analysis
Software
Research Personnel
Surveys and Questionnaires
Datasets

Keywords

  • Human microbiome
  • Microbial community analysis
  • Microbial ecology
  • Next-generation sequencing

ASJC Scopus subject areas

  • General

Cite this

Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. / Caporaso, James G; Lauber, Christian L.; Walters, William A.; Berg-Lyons, Donna; Lozupone, Catherine A.; Turnbaugh, Peter J.; Fierer, Noah; Knight, Rob.

In: Proceedings of the National Academy of Sciences of the United States of America, Vol. 108, No. SUPPL. 1, 15.03.2011, p. 4516-4522.

Research output: Contribution to journalArticle

Caporaso, JG, Lauber, CL, Walters, WA, Berg-Lyons, D, Lozupone, CA, Turnbaugh, PJ, Fierer, N & Knight, R 2011, 'Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample', Proceedings of the National Academy of Sciences of the United States of America, vol. 108, no. SUPPL. 1, pp. 4516-4522. https://doi.org/10.1073/pnas.1000080107
Caporaso, James G ; Lauber, Christian L. ; Walters, William A. ; Berg-Lyons, Donna ; Lozupone, Catherine A. ; Turnbaugh, Peter J. ; Fierer, Noah ; Knight, Rob. / Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample. In: Proceedings of the National Academy of Sciences of the United States of America. 2011 ; Vol. 108, No. SUPPL. 1. pp. 4516-4522.
@article{16cb2412198140558286f73596dcb556,
title = "Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample",
abstract = "The ongoing revolution in high-throughput sequencing continues to democratize the ability of small groups of investigators to map the microbial component of the biosphere. In particular, the coevolution of new sequencing platforms and new software tools allows data acquisition and analysis on an unprecedented scale. Here we report the next stage in this coevolutionary arms race, using the Illumina GAIIx platform to sequence a diverse array of 25 environmental samples and three known {"}mock communities{"} at a depth averaging 3.1 million reads per sample. We demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of meta-analysis of many studies from the literature (notably, the saline/nonsaline split in environmental samples and the split between host-associated and free-living communities). We also demonstrate that 2,000 Illumina single-end reads are sufficient to recapture the same relationships among samples that we observe with the full dataset. The results thus open up the possibility of conducting large-scale studies analyzing thousands of samples simultaneously to survey microbial communities at an unprecedented spatial and temporal resolution.",
keywords = "Human microbiome, Microbial community analysis, Microbial ecology, Next-generation sequencing",
author = "Caporaso, {James G} and Lauber, {Christian L.} and Walters, {William A.} and Donna Berg-Lyons and Lozupone, {Catherine A.} and Turnbaugh, {Peter J.} and Noah Fierer and Rob Knight",
year = "2011",
month = "3",
day = "15",
doi = "10.1073/pnas.1000080107",
language = "English (US)",
volume = "108",
pages = "4516--4522",
journal = "Proceedings of the National Academy of Sciences of the United States of America",
issn = "0027-8424",
number = "SUPPL. 1",

}

TY - JOUR

T1 - Global patterns of 16S rRNA diversity at a depth of millions of sequences per sample

AU - Caporaso, James G

AU - Lauber, Christian L.

AU - Walters, William A.

AU - Berg-Lyons, Donna

AU - Lozupone, Catherine A.

AU - Turnbaugh, Peter J.

AU - Fierer, Noah

AU - Knight, Rob

PY - 2011/3/15

Y1 - 2011/3/15

N2 - The ongoing revolution in high-throughput sequencing continues to democratize the ability of small groups of investigators to map the microbial component of the biosphere. In particular, the coevolution of new sequencing platforms and new software tools allows data acquisition and analysis on an unprecedented scale. Here we report the next stage in this coevolutionary arms race, using the Illumina GAIIx platform to sequence a diverse array of 25 environmental samples and three known "mock communities" at a depth averaging 3.1 million reads per sample. We demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of meta-analysis of many studies from the literature (notably, the saline/nonsaline split in environmental samples and the split between host-associated and free-living communities). We also demonstrate that 2,000 Illumina single-end reads are sufficient to recapture the same relationships among samples that we observe with the full dataset. The results thus open up the possibility of conducting large-scale studies analyzing thousands of samples simultaneously to survey microbial communities at an unprecedented spatial and temporal resolution.

AB - The ongoing revolution in high-throughput sequencing continues to democratize the ability of small groups of investigators to map the microbial component of the biosphere. In particular, the coevolution of new sequencing platforms and new software tools allows data acquisition and analysis on an unprecedented scale. Here we report the next stage in this coevolutionary arms race, using the Illumina GAIIx platform to sequence a diverse array of 25 environmental samples and three known "mock communities" at a depth averaging 3.1 million reads per sample. We demonstrate excellent consistency in taxonomic recovery and recapture diversity patterns that were previously reported on the basis of meta-analysis of many studies from the literature (notably, the saline/nonsaline split in environmental samples and the split between host-associated and free-living communities). We also demonstrate that 2,000 Illumina single-end reads are sufficient to recapture the same relationships among samples that we observe with the full dataset. The results thus open up the possibility of conducting large-scale studies analyzing thousands of samples simultaneously to survey microbial communities at an unprecedented spatial and temporal resolution.

KW - Human microbiome

KW - Microbial community analysis

KW - Microbial ecology

KW - Next-generation sequencing

UR - http://www.scopus.com/inward/record.url?scp=79952005915&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79952005915&partnerID=8YFLogxK

U2 - 10.1073/pnas.1000080107

DO - 10.1073/pnas.1000080107

M3 - Article

VL - 108

SP - 4516

EP - 4522

JO - Proceedings of the National Academy of Sciences of the United States of America

JF - Proceedings of the National Academy of Sciences of the United States of America

SN - 0027-8424

IS - SUPPL. 1

ER -