BioConductor
dada2 1.8.0
Accurate, high-resolution sample inference from amplicon sequencing data
Released Jan 30, 2018 by Benjamin Callahan
Dependencies
RcppParallel 4.4.1 data.table 1.11.4 ShortRead 1.38.0 ggplot2 3.0.0 Biostrings 2.48.0 BiocGenerics 0.26.0 XVector 0.20.0 IRanges 2.14.10 Rcpp reshape2 1.4.3
The dada2 package infers exact amplicon sequence variants (ASVs) from high-throughput amplicon sequencing data, replacing the coarser and less accurate OTU clustering approach. The dada2 pipeline takes as input demultiplexed fastq files, and outputs the sequence variants and their sample-wise abundances after removing substitution and chimera errors. Taxonomic classification is available via a native implementation of the RDP naive Bayesian classifier, and genus-species assignment by exact matching.