Details zur Publikation

Kategorie Textpublikation
Referenztyp Zeitschriften
DOI 10.1093/gigascience/giaa135
Lizenz creative commons licence
Titel (primär) Dadasnake, a Snakemake implementation of DADA2 to process amplicon sequencing data for microbial ecology
Autor Weißbecker, C.; Schnabel, B.; Heintz-Buschart, A.
Quelle GigaScience
Erscheinungsjahr 2020
Department BOOEK; iDiv
Band/Volume 9
Heft 12
Seite von giaa135
Sprache englisch
Daten-/Softwarelinks http://doi.org/10.5281/zenodo.4190897
Supplements https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/gigascience/9/12/10.1093_gigascience_giaa135/2/giaa135_supplemental_files.zip?Expires=1616078524&Signature=TvgHSN-Z6SeTRDuzf0LZ1K8pQr7yBr5ZznfV~2SZSfO4O~65JwaAs2jYCGOVPsBC5UKBQuu9SEmUWhXNQdSCjmlNYv9xMygz8FXr05XqTWLUgv5fPdOk5PYCU~dgiAPzJvsiTVKwFgtNspnW5tU7omwnYOwfuciXEszB3sxicjeS33cvpse496BKVJqWm8l2lx5xrqtp-5gor-x~MAl6LrPYlXmF3zs2SJ3g2V7vfjrg8b7GkiK0L4rHgsR04cUeoJoRB6s~XKWDxNzgz0NQmexQu-luUEgRU09UrZB81cvHRM0LoAaNeGkVwSdxScMvv0TDVrQBMWWWbMDpkXmGZQ__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA
Keywords rRNA gene sequence analysis; denoising; exact sequence variants; R; pipeline; microbiome; community structure
Abstract

Background

Amplicon sequencing of phylogenetic marker genes, e.g., 16S, 18S, or ITS ribosomal RNA sequences, is still the most commonly used method to determine the composition of microbial communities. Microbial ecologists often have expert knowledge on their biological question and data analysis in general, and most research institutes have computational infrastructures to use the bioinformatics command line tools and workflows for amplicon sequencing analysis, but requirements of bioinformatics skills often limit the efficient and up-to-date use of computational resources.

Results

We present dadasnake, a user-friendly, 1-command Snakemake pipeline that wraps the preprocessing of sequencing reads and the delineation of exact sequence variants by using the favorably benchmarked and widely used DADA2 algorithm with a taxonomic classification and the post-processing of the resultant tables, including hand-off in standard formats. The suitability of the provided default configurations is demonstrated using mock community data from bacteria and archaea, as well as fungi.

Conclusions

By use of Snakemake, dadasnake makes efficient use of high-performance computing infrastructures. Easy user configuration guarantees flexibility of all steps, including the processing of data from multiple sequencing platforms. It is easy to install dadasnake via conda environments. dadasnake is available at https://github.com/a-h-b/dadasnake.

dauerhafte UFZ-Verlinkung https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=23965
Weißbecker, C., Schnabel, B., Heintz-Buschart, A. (2020):
Dadasnake, a Snakemake implementation of DADA2 to process amplicon sequencing data for microbial ecology
GigaScience 9 (12), giaa135 10.1093/gigascience/giaa135