Publication Details

Category Text Publication
Reference Category Preprints
DOI 10.1101/2023.05.02.539054
Title (Primary) Simulation of 69 microbial communities indicates sequencing depth and false positives are major drivers of bias in Prokaryotic metagenome-assembled genome recovery
Author Nunes da Rocha, U.; Kasmanas, J.C.; Toscan, R.; Sanches, D.S.; Magnúsdóttir, S. ORCID logo ; Saraiva, J.P.
Source Titel bioRxiv
Year 2023
Department UMB
Language englisch
Topic T7 Bioeconomy
Abstract

We hypothesize that sample evenness, sequencing depth and taxonomic relatedness influence the recovery of metagenome-assembled genomes (MAGs). To test this hypothesis, we assessed MAG recovery in three in silico microbial communities composed of 42 species with the same richness but different sample evenness, sequencing depth and taxonomic distribution profiles using three different pipelines for MAG recovery.

The pipeline developed by Parks and colleagues (8K) generated the highest number of MAGs and the lowest number of true positives per community profile. The pipeline by Karst and colleagues (DT) showed the most accurate results (∼ 92%), outperforming the 8K and Multi-Metagenome pipeline (MM) developed by Albertsen and collaborators. Sequencing depth influenced the accurate recovery of genomes when using the 8K and MM, even with contrasting patterns: the MM pipeline recovered more MAGs found in the original communities when employing sequencing depths up to 60 million reads, whilst the 8K recovered more true positives in communities sequenced above 60 million reads. DT showed the best species recovery from the same genus, even though close-related species have a low recovery rate in all pipelines.

Our results highlight that more bins do not translate to the actual community composition and that sequencing depth plays a role in MAG recovery and increased community resolution. Even low MAG recovery error rates can significantly impact biological inferences. Our data indicates the scientific community should their findings from MAG recovery, especially when asserting novel species or metabolic traits.

Persistent UFZ Identifier https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=27547
Nunes da Rocha, U., Kasmanas, J.C., Toscan, R., Sanches, D.S., Magnúsdóttir, S., Saraiva, J.P. (2023):
Simulation of 69 microbial communities indicates sequencing depth and false positives are major drivers of bias in Prokaryotic metagenome-assembled genome recovery
bioRxiv 10.1101/2023.05.02.539054