Publication Details

Category Text Publication
Reference Category Journals
DOI 10.1186/s40168-020-00993-9
Licence creative commons licence
Title (Primary) PathoFact: a pipeline for the prediction of virulence factors and antimicrobial resistance genes in metagenomic data
Author de Nies, L.; Lopes, S.; Busi, S.B.; Galata, V.; Heintz-Buschart, A.; Laczny, C.C.; May, P.; Wilmes, P.
Source Titel Microbiome
Year 2021
Department BOOEK; iDiv
Volume 9
Page From art. 49
Language englisch
Topic T5 Future Landscapes
Supplements https://static-content.springer.com/esm/art%3A10.1186%2Fs40168-020-00993-9/MediaObjects/40168_2020_993_MOESM2_ESM.pdf
Keywords Virulence factors; Bacterial toxins; Antimicrobial resistance; Mobile genetic elements; Metagenomics; Microbiome; Bioinformatics
Abstract Background
Pathogenic microorganisms cause disease by invading, colonizing, and damaging their host. Virulence factors including bacterial toxins contribute to pathogenicity. Additionally, antimicrobial resistance genes allow pathogens to evade otherwise curative treatments. To understand causal relationships between microbiome compositions, functioning, and disease, it is essential to identify virulence factors and antimicrobial resistance genes in situ. At present, there is a clear lack of computational approaches to simultaneously identify these factors in metagenomic datasets.

Results
Here, we present PathoFact, a tool for the contextualized prediction of virulence factors, bacterial toxins, and antimicrobial resistance genes with high accuracy (0.921, 0.832 and 0.979, respectively) and specificity (0.957, 0.989 and 0.994). We evaluate the performance of PathoFact on simulated metagenomic datasets and perform a comparison to two other general workflows for the analysis of metagenomic data. PathoFact outperforms all existing workflows in predicting virulence factors and toxin genes. It performs comparably to one pipeline regarding the prediction of antimicrobial resistance while outperforming the others. We further demonstrate the performance of PathoFact on three publicly available case-control metagenomic datasets representing an actual infection as well as chronic diseases in which either pathogenic potential or bacterial toxins are hypothesized to play a role. In each case, we identify virulence factors and AMR genes which differentiated between the case and control groups, thereby revealing novel gene associations with the studied diseases.

Conclusion
PathoFact is an easy-to-use, modular, and reproducible pipeline for the identification of virulence factors, bacterial toxins, and antimicrobial resistance genes in metagenomic data. Additionally, our tool combines the prediction of these pathogenicity factors with the identification of mobile genetic elements. This provides further depth to the analysis by considering the genomic context of the pertinent genes. Furthermore, PathoFact’s modules for virulence factors, toxins, and antimicrobial resistance genes can be applied independently, thereby making it a flexible and versatile tool. PathoFact, its models, and databases are freely available at https://pathofact.lcsb.uni.lu.
Persistent UFZ Identifier https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=24268
de Nies, L., Lopes, S., Busi, S.B., Galata, V., Heintz-Buschart, A., Laczny, C.C., May, P., Wilmes, P. (2021):
PathoFact: a pipeline for the prediction of virulence factors and antimicrobial resistance genes in metagenomic data
Microbiome 9 , art. 49 10.1186/s40168-020-00993-9