Details zur Publikation

Kategorie Textpublikation
Referenztyp Zeitschriften
DOI 10.1093/bioadv/vbad069
Lizenz creative commons licence
Titel (primär) StandEnA: a customizable workflow for standardized annotation and generating a presence–absence matrix of proteins
Autor Chafra, F.; Borim Correa, F.; Oni, F.; Konu Karakayalı, Ö.; Stadler, P.F.; Nunes da Rocha, U.
Quelle Bioinformatics Advances
Erscheinungsjahr 2023
Department UMB
Band/Volume 3
Heft 1
Seite von vbad069
Sprache englisch
Topic T7 Bioeconomy
Supplements https://oup.silverchair-cdn.com/oup/backfile/Content_public/Journal/bioinformaticsadvances/3/1/10.1093_bioadv_vbad069/2/vbad069_supplementary_data.zip?Expires=1693409744&Signature=TqpUaQTPCGYmnqYPmYOpW2TGH2lP2v2~O1V-Ce6nGnJaMGq2KB1zcsKffgtHW2pB6vpxaQtDyvXLysQi9slX2kBMlcG1hrwbn3kTt3MoahM42wuG4OrnjwxI6RiOoleVfFwKVkS~-jKxDpNvtj0NnqInCdF-9XASCzpCkaTqR5eJWukGUk1Q1~Wk578~hOjhR7-DFj1slXhAhAg8DGdVxn928~M9IXrTCONaMQj0vyaLR6VrVW7EO5K6TD5mH~rrU6yQML~7KOp56Np4kiYfT4cB2ndEBuZ-00PY4dXKTQmlqQZ4LZmHMSZA9rgKMMDP-p55wVAFQkKOXtgcefyWyw__&Key-Pair-Id=APKAIE5G5CRDK6RD3PGA
Abstract Motivation

Several genome annotation tools standardize annotation outputs for comparability. During standardization, these tools do not allow user-friendly customization of annotation databases; limiting their flexibility and applicability in downstream analysis.
Results

StandEnA is a user-friendly command-line tool for Linux that facilitates the generation of custom databases by retrieving protein sequences from multiple databases. Directed by a user-defined list of standard names, StandEnA retrieves synonyms to search for corresponding sequences in a set of public databases. Custom databases are used in prokaryotic genome annotation to generate standardized presence–absence matrices and reference files containing standard database identifiers. To showcase StandEnA, we applied it to six metagenome-assembled genomes to analyze three different pathways.
dauerhafte UFZ-Verlinkung https://www.ufz.de/index.php?en=20939&ufzPublicationIdentifier=27393
Chafra, F., Borim Correa, F., Oni, F., Konu Karakayalı, Ö., Stadler, P.F., Nunes da Rocha, U. (2023):
StandEnA: a customizable workflow for standardized annotation and generating a presence–absence matrix of proteins
Bioinform. Adv. 3 (1), vbad069 10.1093/bioadv/vbad069