Publication Details

Reference Category Journals
DOI / URL link
Creative Commons Licence creative commons licence
Title (Primary) MarineMetagenomeDB: a public repository for curated and standardized metadata for marine metagenomes
Author Kabiru Nata’ala, M.; Avila Santos, A.P.; Kasmanas, J.C.; Bartholomäus, A.; Saraiva, J.P.; Silva, S.G.; Keller-Costa, T.; Costa, R.; Gomes, N.C.M.; de Carvalho, A.C.P.L.F.; Stadler, P.F.; Sipoli Sanches, D.; Nunes da Rocha, U.
Journal Environmental Microbiome
Year 2022
Department UMB
Volume 17
Page From art. 57
Language englisch
Topic T7 Bioeconomy
Keywords Marine Microbiomes; Metagenomics; Microbial Ecology; Metadata; Database


Metagenomics is an expanding field within microbial ecology, microbiology, and related disciplines. The number of metagenomes deposited in major public repositories such as Sequence Read Archive (SRA) and Metagenomic Rapid Annotations using Subsystems Technology (MG-RAST) is rising exponentially. However, data mining and interpretation can be challenging due to mis-annotated and misleading entries of metadata. In this study, we describe the Marine Metagenome Metadata Database (MarineMetagenomeDB) to help researchers identify marine metagenomes of interest for re-analysis and meta-analysis. To this end, we have manually curated the associated metadata of several thousands of microbial metagenomes currently deposited at SRA and MG-RAST.


In total, 125 terms were curated according to 17 different classes (e.g., biome, material, oceanic zone, geographic feature, oceanographic phenomena). Other standardized features include sample attributes (e.g., salinity, depth), sample location (e.g., latitude, longitude), and sequencing features (e.g., sequencing platform, sequence count). MarineMetagenomeDB version 1.0 contains 11,449 marine metagenomes from SRA and MG-RAST distributed across all oceans and several seas. Most of the samples were sequenced using Illumina sequencing technology (84.33%). More than 55%of the samples were collected from the Pacific and the Atlantic Oceans. About 40% of the samples had their biomes assigned as ‘ocean’. The ‘Quick Search’ and ‘Advanced Search’ tabs allow users to use different filters to select samples of interest dynamically in the web app. The interactive map allows the visualization of samples based on their location on the world map. The web app is also equipped with a novel download tool (on both Windows and Linux operating systems), that allow easy download of raw sequence data of selected samples from their respective repositories. We demonstrated how users may use the MarineMetagenomeDB web app to select estuarine metagenomes for large-scale microbial biogeography studies quickly.


The MarineMetagenomeDB is a powerful resource for non-bioinformaticians to find marine metagenome samples with curated metadata and stimulate meta-studies involving marine microbiomes. Our user-friendly web app is publicly available at

Persistent UFZ Identifier
Kabiru Nata’ala, M., Avila Santos, A.P., Kasmanas, J.C., Bartholomäus, A., Saraiva, J.P., Silva, S.G., Keller-Costa, T., Costa, R., Gomes, N.C.M., de Carvalho, A.C.P.L.F., Stadler, P.F., Sipoli Sanches, D., Nunes da Rocha, U. (2022):
MarineMetagenomeDB: a public repository for curated and standardized metadata for marine metagenomes
Environ. Microbiome 17 , art. 57