TY - JOUR
T1 - A comprehensive database for biological data derived from sewage in five European cities
AU - Becsei, Ágnes
AU - Munk, Patrick
AU - Fuschi, Alessandro
AU - Otani, Saria
AU - Stéger, József
AU - Visontai, Dávid
AU - Papp, Krisztián
AU - Brinch, Christian
AU - Kant, Ravi
AU - Weinstein, Ilya
AU - Vapalahti, Olli
AU - de Graaf, Miranda
AU - Schapendonk, Claudia M. E.
AU - Roelfsema, Jeroen
AU - van den Beld, Maaike
AU - Pijnacker, Roan
AU - Franz, Eelco
AU - Alba, Patricia
AU - Battisti, Antonio
AU - De Cesare, Alessandra
AU - Indio, Valentina
AU - Troja, Fulvia
AU - Sironen, Tarja
AU - Oliveri, Chiara
AU - Pasquali, Frédérique
AU - Liachko, Ivan
AU - Auch, Benjamin
AU - O'Cathail, Colman
AU - Bányai, Krisztián
AU - Makó, Magdolna
AU - Pollner, Péter
AU - Koopmans, Marion
AU - Csabai, Istvan
AU - Remondini, Daniel
AU - Aarestrup, Frank M.
PY - 2026
Y1 - 2026
N2 - Sewage metagenomics is a powerful tool for proactive pathogen surveillance and understanding microbial community dynamics. To support such efforts, we present a highly curated and accessible longitudinal dataset of 239 sewage samples collected from five European cities. The dataset, processed through metagenomic sequencing, includes rich analytical outputs such as taxonomic profiles, identified antimicrobial resistance genes, assembled contigs with annotated origins, metagenome-assembled genomes with functional gene annotations, and metadata. Given the computational intensity and time required to reproduce such analyses, we share this dataset to promote reuse and advance research. In addition to the metagenomic data, qPCR was used to identify specific pathogens, and Hi-C sequencing was performed on a subset of the samples to strengthen genomic linkage analysis. Central to this resource is a publicly available PostgreSQL database, designed to facilitate efficient exploration and reuse of the data. This comprehensive database allows users to perform targeted queries, subset data, and streamline access to this extensive resource.
AB - Sewage metagenomics is a powerful tool for proactive pathogen surveillance and understanding microbial community dynamics. To support such efforts, we present a highly curated and accessible longitudinal dataset of 239 sewage samples collected from five European cities. The dataset, processed through metagenomic sequencing, includes rich analytical outputs such as taxonomic profiles, identified antimicrobial resistance genes, assembled contigs with annotated origins, metagenome-assembled genomes with functional gene annotations, and metadata. Given the computational intensity and time required to reproduce such analyses, we share this dataset to promote reuse and advance research. In addition to the metagenomic data, qPCR was used to identify specific pathogens, and Hi-C sequencing was performed on a subset of the samples to strengthen genomic linkage analysis. Central to this resource is a publicly available PostgreSQL database, designed to facilitate efficient exploration and reuse of the data. This comprehensive database allows users to perform targeted queries, subset data, and streamline access to this extensive resource.
U2 - 10.1093/database/baaf089
DO - 10.1093/database/baaf089
M3 - Journal article
C2 - 41556498
SN - 1758-0463
VL - 2026
JO - Database: The Journal of Biological Databases and Curation
JF - Database: The Journal of Biological Databases and Curation
M1 - baaf089
ER -