Skip to main navigation Skip to search Skip to main content

A comprehensive database for biological data derived from sewage in five European cities

  • Ágnes Becsei*
  • , Patrick Munk
  • , Alessandro Fuschi
  • , Saria Otani
  • , József Stéger
  • , Dávid Visontai
  • , Krisztián Papp
  • , Christian Brinch
  • , Ravi Kant
  • , Ilya Weinstein
  • , Olli Vapalahti
  • , Miranda de Graaf
  • , Claudia M. E. Schapendonk
  • , Jeroen Roelfsema
  • , Maaike van den Beld
  • , Roan Pijnacker
  • , Eelco Franz
  • , Patricia Alba
  • , Antonio Battisti
  • , Alessandra De Cesare
  • Valentina Indio, Fulvia Troja, Tarja Sironen, Chiara Oliveri, Frédérique Pasquali, Ivan Liachko, Benjamin Auch, Colman O'Cathail, Krisztián Bányai, Magdolna Makó, Péter Pollner, Marion Koopmans, Istvan Csabai, Daniel Remondini, Frank M. Aarestrup
*Corresponding author for this work
  • Eötvös Loránd University
  • University of Bologna
  • Medical University of Gdańsk
  • University of Helsinki
  • Erasmus University Medical Center
  • National Institute of Public Health and the Environment
  • Experimental Zooprophylactic Institute of Lazio and Toscana
  • Phase Genomics, Inc.
  • European Bioinformatics Institute
  • University of Veterinary Medicine Budapest
  • Budapest Sewage Works Pte. Ltd
  • Health Services Management Training Centre

Research output: Contribution to journalJournal articleResearchpeer-review

48 Downloads (Orbit)

Abstract

Sewage metagenomics is a powerful tool for proactive pathogen surveillance and understanding microbial community dynamics. To support such efforts, we present a highly curated and accessible longitudinal dataset of 239 sewage samples collected from five European cities. The dataset, processed through metagenomic sequencing, includes rich analytical outputs such as taxonomic profiles, identified antimicrobial resistance genes, assembled contigs with annotated origins, metagenome-assembled genomes with functional gene annotations, and metadata. Given the computational intensity and time required to reproduce such analyses, we share this dataset to promote reuse and advance research. In addition to the metagenomic data, qPCR was used to identify specific pathogens, and Hi-C sequencing was performed on a subset of the samples to strengthen genomic linkage analysis. Central to this resource is a publicly available PostgreSQL database, designed to facilitate efficient exploration and reuse of the data. This comprehensive database allows users to perform targeted queries, subset data, and streamline access to this extensive resource.
Original languageEnglish
Article numberbaaf089
JournalDatabase: The Journal of Biological Databases and Curation
Volume2026
Number of pages7
ISSN1758-0463
DOIs
Publication statusPublished - 2026

Fingerprint

Dive into the research topics of 'A comprehensive database for biological data derived from sewage in five European cities'. Together they form a unique fingerprint.

Cite this