A unified resource for transcriptional regulation in Escherichia coli K-12 incorporating high-throughput-generated binding data into RegulonDB version 10.0

Alberto Santos-Zavaleta, Mishael Sanchez-Perez, Heladia Salgado, David A. Velazquez-Ramirez, Socorro Gama-Castro, Victor H. Tierrafria, Stephen J. W. Busby, Patricia Aquino, Xin Fang, Bernhard O. Palsson, James E. Galagan, Julio Collado-Vides*

*Corresponding author for this work

Research output: Contribution to journal › Journal article › Research › peer-review

Abstract

Background: Our understanding of the regulation of gene expression has benefited from the availability of high-throughput technologies that interrogate the whole genome for the binding of specific transcription factors and gene expression profiles. In the case of widely used model organisms, such as Escherichia coli K-12, the new knowledge gained from these approaches needs to be integrated with the legacy of accumulated knowledge from genetic and molecular biology experiments conducted in the pre-genomic era in order to attain the deepest level of understanding possible based on the available data.

Results: In this paper, we describe an expansion of RegulonDB, the database containing the rich legacy of decades of classic molecular biology experiments supporting what we know about gene regulation and operon organization in E. coli K-12, to include the genome-wide dataset collections from 32 ChIP and 19 gSELEX publications, in addition to around 60 genome-wide expression profiles relevant to the functional significance of these datasets and used in their curation. Three essential features for the integration of this information coming from different methodological approaches are: first, a controlled vocabulary within an ontology for precisely defining growth conditions; second, the criteria to separate elements with enough evidence to consider them involved in gene regulation from isolated transcription factor binding sites without such support; and third, an expanded computational model supporting this knowledge. Altogether, this constitutes the basis for adequately gathering and enabling the comparisons and integration needed to manage and access such wealth of knowledge.

Conclusions: This version 10.0 of RegulonDB is a first step toward what should become the unifying access point for current and future knowledge on gene regulation in E. coli K-12. Furthermore, this model platform and associated methodologies and criteria can be emulated for gathering knowledge on other microbial organisms.

Original language: English
Article number: 91
Journal: BMC Biology
Volume: 16
ISSN: 1741-7007
Publication status: Published - 2018

Bibliographical note

This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/)

Keywords

  • Transcriptional regulation
  • Transcriptomics
  • Integrative analyses
  • Systems biology
  • ChIP-seq
  • gSELEX
