A multi-scale expression and regulation knowledge base for Escherichia coli

Cameron R. Lamoureux, Katherine T. Decker, Anand V. Sastry, Kevin Rychel, Ye Gao, John Luke McConn, Daniel C. Zielinski*, Bernhard O. Palsson*

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

8 Downloads (Pure)


Transcriptomic data is accumulating rapidly; thus, scalable methods for extracting knowledge from this data are critical. Here, we assembled a top-down expression and regulation knowledge base for Escherichia coli. The expression component is a 1035-sample, high-quality RNA-seq compendium consisting of data generated in our lab using a single experimental protocol. The compendium contains diverse growth conditions, including: 9 media; 39 supplements, including antibiotics; 42 heterologous proteins; and 76 gene knockouts. Using this resource, we elucidated global expression patterns. We used machine learning to extract 201 modules that account for 86% of known regulatory interactions, creating the regulatory component. With these modules, we identified two novel regulons and quantified systems-level regulatory responses. We also integrated 1675 curated, publicly-available transcriptomes into the resource. We demonstrated workflows for analyzing new data against this knowledge base via deconstruction of regulation during aerobic transition. This resource illuminates the E. coli transcriptome at scale and provides a blueprint for top-down transcriptomic analysis of non-model organisms.
Original languageEnglish
Article numbergkad750
JournalNucleic Acids Research
Issue number19
Pages (from-to)10176-10193
Publication statusPublished - 2023


Dive into the research topics of 'A multi-scale expression and regulation knowledge base for Escherichia coli'. Together they form a unique fingerprint.

Cite this