The Salmonella enterica Pan-genome

Annika Jacobsen, Rene S. Hendriksen, Frank Møller Aarestrup, David Ussery, Carsten Friis

Research output: Contribution to journalJournal articleResearchpeer-review

764 Downloads (Pure)

Abstract

Salmonella enterica is divided into four subspecies containing a large number of different serovars, several of which are important zoonotic pathogens and some show a high degree of host specificity or host preference. We compare 45 sequenced S. enterica genomes that are publicly available (22 complete and 23 draft genome sequences). Of these, 35 were found to be of sufficiently good quality to allow a detailed analysis, along with two Escherichia coli strains (K-12 substr. DH10B and the avian pathogenic E. coli (APEC O1) strain). All genomes were subjected to standardized gene finding, and the core and pan-genome of Salmonella were estimated to be around 2,800 and 10,000 gene families, respectively. The constructed pan-genomic dendrograms suggest that gene content is often, but not uniformly correlated to serotype. Any given Salmonella strain has a large stable core, whilst there is an abundance of accessory genes, including the Salmonella pathogenicity islands (SPIs), transposable elements, phages, and plasmid DNA. We visualize conservation in the genomes in relation to chromosomal location and DNA structural features and find that variation in gene content is localized in a selection of variable genomic regions or islands. These include the SPIs but also encompass phage insertion sites and transposable elements. The islands were typically well conserved in several, but not all, isolates—a difference which may have implications in, e.g., host specificity.
Original languageEnglish
JournalMicrobial Ecology
Volume62
Issue number3
Pages (from-to)487-504
ISSN0095-3628
DOIs
Publication statusPublished - 2011

Fingerprint

Dive into the research topics of 'The Salmonella enterica Pan-genome'. Together they form a unique fingerprint.

Cite this