Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species

Valérie Marot-Lassauzaie*, Tatyana Goldberg, Jose Juan Almagro Armenteros, Henrik Nielsen, Burkhard Rost

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

66 Downloads (Orbit)

Abstract

The native subcellular location (also referred to as localization or cellular compartment) of a protein is the one in which it acts most frequently; it is one aspect of protein function. Do ten eukaryotic model organisms differ in their location spectrum, i.e., the fraction of its proteome in each of seven major cellular compartments? As experimental annotations of locations remain biased and incomplete, we need prediction methods to answer this question. After systematic bias corrections, the complete but faulty prediction methods appeared to be more appropriate to compare location spectra between species than the incomplete more accurate experimental data. This work compared the location spectra for ten eukaryotes: Homo sapiens (human), Gorilla gorilla (gorilla), Pan troglodytes (chimpanzee), Mus musculus (mouse), Rattus norvegicus (rat), Drosophila melanogaster (fruit/vinegar fly), Anopheles gambiae (African malaria mosquito), Caenorhabitis elegans (nematode), Saccharomyces cerevisiae (baker’s yeast), and Schizosaccharomyces pombe (fission yeast). The two largest classes were predicted to be the nucleus and the cytoplasm together accounting for 47–62% of all proteins, while 7–21% of the proteins were predicted in the plasma membrane and 4–15% to be secreted. Overall, the predicted location spectra were largely similar. However, in detail, the differences sufficed to plot trees (UPGMA) and 2D (PCA) maps relating the ten organisms using a simple Euclidean distance in seven states (location classes). The relations based on the simple predicted location spectra captured aspects of cross-species comparisons usually revealed only by much more detailed evolutionary comparisons. Most interestingly, known phylogenetic relations were reproduced better by paralog-only than by ortholog-only trees.

Original languageEnglish
JournalJournal of Molecular Evolution
Volume89
Issue number8
Pages (from-to)544-553
ISSN0022-2844
DOIs
Publication statusPublished - 2021

Bibliographical note

Funding Information:
Thanks to Tim Karl (TUM) for invaluable help with hardware and software; to Inga Weise (TUM) for administrative support; to Michael Bernhofer & Jonas Reeb (both TUM) for help with the error correction; to Maria Schelling and Michael Bernhofer for advice and help with the validation; to Michael Heinzinger for help with the first revision. We are grateful to the anonymous reviewers and editors for their important help. Thanks to Ioannis Xenarios (Swiss-Prot, SIB, Geneva), Mathias Uhlen (HPA Uppsala Univ), and their crews for maintaining excellent databases and to all experimentalists who enabled this analysis by making their data publicly available.

Publisher Copyright:
© 2021, The Author(s).

Keywords

  • Genome sequence analysis
  • Prediction of cellular compartment
  • Protein location
  • Species comparisons

Fingerprint

Dive into the research topics of 'Spectrum of Protein Location in Proteomes Captures Evolutionary Relationship Between Species'. Together they form a unique fingerprint.

Cite this