Automated workflow composition in mass spectrometry-based proteomics

Magnus Palmblad, Anna Lena Lamprecht, Jon Ison*, Veit Schwämmle

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

137 Downloads (Pure)

Abstract

MOTIVATION: Numerous software utilities operating on mass spectrometry (MS) data are described in the literature and provide specific operations as building blocks for the assembly of on-purpose workflows. Working out which tools and combinations are applicable or optimal in practice is often hard. Thus researchers face difficulties in selecting practical and effective data analysis pipelines for a specific experimental design.

RESULTS: We provide a toolkit to support researchers in identifying, comparing and benchmarking multiple workflows from individual bioinformatics tools. Automated workflow composition is enabled by the tools' semantic annotation in terms of the EDAM ontology. To demonstrate the practical use of our framework, we created and evaluated a number of logically and semantically equivalent workflows for four use cases representing frequent tasks in MS-based proteomics. Indeed we found that the results computed by the workflows could vary considerably, emphasizing the benefits of a framework that facilitates their systematic exploration.

AVAILABILITY AND IMPLEMENTATION: The project files and workflows are available from https://github.com/bio-tools/biotoolsCompose/tree/master/Automatic-Workflow-Composition.

SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Original languageEnglish
JournalBioinformatics
Volume35
Issue number4
Pages (from-to)656-664
Number of pages9
ISSN1367-4803
DOIs
Publication statusPublished - 15 Feb 2019

Cite this