The simple fool's guide to population genomics via RNA-Seq: An introduction to high-throughput sequencing data analysis

P. De Wit, M.H. Pespeni, J.T. Ladner, D.J. Barshis, F. Seneca, H. Jaris, Nina Overgaard Therkildsen, M. Morikawa, S.R. Palumbi

Research output: Contribution to journal, Journal article, Research, peer-review

Abstract

High-throughput sequencing technologies are currently revolutionizing the field of biology and medicine, yet bioinformatic challenges in analysing very large data sets have slowed the adoption of these technologies by the community of population biologists. We introduce the 'Simple Fool's Guide to Population Genomics via RNA-seq' (SFG), a document intended to serve as an easy-to-follow protocol, walking a user through one example of high-throughput sequencing data analysis of nonmodel organisms. It is by no means an exhaustive protocol, but rather serves as an introduction to the bioinformatic methods used in population genomics, enabling a user to gain familiarity with basic analysis steps. The SFG consists of two parts. This document summarizes the steps needed and lays out the basic themes for each and a simple approach to follow. The second document is the full SFG, publicly available at http://sfg.stanford.edu, that includes detailed protocols for data processing and analysis, along with a repository of custom-made scripts and sample files. Steps included in the SFG range from tissue collection to de novo assembly, BLAST annotation, alignment, gene expression, functional enrichment, SNP detection, principal components and F_ST outlier analyses. Although the technical aspects of population genomics are changing very quickly, our hope is that this document will help population biologists with little to no background in high-throughput sequencing and bioinformatics to more quickly adopt these new techniques.
Original language: English
Journal: Molecular Ecology Resources
Volume: 12
Pages (from-to): 1058-1067
ISSN: 1755-098X
Publication status: Published - 2012
