The simple fool's guide to population genomics via RNA-Seq: An introduction to high-throughput sequencing data analysis

Publication: Research - peer-reviewJournal article – Annual report year: 2012

  • Author: De Wit, P., United States

    Department of Biology Stanford University Hopkins Marine Station , United States

  • Author: Pespeni, M.H., United States

    Department of Biology Indiana University, United States

  • Author: Ladner, J.T., United States

    Department of Biology Stanford University Hopkins Marine Station , United States

  • Author: Barshis, D.J., United States

    Department of Biology Stanford University Hopkins Marine Station , United States

  • Author: Seneca, F., United States

    Department of Biology Stanford University Hopkins Marine Station , United States

  • Author: Jaris, H., United States

    Department of Biology Stanford University Hopkins Marine Station , United States

  • Author: Therkildsen, Nina Overgaard

    Section for Population Ecology and Genetics, National Institute of Aquatic Resources, Technical University of Denmark, Sektion for Populationsgenetik Vejlsøvej 39, 8600, Silkeborg, Denmark

  • Author: Morikawa, M., United States

    Duke University, United States

  • Author: Palumbi, S.R., United States

    Department of Biology Stanford University Hopkins Marine Station , United States

View graph of relations

High-throughput sequencing technologies are currently revolutionizing the field of biology and medicine, yet bioinformatic challenges in analysing very large data sets have slowed the adoption of these technologies by the community of population biologists. We introduce the 'Simple Fool's Guide to Population Genomics via RNA-seq' (SFG), a document intended to serve as an easy-to-follow protocol, walking a user through one example of high-throughput sequencing data analysis of nonmodel organisms. It is by no means an exhaustive protocol, but rather serves as an introduction to the bioinformatic methods used in population genomics, enabling a user to gain familiarity with basic analysis steps. The SFG consists of two parts. This document summarizes the steps needed and lays out the basic themes for each and a simple approach to follow. The second document is the full SFG, publicly available at http://sfg.stanford.edu, that includes detailed protocols for data processing and analysis, along with a repository of custom-made scripts and sample files. Steps included in the SFG range from tissue collection to de novo assembly, blast annotation, alignment, gene expression, functional enrichment, SNP detection, principal components and F ST outlier analyses. Although the technical aspects of population genomics are changing very quickly, our hope is that this document will help population biologists with little to no background in high-throughput sequencing and bioinformatics to more quickly adopt these new techniques
Original languageEnglish
JournalMolecular Ecology Resources
Publication date2012
Volume12
Pages1058-1067
ISSN1755-098X
DOIs
StatePublished
CitationsWeb of Science® Times Cited: 17
Download as:
Download as PDF
Select render style:
APAAuthorCBEHarvardMLAStandardVancouverShortLong
PDF
Download as HTML
Select render style:
APAAuthorCBEHarvardMLAStandardVancouverShortLong
HTML
Download as Word
Select render style:
APAAuthorCBEHarvardMLAStandardVancouverShortLong
Word

ID: 10851087