Abstract
Sequencing costs are falling, but the cost of data analysis remains high, often because unforeseen problems arise, such as insufficient depth of sequencing or batch effects. Experimenting with data analysis methods during the planning phase of an experiment can reveal unanticipated problems and build valuable bioinformatics expertise in the organism or process being studied. This protocol describes using R Markdown and RStudio, user-friendly tools for statistical analysis and reproducible research in bioinformatics, to analyze and document the analysis of an example RNA-Seq data set from tomato pollen undergoing chronic heat stress. Also, we show how to use Integrated Genome Browser to visualize read coverage graphs for differentially expressed genes. Applying the protocol described here and using the provided data sets represent a useful first step toward building RNA-Seq data analysis expertise in a research group.
Original language | English |
---|---|
Title of host publication | Plant Functional Genomics |
Subtitle of host publication | Methods and Protocols: Second Edition |
Publisher | Springer New York |
Pages | 481-501 |
Number of pages | 21 |
ISBN (Electronic) | 9781493924448 |
ISBN (Print) | 9781493924431 |
DOIs | |
State | Published - 11 Mar 2015 |
Bibliographical note
Publisher Copyright:© Springer Science+Business Media New York 2015. All rights are reserved.
Funding
Funders | Funder number |
---|---|
National Science Foundation | 0955910 |
National Institute of General Medical Sciences | R01GM103463 |
National Center for Research Resources | R01RR032048 |
Keywords
- Differential gene expression
- Integrated genome browser
- Pollen
- R
- RNA-Seq
- Tomato
- Visualization
- edgeR