High-throughput ’omics studies generate ever larger datasets and, as a consequence, complex data interpretation challenges. This course focusses on statistical concepts involved in preprocessing, quantification and differential analysis of high-throughput ’omics data. The core focus will be on shotgun proteomics and (bulk and single-cell) RNA-sequencing. Experimental design is essential to allow for correct interpretation in all ’omics studies, and we will cover how to design a statistically efficient experiment, as well as discuss the impact experimental design has on how we model ’omics data, introducing concepts such as blocking. The course will rely exclusively on free and user-friendly open-source tools in R/Bioconductor. We hope that this will provide a solid basis for beginners, but will also bring new perspectives to those already familiar with standard data analysis workflows for proteomics and next-generation sequencing applications.
This course is oriented towards biologists and bioinformaticians with a particular interest in differential analysis for quantitative ’omics data.
The prerequisites for the Statistical Genomics Analysis course are the successful completion of a basic course of statistics that covers topics on data exploration and descriptive statistics, statistical modeling, and inference: linear models, confidence intervals, t-tests, F-tests, anova, chi-squared test. The basis concepts may be revisited in the online course at https://statomics.github.io/PSLS/ (English) and in https://statomics.github.io/sbc/ (Dutch).
In addition, knowledge of programming in R
is preferred.
A primer to R
and Data visualization in R
can
be found at:
R
Basics: https://dodona.ugent.be/nl/courses/335/R
Data Exploration: https://dodona.ugent.be/nl/courses/345/Participants are required to bring their own laptop with R version 4.2 or greater.
We also recommend to also install the latest version of RStudio.
Installation script: to install all required packages, please copy and paste this line of code in your R console.
source("https://raw.githubusercontent.com/statOmics/SGA/master/install.R")
1.1. Position of the course: HTML
1.2. Recap Linear Models
Tutorial: Design
Solution: Heart Example
Solution: Heart Example Homework