Course objective: Introducing statistical methods for the analysis of high-throughput gene expression data, with a focus on the search for genes whose expression is related to variation in experimental covariates.

2021

Organization: The course is divided into four sessions. Each session consists of a lecture and practical classes. In these in-class practical classes, students tackle problem-solving exercises, with or without R.

The following documents are provided for each session:

A video lecture: an R tutorial, provided with its slides, data and R script;
A PDF and/or Rmd file containing a self-directed learning exercise.

Getting started with R Markdown: brief introduction.

Session 1

Objective: Introducing the general principles of gene expression data normalization. Normalization is a preliminary step in genomic data analysis that consists of identifying and removing variation due solely to technological biases.

Video lecture (divided into five short clips: 1, 2, 3, 4, 5) with slides, data and R script;
In-class activities: exercise.
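
As a toy illustration of the normalization principle stated in the objective, the R sketch below simulates log-scale expression values for a few arrays, adds an array-specific shift standing in for a technological bias, and removes it by median centering. It is not taken from the course material: the simulated data, the shift values and the choice of median centering (one simple method among many) are assumptions made for illustration.

```r
# Simulated log2 expression matrix: 1000 genes (rows) x 4 arrays (columns).
# All values below are hypothetical and chosen only for illustration.
set.seed(1)
n_genes <- 1000
true_signal <- rnorm(n_genes, mean = 8, sd = 2)

# Assumed array-specific technical shifts (e.g. differences in overall intensity).
array_shift <- c(0, 0.5, -0.3, 1)
expr <- sapply(array_shift, function(s) true_signal + s + rnorm(n_genes, sd = 0.2))
colnames(expr) <- paste0("array", seq_along(array_shift))

# Median centering: shift each array so that all arrays share the same median.
array_medians <- apply(expr, 2, median)
target_median <- median(array_medians)
expr_norm <- sweep(expr, 2, array_medians - target_median, FUN = "-")

# Before normalization the medians differ; afterwards they coincide.
print(round(array_medians, 3))
print(round(apply(expr_norm, 2, median), 3))
```

Median centering only corrects a global shift per array. Intensity-dependent biases need other methods, which this sketch does not attempt to cover.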

Session 2

Objective: Being able to use the R package limma to import gene expression data into an R session and to check the quality of the gene expression measurements.

Video lecture and R script;
In-class activities: exercise (PDF and Rmd).

Session 3

Objective: Controlling false positives when selecting genes whose mean expression is significantly related to experimental covariates.

Video lecture (divided into five short clips: 1, 2, 3, 4, 5) with slides, data (expression data, external data, experimental covariates) and R script (R, Rmd, PDF);
In-class activities: exercise.
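
To make the false-positive issue concrete, the R sketch below tests many simulated genes for a difference in mean expression between two groups, then compares selection on raw p-values with selection after a Benjamini-Hochberg adjustment using base R's `p.adjust`. The simulated data, the two-group design, the use of plain t-tests and the 0.05 threshold are assumptions made for illustration. The course itself may rely on other tests or procedures.

```r
# Simulated data: 2000 genes, two groups of 10 samples (hypothetical design).
set.seed(2)
n_genes <- 2000
n_per_group <- 10
group <- rep(c("A", "B"), each = n_per_group)
expr <- matrix(rnorm(n_genes * 2 * n_per_group), nrow = n_genes)

# Assume the first 100 genes are truly related to the group covariate.
expr[1:100, group == "B"] <- expr[1:100, group == "B"] + 2

# One two-sample t-test per gene.
pvals <- apply(expr, 1, function(y) {
  t.test(y[group == "A"], y[group == "B"])$p.value
})

# Benjamini-Hochberg adjustment controls the false discovery rate.
padj <- p.adjust(pvals, method = "BH")

selected_raw <- which(pvals < 0.05)
selected_bh <- which(padj < 0.05)

# Count selections, and how many fall among the 1900 null genes.
cat("raw p < 0.05:", length(selected_raw),
    "selected, of which", sum(selected_raw > 100), "are null genes\n")
cat("BH  p < 0.05:", length(selected_bh),
    "selected, of which", sum(selected_bh > 100), "are null genes\n")
```

With about 1900 null genes, a raw 0.05 threshold is expected to let through roughly 5% of them by chance alone, which is why an adjustment for multiple testing is needed.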

Session 4

Objective: Using clustering procedures to gain more insight into the list of selected genes. This short session is dedicated to clustering methods that extract groups of microarrays with similar expression profiles and groups of co-expressed genes.

Video lecture with data (expression data for selected genes, factor-adjusted expression data for selected genes) and R script.
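
As a minimal sketch of the kind of clustering described in the objective, the R code below applies hierarchical clustering with base R's `hclust` to a small simulated expression matrix, once on genes (to find co-expressed genes) and once on arrays. The simulated profiles, the correlation-based distance, the average linkage and the number of groups are assumptions made for illustration, not the course's own choices.

```r
# Simulated expression matrix: 40 selected genes x 12 arrays (hypothetical).
set.seed(3)
n_arrays <- 12
pattern1 <- rep(c(1, -1), each = 6)      # profile shared by genes 1 to 20
pattern2 <- rep(c(-1, 1, -1), each = 4)  # profile shared by genes 21 to 40
expr <- rbind(
  t(replicate(20, pattern1 + rnorm(n_arrays, sd = 0.3))),
  t(replicate(20, pattern2 + rnorm(n_arrays, sd = 0.3)))
)
rownames(expr) <- paste0("gene", 1:40)
colnames(expr) <- paste0("array", 1:n_arrays)

# Co-expressed genes: distance based on the correlation between gene profiles.
gene_dist <- as.dist(1 - cor(t(expr)))
gene_tree <- hclust(gene_dist, method = "average")
gene_groups <- cutree(gene_tree, k = 2)
print(table(gene_groups))

# Similar arrays: Euclidean distance between array profiles.
array_tree <- hclust(dist(t(expr)), method = "average")
# plot(gene_tree) and plot(array_tree) would display the two dendrograms.
```

Correlation-based distance groups genes by the shape of their profile rather than by their overall expression level, which is usually what "co-expressed" is taken to mean.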

First steps in genomic data analysis
