RNAseq alignment, quantification and transcript discovery with statistics (12/04/2021 - 15/04/2021)

The Toulouse Genotoul bioinformatics platform, in collaboration with the Genotoul Biostatistics platform, the Sigenae team and the MIAT unit, organize a 4 days long training course for bio-informaticians and biologists aiming at learning sequence analysis. It focuses on (protein coding) gene expression analysis using reads produced by ‘RNA-Seq’. This training session is designed to introduce sequences from ‘NGS’ (Next Generation Sequencing), particularly Illumina platforms (HiSeq). You will discover the standards file formats, learn about the usual biases of this type of data and run different kinds of analyses, such as spliced alignment on a reference genome, novel gene and transcript discovery, expression quantification of coding genes and transcripts. Finally you will be able to extract the differentially expressed genes.


calendar

 

This training focuses on practice. It consists of modules with a large variety of exercises described hereunder (PROVISIONAL SCHEDULE):

  • Introduction (Day 1): What will be my experimental plan? What is gene expression? What kind of technology can be used to monitor gene expression? What do the reads produced by NGS platforms (Illumina) using the RNA-Seq protocol look like? Which are the known biases of these sequences? Presentation of the dataset for the practical exercises
  • Sequence quality (Day 1).
  • Sequence cleaning (Day 1).
  • Splice aligning reads on a reference genome, Visualizing alignments and splice sites using IGV (Integrated Genome Viewer) (Day 1).
  • Raw count vs. abundance estimate (Day 2).
  • Discovering novel genes and transcripts Part 1 (Day 2).
  • Comparison of models, visualization and results of gene expression quantification and conclusions (Day 2).
  • Statistics: Exploratory analysis of count data (Day 3).
  • Statistics: Normalization and differential expression analysis (Day 4).

 

The session will take place in the room ‘salle de formation’ at the INRA center of Toulouse-Auzeville.


 

Prerequisites: ability to use a Linux and Cluster environment and basic knowledge in R.
You can check available R training session at Biostat platform and Unix and cluster session at bioinfo platform .

For self train we will try to list here available ressources :

  •  https://perso.math.univ-toulouse.fr/dejean/files/2020/12/intro_R.pdf

 

Bioinfo part : Material.

Biostat part : Material


 

Bookings: RNAseq alignment, quantification and transcript discovery with statistics
Ticket Type Price Spaces
Non-academic
for non-academic: 550€ + 20% taxes (TVA) per day
€2 200,00
Academic non INRAE
for academic but non-INRAE: 170 € + 20% taxes (TVA)
€680,00
INRAE
for INRA's staff: 150€ duty free per day;
€600,00