RNA-sequencing: data analysis with R

General description

This comprehensive, hands-on course provides training in the bioinformatics of gene expression analysis, focusing on bulk RNA-sequencing data using R, RStudio, and various R packages. The course also introduces foundational concepts of single-cell RNA-seq analysis. All instructors are practicing bioinformaticians and lecturers at training courses of NGO Genomics UA. Each session includes an introductory lecture followed by a practical tutorial. During tutorials, students analyze data live under the guidance of an instructor and take part in interactive assessments.

The course begins with an R boot camp that introduces newcomers to the R programming language and RStudio and serves as a refresher for those already familiar with R. Next, the course covers the basics of expression analysis, advanced R techniques, differential expression analysis using DESeq2, data visualization, functional analysis, and single-cell expression analysis. A final assessment in the form of multiple-choice questions concludes the course; successful completion requires a minimum score of 70% on each test. The course also features a team project in which groups of 2-3 students analyze expression data from a publicly available dataset, starting from a count matrix. The two final sessions focus on student presentations and feedback from the instructors. These mini-projects are not graded, but successful teams will receive additional certificates from NGO Genomics UA, signed by the instructors.

All instruction will be delivered in Ukrainian, but as the bioinformatics packages are documented in English and the field uses established English terminology, the necessary background will be provided in each session.

Lectures
1. Introduction to R. Part 1
Valeriia VASYLIEVA

Navigating RStudio, introducing R script structure. The notion of data classes and types. Basic code writing, including syntax, data slicing, substitution, and transformation. Application of loops, branches (if-else), and creation of functions.

2. Introduction to R. Part 2
Valeriia VASYLIEVA

Loading and exporting files. Illustrating the essential steps in exploring the data. A brief comparison of base R and the tidyverse. Data visualization and interpretation of the graphs.

3. Basics of RNA-seq analysis. Part 1
Serhiy NAUMENKO

Introduction to gene expression analysis and RNA-seq data. Experiment planning and design. Raw data. Metadata. Types and methods of normalization. P-values and multiple testing corrections. Negative binomial distribution and expression data.

4. Basics of RNA-seq analysis. Part 2
Serhiy NAUMENKO

Introduction to the main dataset used in the course. Walk-through of the main stages and scripts of the expression analysis using the dataset.

5. Introduction to R. Part 3
Valeriia VASYLIEVA

Normality test, data transformation, test for homoscedasticity. Understanding covariance and correlation. Hypothesis testing (t-test, ANOVA).

6. Differential expression. Part 1
Oleksandr PETRENKO

Gene pre-processing and annotation; exploratory analysis, including dimensionality reduction (PCA, t-SNE); the effect of normalization on dimensionality reduction

7. Differential expression. Part 2
Oleksandr PETRENKO

DESeq2 test selection, definition of differentially expressed genes thresholds, pairwise testing, multiple group comparison, results interpretation, differential expression tools beyond DESeq2

8. Data visualization. Part 1
Oleksandr SHYNKARENKO

Introduction to data visualization. ggplot2. Basic plots: scatterplot, density plot, histogram. Defining mean, median, and mode. Selection and application of parametric and non-parametric tests. Box and violin plots. Principal component analysis and t-SNE.

9. Data visualization. Part 2
Oleksandr SHYNKARENKO

Venn diagrams and UpSet plots. Volcano plots. Heatmaps. Interactive plots. R Markdown best practices. Embedding tables and files. Bulk RNA-seq report structure.

10. Functional analysis. Part 1
Oleksandr PETRENKO, Maryna KORSHEVNIUK

Functional approaches: over-representation, gene set enrichment, pathway analysis. Working with GO, Reactome, and KEGG databases

11. Functional analysis. Part 2
Oleksandr PETRENKO

Basics of graph theory and network analysis. Gene co-expression, transcriptional regulation, identification of hub genes, and interpretation

12. Introduction to single-cell RNA sequencing. Part 1
Maryna KORSHEVNIUK

Introduction to single-cell RNA-seq analysis using Seurat and a PBMC dataset: normalization, clustering, differential expression between clusters and between conditions

13. Introduction to single-cell RNA sequencing. Part 2
Maryna KORSHEVNIUK, Serhiy NAUMENKO

Single-cell RNA-seq analysis: visualizations. Methods for deconvolution of expression signatures

14-15. Project presentations and feedback sessions

Presentation of students’ projects and instructors' feedback

Level: Bachelor's and master's students
Duration: August-September
Language: Ukrainian
Certificate: 1 ECTS credit
Timetable

Date            Topic                                               Instructor
2024-08-01 Thu  Introduction to R. Part 1                           Valeriia VASYLIEVA
2024-08-05 Mon  Introduction to R. Part 2                           Valeriia VASYLIEVA
2024-08-09 Fri  Basics of RNA-seq analysis. Part 1                  Serhiy NAUMENKO
2024-08-12 Mon  Basics of RNA-seq analysis. Part 2                  Serhiy NAUMENKO
2024-08-16 Fri  Introduction to R. Part 3                           Valeriia VASYLIEVA
2024-08-19 Mon  Differential expression. Part 1                     Oleksandr PETRENKO
2024-08-23 Fri  Differential expression. Part 2                     Oleksandr PETRENKO
2024-08-26 Mon  Data visualization. Part 1                          Oleksandr SHYNKARENKO
2024-08-30 Fri  Data visualization. Part 2                          Oleksandr SHYNKARENKO
2024-09-02 Mon  Functional analysis. Part 1                         Oleksandr PETRENKO, Maryna KORSHEVNIUK
2024-09-06 Fri  Functional analysis. Part 2                         Oleksandr PETRENKO
2024-09-09 Mon  Introduction to single-cell RNA sequencing. Part 1  Maryna KORSHEVNIUK
2024-09-13 Fri  Introduction to single-cell RNA sequencing. Part 2  Maryna KORSHEVNIUK, Serhiy NAUMENKO
2024-09-16 Mon  Project presentations and feedback sessions
2024-09-23 Mon  Project presentations and feedback sessions

Lecturers

MSc, PhD candidate in personalized medicine and single cell multiomics, Groningen University Medical Center, the Netherlands

A scientist-physician and research associate at the Medical University of Vienna

Bioinformatician, co-founder and former CTO of HTuO Biosciences Inc. (Vancouver, Canada)

PhD, bioinformatician, specialist in genomic bioinformatics at Newborn Screening Ontario, Canada

Master's student at the University of Sherbrooke, Canada