Bioinformatics 03 2024

General description

The bioinformatics course consists of three blocks - the basics of analysis of DNA and protein sequences, principles of DNA sequencing and molecular phylogenetics. The general goal of the course is to introduce students and listeners to the basics of biological data analysis, primarily DNA and protein sequences, approaches to their alignment; DNA sequencing methods and their application for various research tasks; basics of visualization and manipulations with genome sequences using genome browsers; basics of population genetics and phylogenetic analysis. The course is designed for master's and postgraduate students with basic knowledge of genetics and molecular biology. The course aims at acquiring and deepening the existing practical skills of working with biological data and includes a series of seminars and practical classes with the execution of individual tasks using the methods and approaches discussed in the lectures.

All classes will be given in Ukrainian.

Lectures

Lecture 1. Bioinformatics as a text analysis.

Bohdan OSTASH

Definition of bioinformatics. How is it different from computational biology?
Information content of nucleotide and amino acid sequences. Shannon equation
Models that describe the frequency of occurrence of short substrings within a genetic sequence – Bernoulli and Markov chains
Likelihood and Bayesian approaches in bioinformatics – toy examples
Concept of the complexity of genetic sequences. Low complexity regions, their biological significance

Lecture 2. Pairwise sequence alignment.

Bohdan OSTASH

The concept of a pairwise alignment, basic terms (match, mismatch, gaps).
Homology of sequences.
Methods for evaluation of pairwise alignments: mechanistic and empirical approaches
Algorithms - dynamic programming (Smith-Waterman, Needleman-Wunsch) and heuristic (BLAST).
Karlin-Altschul statistics and the Expectation number (E).
Newer developments: Diamond

Lecture 3. Multiple sequence alignment (MSA).

Bohdan OSTASH

Definition
Progressive methods of multiple sequence alignment – CLUSTAL W2.
New tools of multiple sequence alignment (T-COFFEE, MUSCLE, MSAProbs).

Lecture 4. Models based on MSA

Bohdan OSTASH

Consensus sequence
Position-specific score matrices (PSSM). Weblogo
Concept of HMM and a toy example
TMHMM. GeneMark. Pfam. HHPred

Lecture 5. Protein and RNA structures – databases, inference

Bohdan OSTASH

Why biologists are obsessed with structures
PDB
AlphaFold
RNA databases

Lecture 6. The principles of DNA sequencing technologies

Yuriy REBETS

History of DNA sequencing development.
Sanger and Maxam–Gilbert sequencing.
Next-generation sequencing.
Third-generation sequencing.
Principles of genome assembly.
Problems of functional annotation of genomes.

Lecture 7. Everything in biology makes sense only in the light of evolution.

Oleksandr ZINENKO

Origin of genetic variation and basic concepts of evolutionary biology and molecular evolution.
Evolutionary forces, the fate of alleles in the population.
Neutral evolution, mutation/drift equilibrium.
Wondering in the space of genotypes.

Lecture 8. Methods of phylogenetic reconstruction.

Oleksandr ZINENKO

Phylogenetic trees: nomenclature, tree-thinking, editing and format conversion.
Three main approaches to the evolutionary history reconstruction. Distance-based methods, Likelihood methods and Bayesian inference phylogenetic reconstructions.
Substitution models.
Multilocus analysis of phylogenies, coalescent methods.

Lecture 9. Application of phylogenetics.

Oleksandr ZINENKO

Phylogeny as a backbone and null model of evolutionary biology.
Population history, demography and selection.
Method of phylogenetics contrasts.
Test for selection. Fst outliers.

Lecture 10. Phylogenomics.

Oleksandr ZINENKO

Challenges and limitations of whole genome-based phylogenetic reconstruction.
Genotyping by sequencing and artificial data downscaling. DdRAD sequencing.

Seminars

1. NCBI database.

Bohdan OSTASH

Working with NCBI databases – PubMed: doi, PMID, PMCID; GenBank – file structure; Genome – database structure and its use. Genome browser. Taxonomy. GEO.

2. How to use BLAST web service for pairwise alignment and interpret the results.

Bohdan OSTASH

3. Genome browsers.

Yuriy REBETS

4. A pipeline of phylogenetic analysis. Online pipelines, helping to analyze constantly updated datasets.

Oleksandr ZINENKO

5. Demographic reconstructions, selection tests and ddRAD data analyses.

Oleksandr ZINENKO

Timetable

Date	Time	Lecturer	Subject
13 March, Wed	18:00	Bohdan OSTASH	Lecture 1. Bioinformatics as a text analysis.
14 March, Thu	18:00	Bohdan OSTASH	Lecture 2. Pairwise sequence alignment.
16 March, Sat	10:00	Bohdan OSTASH	Seminar 1, group 1. NCBI database.
17 March, Sun	10:00	Bohdan OSTASH	Seminar 1, group 2. NCBI database.
20 March, Wed	18:00	Bohdan OSTASH	Lecture 3. Multiple sequence alignment (MSA).
21 March, Thu	18:00	Bohdan OSTASH	Lecture 4. Models based on MSA.
23 March, Sat	10:00	Bohdan OSTASH	Seminar 2, group 1. How to use BLAST web service for pairwise alignment and interpret the results.
24 March, Sun	10:00	Bohdan OSTASH	Seminar 2, group 2. How to use BLAST web service for pairwise alignment and interpret the results.
27 March, Wed	18:00	Bohdan OSTASH	Lecture 5. Protein and RNA structures – databases, inference.
28 March, Thu	18:00	Yuriy REBETS	Lecture 6. The principles of DNA sequencing technologies.
30 March, Sat	10:00	Yuriy REBETS	Seminar 3, group 1. Genome browsers.
31 March, Sun	10:00	Yuriy REBETS	Seminar 3, group 2. Genome browsers.
3 April, Wed	18:00	Oleksandr ZINENKO	Lecture 7. Everything in biology makes sense only in the light of evolution.
4 April, Thu	18:00	Oleksandr ZINENKO	Lecture 8. Methods of phylogenetic reconstruction.
6 April, Sat	10:00	Oleksandr ZINENKO	Seminar 4, group 1. A pipeline of phylogenetic analysis. Online pipelines, helping to analyze constantly updated datasets.
7 April, Sun	10:00	Oleksandr ZINENKO	Seminar 4, group 2. A pipeline of phylogenetic analysis. Online pipelines, helping to analyze constantly updated datasets.
10 April, Wed	18:00	Oleksandr ZINENKO	Lecture 9. Application of phylogenetics.
11 April, Thu	18:00	Oleksandr ZINENKO	Lecture 10. Phylogenomics.
13 April, Sat	10:00	Oleksandr ZINENKO	Seminar 5, group 1. Demographic reconstructions, selection tests and ddRAD data analyses.
14 April, Sun	10:00	Oleksandr ZINENKO	Seminar 5, group 2. Demographic reconstructions, selection tests and ddRAD data analyses.
17 April, Wed	18:00	Yuriy REBETS	Test

Lecturers

Bioinformatics 03 2024

Bohdan Ostash

Oleksandr Zinenko

Yuriy Rebets