Python for Biologists

Details of the curriculum

The course comprises lectures and hands-on sessions devoted to programming in Python, specifically designed for biologists who are new to programming. It focuses on Python to address applied problems in statistics and bioinformatics. We start with fundamental programming concepts, scripting, and file manipulation before moving on to the use of specialized packages for handling biological data.

All classes will be given in Ukrainian.

Lectures
Lecture 1. Basics of computer architecture: memory, processors, GPU.
Taras VASYLYSHYN

Programming languages and their applications. Advantages and disadvantages of Python. Setting up your first project. Fundamentals of writing programs and code formatting in Python.

Lecture 2. Data types in Python.
Taras VASYLYSHYN

Numeric Types and Boolean: Understanding integers, floats and the usage of Boolean values. Collections: Exploring sequences like Strings, Lists, Tuples, Dictionaries and Sets. Testing code.

Lecture 3. Functions.
Taras VASYLYSHYN

Defining and invoking functions, parameters, and return values. Control Structures. Loops: Using “for” and “while” loops for repeated execution. Logical Expressions: Implementing “if”, “else”, and “elif” statements for decision making.

Lecture 4. Basic string operations.
Taras VASYLYSHYN

String formatting. Finding substrings. Comparing strings and checking string properties. Regular expressions.

Lecture 5. Working with different types of text files in Python.
Taras VASYLYSHYN

Reading and Writing Text Files. Handling CSV Files. File Manipulation.

Lecture 6. Statistics in Python.
Viktor HUSAK

Descriptive statistics of data series. Testing sample for normal distribution.

Lecture 7. Analysis of variance (ANOVA). Correlation and regression analysis.
Viktor HUSAK
Lecture 8. Basic data visualization.
Viktor HUSAK

(Seaborn and Matplotlib). Creating plots for biological data: scatter plots, bar plots, box plots, heatmaps, volcano plots. Adding error bars and statistical annotations. Plot customization with proper labels and legends. Export of high-resolution images.

Lecture 9. Biopython application to genomic data.
Sviatoslav KHARUK

FASTA file reading/writing. Sequence object manipulation, DNA/RNA transcription and translation. Sequence statistics calculation.

Lecture 10. Pandas.
Sviatoslav KHARUK

DataFrame operations for biological data - reading CSV/TSV files, filtering, sorting, merging datasets. Handling of missing values. Basic data cleaning. Expression data normalization.

Seminars
Seminar 1. Introduction to Python. Code formatting in Python. Input and output.
Taras VASYLYSHYN
Seminar 2. Numerical data types. Precision. Arithmetic operations. Mathematical functions.
Taras VASYLYSHYN
Seminar 3. Testing code.
Taras VASYLYSHYN
Seminar 4. Boolean data type. Logical operations.
Taras VASYLYSHYN
Seminar 5. Loops and branching.
Taras VASYLYSHYN
Seminar 6. Lists and Tuples.
Taras VASYLYSHYN
Seminar 7. Dictionaries and Sets.
Taras VASYLYSHYN
Seminar 8. String operations.
Taras VASYLYSHYN
Seminar 9. Regular expressions.
Taras VASYLYSHYN
Seminar 10. Working with text files.
Taras VASYLYSHYN
Seminar 11. Biological data visualization: Seaborn and Matplotlib.
Sviatoslav KHARUK
Seminar 12. Work with big datasets.
Sviatoslav KHARUK
Seminar 13. Applied project: software for statistical analysis (I) – interface.
Viktor HUSAK
Seminar 14. Applied project: software for statistical analysis (II) – implementation of descriptive statistics.
Viktor HUSAK
Seminar 15. Applied project: software for statistical analysis (III) – implementation of outliers.
Viktor HUSAK
Seminar 16. Applied project: software for statistical analysis (IV) – implementation of distribution normality check.
Viktor HUSAK
Seminar 17. Applied project: software for statistical analysis (V) – comparing experimental results: Student's t-Test.
Viktor HUSAK
Seminar 18. Applied project: software for statistical analysis (VI) – comparing experimental results: One-way ANOVA (ANalysis Of VAriance) with post-hoc Tukey HSD (Honestly Significant Difference). Test Calculator for comparing multiple treatments.
Viktor HUSAK
Seminar 19. Applied project: software for statistical analysis (VII) – One-way ANOVA with post-hoc Dunn Test Calculator for comparing multiple treatments.
Viktor HUSAK
Seminar 20. Applied project: software for statistical analysis (VIII) – implementation of correlation and regression.
Viktor HUSAK
Level
Bachelor and master students
Lectures
10
Practical classes
20
Duration
2 Months
Language
Ukrainian
Certificate
2 credits ECTS
Lecturers

Leading specialist at the Department of Biochemistry and Biotechnology at Vasyl Stefanyk Precarpathian National University, Ivano-Frankivsk.

Doctor of Science in Physics and Mathematics, Professor of the Department of Mathematical and Functional Analysis of PNU where he teaches several courses including "Statistics and Python"

Associate Professor of the Department of Biochemistry and Biotechnology at the Vasyl Stefanyk Precarpathian National University.