This repository contains homeworks completed as part of the Python course during the annual retraining program at the Bioinformatics Institute (2023/2024). The required third-party dependencies are listed in the requirements.txt.
Topics covered: basic and advanced Python syntax, functions, working with files, OOP, API and web page parsing, code testing.
Below is a brief description of the module:
Biopython-based FASTQ records filter working according to the specified criteria.
OOP-based class that allows to perform simple operations with biological sequences (DNA, RNA, amino acid sequences).
Decorator for logging function execution and sending logs to Telegram bot. It is written without using libraries that automate the creation of a Telegram bot.
Python API for http://hollywood.mit.edu/GENSCAN.html. Represents the output of the GENSCAN prediction.
Function for conversion of multi-line FASTA sequences to single-line sequences.
Function that selects neighbor genes for the gene of interest from the GBK file and writes their protein sequences into FASTA format.
Context manager for reading FASTA files.
Class for custom implementation of a Random Forest classifier. It uses thread parallelization to get results faster.
A set of tests using the pytest library that checks the correct execution of some module functions.
Contains selected examples of how this module works.