BioGen

BioGen is a toolkit for genetic data processing. It allows you to evaluate, validate, and prepare your nucleotide sequences for downstream analysis, including FASTQ filtering by GC%, length, and quality; FASTA reformatting; BLAST best-hit extraction; and retrieval of flanking genes from GenBank files.

Authors:

Andrey Nekrasov
Gregor Mendel
James D. Watson
Francis H.C. Crick

BioGen documentation is available at https://site.lol/this/link/does/not/exists

Installation

You don't need that...

Usage and examples

main.py

run_dna_rna_tools - Performs validation and various operations on a nucleotide sequence (DNA or RNA).

run_dna_rna_tools('TTUU', 'is_nucleic_acid') # False !!
run_dna_rna_tools('ATG', 'transcribe') # 'AUG'
run_dna_rna_tools('ATG', 'reverse') # 'GTA'
run_dna_rna_tools('AtG', 'complement') # 'TaC'
run_dna_rna_tools('ATg', 'reverse_complement') # 'cAT'
run_dna_rna_tools('ATG', 'aT', 'reverse') # ['GTA', 'Ta']

filter_fastq - Filter FASTQ reads by GC%, length, and mean Phred quality; write passing reads to output.

filter_fastq(seqs = EXAMPLE_FASTQ, gc_bounds = (20, 80), length_bounds = (10, 30), quality_threshold = 10)

bio_files_processor.py

convert_multiline_fasta_to_oneline – Converts multi-line FASTA sequences into a single-line format and saves the result.

convert_multiline_fasta_to_oneline(input_fasta: "input.fasta", output_fasta: "output.fasta")

parse_blast_output - Parses a BLAST output file and keeps only the best match for each query sequence.

parse_blast_output(input_file: "input.txt", output_file: "output.txt")

select_genes_from_gbk_to_fasta - Extracts the nearest flanking genes of a target gene from a GenBank file and saves them in FASTA format.

select_genes_from_gbk_to_fasta(input_gbk: "input.gbk", genes: ["gene1", "gene2"], n_before: 20, n_after: 25, output_fasta: "output.fasta")

License

The MIT License (Made In Tears)

Disclaimer: The author takes no responsibility for any data loss, mental breakdowns, or existential crises caused by using this toolkit. You chose this path.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
modules		modules
.gitignore		.gitignore
README.md		README.md
bio_files_processor.py		bio_files_processor.py
biogen_logo.png		biogen_logo.png
main.py		main.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BioGen

Installation

Usage and examples

main.py

bio_files_processor.py

License

Disclaimer: The author takes no responsibility for any data loss, mental breakdowns, or existential crises caused by using this toolkit. You chose this path.

About

Uh oh!

Releases

Packages

Languages

CaptainLabMan/biogen_tools

Folders and files

Latest commit

History

Repository files navigation

BioGen

Installation

Usage and examples

main.py

bio_files_processor.py

License

Disclaimer: The author takes no responsibility for any data loss, mental breakdowns, or existential crises caused by using this toolkit. You chose this path.

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages