Skip to content

Exploring phase variation in bacterial immune systems through DNA inversion analysis. Investigating associations between invertons and restriction-modification systems in prokaryotes. Bioinformatics project for BI 2025.

Notifications You must be signed in to change notification settings

PolinaKalitina/Invertons

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Invertons as a source of programmed variability in the bacterial immunity systems

Repository structure

📂 Invertons/
├── 📂 code/
│   ├── 📂 notebooks/
│   │   ├── 🔬 Domain visualization.ipynb    # Protein domain visualization
│   │   └── 📊 Labjournal - general pipeline.ipynb  # Main analysis and graphs
│   │
│   └── 📂 environments/    # Environment files
│       ├── ⚙️ *.yml  # Conda environments for command-line tools
│       └── ⚙️ *.txt  # Pip-dependencies for Python and R scripts
│
├── 📂 data/                # All the data
│   ├── 📂 experimental data/
│   │   ├── 🧬 genes/       # .dna-files (gene sequences before and after inversion)
│   │   ├── 🧩 clusters.tsv          # Protein clustering results
│   │   ├── 🏷️ domains_genome_coords.tsv  # Domain coordinates
│   │   └── 🔀 invs_systems_intersected.tsv  # Invertons and associated genes
│   │
│   └── 📂 initial data/    # Inverton annotation
│
└── 📂 images/
    └── 🖼️ Domain structure and protein structure.pdf  # Finalized images

General information

Aim and objectives

Aim: explore the possibility of DNA inversions association with changes in bacterial immunity systems

Objectives:

  • For putative DNA inversions events, confirm the presence of inversions with currently existing approaches;
  • Explore the possibility of DNA inversion in currently unknown defense genes within defense islands;
  • Perform search of recombinases, associated with defense islands in bacteria genomes;
  • Identify and classify events of DNA inversions within defense-related genes;
  • Evaluate functional consequences of DNA inversion for each event.

Abstract

Bacterial defense systems, such as restriction-modification (RM) and CRISPR-Cas systems, employ diverse strategies to combat phages and mobile genetic elements (MGEs). Phase variation, a stochastic mechanism generating phenotypic heterogeneity, regulates some RM systems by altering DNA methylation specificity via S-subunit shuffling or target recognition domain (TRD) recombination. While invertible DNA sequences (invertons) are known to mediate phase variation in other contexts, their role in defense system regulation remains underexplored.

This study investigates the association between intragenic DNA inversions and bacterial immunity systems. Analyzing 1,601 inversion events across bacterial genomes, we identified only 21 cases overlapping RM system genes (Type I and II), with no evidence of widespread regulatory function. Most inversions (52.6%) led to gene inactivation via premature stop codons, while 31.6% truncated S-subunit TRDs, potentially altering target specificity. Notably, 15.8% of inversions had no discernible functional impact.

Our findings suggest that invertons occur stochastically in RM systems, primarily under neutral selection, rather than serving as a conserved immune regulatory mechanism. The absence of inversion-associated integrases and the lack of clustered domain architectures further support random integration. This work clarifies the limited role of DNA inversions in bacterial immunity and highlights the evolutionary constraints shaping defense system plasticity.

Results and conclusion

Results:

  • For putative DNA inversions events, the presence of 21 inversions within defense genes was confirmed with currently existing approaches;
  • Clustering of other invertable proteins within defense islands was uninformative;
  • Two integrases, associated with invertable defense genes in bacteria genomes were found;
  • Inversions occurred only within RM I and RM II systems involving S-subunit genes (66.7%), RM II Type G proteins (14.3%), restrictases (14.3%) and methylases (4.7%);
  • There types of inversion events of DNA inversions within defense-related genes were identified: loss of function for 28.6%, change in specificity for 42.9%, no apparent changes for 23.8%.

Conclusion: the vast majority of immune systems in the genomes of bacteria capable of DNA inversion are not subject to phase variations. DNA inversions associated with immune genes are more often within RM systems and lead to a change in specificity.

Data

Data used was composed of full genome sequences assembled from long reads and available though NCBI. Inverton annotation for the genomes was obtained from Chanin et. al, 2024 - table can be found in ./data/initial_data. Python script for data aquisition can be found in ./code/notebooks/Labjournal - general pipeline.ipynb in Data selection and extraction section.

System Requirements

Hardware

  • Memory: 48 GB RAM (recommended for large genome datasets)
  • Storage: 500 GB available space (external SSD/NVMe recommended)
  • Processor: Intel Core i7-1065G7 or equivalent (4+ cores)

Software

  • OS: Windows 10/11 with WSL2 (Ubuntu 22.04 LTS)
  • Core tools:
    • Conda (for environment management)
    • DefenseFinder 2.0.0 + PADLOC 2.0.0 (defense system annotation)
    • BEDTools 2.30.0 (genomic coordinate operations)
  • Visualization: R/gggenes + Python/Matplotlib

About

Exploring phase variation in bacterial immune systems through DNA inversion analysis. Investigating associations between invertons and restriction-modification systems in prokaryotes. Bioinformatics project for BI 2025.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published