ACTIONet Python

Python bindings for ACTIONet (Action-based Cell-Type Identification and Organism Niche Extraction Tool), a multi-resolution single-cell data analysis toolkit.

This package wraps the C++ backend libactionet without modification, providing a Python interface built on AnnData as the core data container and designed to integrate seamlessly with the scanpy ecosystem.

Features

  • Full C++ backend: Leverages the high-performance libactionet C++ library
  • AnnData integration: Native support for AnnData objects used throughout the Python single-cell ecosystem
  • Scanpy compatibility: Works alongside standard scanpy workflows
  • Multi-resolution analysis: ACTION decomposition for multi-scale archetype discovery
  • Network-based analysis: Build and analyze cell-cell interaction networks
  • Cross-platform: Supports macOS (Intel & Apple Silicon) and Linux (manylinux2014)

Installation

Prerequisites

macOS:

# Install Xcode Command Line Tools
xcode-select --install

# Install dependencies via Homebrew
brew install cmake openblas lapack

Linux (Debian/Ubuntu):

sudo apt-get update
sudo apt-get install build-essential cmake libopenblas-dev liblapack-dev

Conda (rootless / HPC):

conda install -c conda-forge cmake openblas lapack

Install from source

# Clone repository
git clone https://github.com/KellisLab/actionet-python.git
cd actionet-python

# Initialize libactionet submodule
git submodule update --init --recursive

# Install in development mode
pip install -e .

# Or build and install
pip install .

Building with Intel MKL (Recommended for Best Performance)

Intel MKL provides highly optimized BLAS/LAPACK implementations and can significantly improve performance, especially for large matrix operations. ACTIONet will automatically detect and use MKL if available.

Option 1: Using Conda (Easiest)

# Create a new environment with MKL
conda create -n actionet-mkl python=3.12
conda activate actionet-mkl

# Install Intel MKL and build dependencies
conda install -c conda-forge cmake compilers numpy scipy mkl mkl-include

# Clone and build ACTIONet
git clone https://github.com/KellisLab/actionet-python.git
cd actionet-python
git submodule update --init --recursive

# Build with MKL (automatically detected)
pip install -e .

Option 2: Using Intel oneAPI (Most Optimized)

For maximum performance, use Intel's oneAPI toolkit with the Intel C++ compiler:

# Download and install Intel oneAPI Base Toolkit
# https://www.intel.com/content/www/us/en/developer/tools/oneapi/base-toolkit.html

# Source the Intel environment
source /opt/intel/oneapi/setvars.sh

# Set environment variables for Intel MKL and compiler
export CC=icx
export CXX=icpx
export MKLROOT=/opt/intel/oneapi/mkl/latest

# Clone and build
git clone https://github.com/KellisLab/actionet-python.git
cd actionet-python
git submodule update --init --recursive

# Build with Intel compiler and MKL
pip install -e .

Option 3: System MKL (Linux)

# Install Intel MKL from package manager (Ubuntu/Debian)
wget -O- https://apt.repos.intel.com/intel-gpg-keys/GPG-PUB-KEY-INTEL-SW-PRODUCTS.PUB | gpg --dearmor | sudo tee /usr/share/keyrings/oneapi-archive-keyring.gpg > /dev/null
echo "deb [signed-by=/usr/share/keyrings/oneapi-archive-keyring.gpg] https://apt.repos.intel.com/oneapi all main" | sudo tee /etc/apt/sources.list.d/oneAPI.list
sudo apt update
sudo apt install intel-oneapi-mkl intel-oneapi-mkl-devel

# Source MKL environment
source /opt/intel/oneapi/mkl/latest/env/vars.sh

# Build ACTIONet
cd actionet-python
pip install -e .

Verifying MKL Usage

After installation, verify that MKL is being used:

import numpy as np
np.__config__.show()  # Should list MKL in the BLAS/LAPACK build info
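
To cross-check which BLAS and OpenMP libraries are actually loaded at runtime, the optional third-party threadpoolctl package can be used. A minimal sketch, assuming threadpoolctl is installed (it is not a dependency of actionet):

from threadpoolctl import threadpool_info

# Each entry reports a loaded threadpool library (e.g., MKL, OpenBLAS, OpenMP),
# the shared object behind it, and the current thread count
for lib in threadpool_info():
    print(lib["internal_api"], lib.get("filepath"), lib["num_threads"])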

Performance Tuning with MKL

For optimal performance, set these environment variables:

# Use all available cores
export MKL_NUM_THREADS=$(nproc)

# For GNU OpenMP runtime (conda default)
export MKL_THREADING_LAYER=GNU
export OMP_NUM_THREADS=$(nproc)

# For Intel OpenMP runtime (Intel compiler)
export MKL_THREADING_LAYER=INTEL
export OMP_NUM_THREADS=$(nproc)

# Disable MKL's internal threading (if using external parallelization)
export MKL_NUM_THREADS=1
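
The same settings can be applied from Python, provided they are set before numpy (and therefore MKL) is first imported. A minimal sketch, assuming an MKL-backed numpy in a conda/GNU OpenMP environment:

import os

# Threading settings are read when MKL loads, so configure them before importing numpy
os.environ.setdefault("MKL_NUM_THREADS", str(os.cpu_count() or 1))
os.environ.setdefault("MKL_THREADING_LAYER", "GNU")  # GNU OpenMP runtime (conda default)
os.environ.setdefault("OMP_NUM_THREADS", str(os.cpu_count() or 1))

import numpy as np  # imported only after the environment is configured
import actionet as an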

Expected Performance Improvements

With Intel MKL, you should see:

  • 2-4x faster matrix operations (SVD, matrix multiplication), with the largest gains on large datasets (>100K cells)
  • Better multi-threading efficiency
  • Lower memory usage for some operations

Troubleshooting MKL Builds

MKL not detected:

  • Ensure MKLROOT environment variable is set
  • Verify mkl-include is installed (conda) or headers are in /opt/intel/oneapi/mkl/latest/include

Mixed OpenMP runtime warnings:

  • Set MKL_THREADING_LAYER=GNU for conda environments
  • Set MKL_THREADING_LAYER=INTEL for Intel oneAPI builds
  • Or disable OpenMP: pip install . -C cmake.define.LIBACTIONET_OPENMP_RUNTIME=OFF

Link errors with Intel compiler:

  • Ensure setvars.sh is sourced before building
  • Try: export LDFLAGS="-L${MKLROOT}/lib/intel64"

Quick Start

import scanpy as sc
import actionet as an

# Load data
adata = sc.read_h5ad("your_data.h5ad")

# Preprocess
sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)

# ACTIONet pipeline
an.reduce_kernel(adata, n_components=50)  # Kernel reduction
an.run_action(adata, k_min=2, k_max=30)   # ACTION decomposition
an.build_network(adata)                    # Build cell network
an.layout_network(adata)                   # UMAP layout

# Feature specificity
an.compute_feature_specificity(adata, labels='assigned_archetype')

# Visualize
sc.pl.embedding(adata, basis='X_umap', color='assigned_archetype')

Core Functions

Dimensionality Reduction

an.reduce_kernel(adata, n_components=50, layer=None, key_added='action')

Compute reduced kernel matrix using SVD. Automatically selects the optimal SVD algorithm based on matrix properties (sparse vs dense, size, sparsity) with negligible overhead (~1-2 microseconds).

Available algorithms:

  • IRLB (default for sparse): Implicitly Restarted Lanczos Bidiagonalization
  • Halko (default for dense): Randomized SVD (fastest for dense matrices)
  • PRIMME (auto-selected for large sparse): Memory-efficient for huge sparse matrices
  • Feng: Alternative randomized method

ACTION Decomposition

an.run_action(adata, k_min=2, k_max=30, reduction_key='action')

Multi-resolution archetypal analysis to identify cell states.

Network Construction

an.build_network(adata, archetype_key='H_stacked', 
                 algorithm='k*nn', distance_metric='jsd')

Build cell-cell interaction network from archetype footprints.

Network Diffusion

an.compute_network_diffusion(adata, scores='H_merged', 
                              network_key='actionet', alpha=0.85)

Smooth scores over the network topology.

Feature Specificity

an.compute_feature_specificity(adata, labels='assigned_archetype')

Compute archetype/cluster-specific marker genes.
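
The resulting scores are stored in adata.varm (see the AnnData Structure section below). A minimal post-processing sketch, assuming specificity_upper is stored as a dense genes × archetypes matrix:

import pandas as pd

# Gene-by-archetype specificity scores; densify first if the matrix is stored sparse
spec = pd.DataFrame(adata.varm['specificity_upper'], index=adata.var_names)

# Top 10 upper-tail markers for each archetype/cluster (one column per group)
top_markers = {group: spec[group].nlargest(10).index.tolist() for group in spec.columns}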

Layout Visualization

an.layout_network(adata, network_key='actionet',
                  method='umap', n_components=2)

Compute 2D/3D layout using UMAP or t-SNE.

Batch Correction

an.correct_batch_effect(adata, batch_key='batch',
                        reduction_key='action',
                        corrected_key='action_corrected')

Remove batch effects from reduced representation using orthogonalization.
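
The corrected embedding can then be fed back into downstream steps. A hedged sketch, assuming run_action accepts the corrected key through its reduction_key argument:

# Rebuild downstream results from the batch-corrected reduction
an.run_action(adata, k_min=2, k_max=30, reduction_key='action_corrected')
an.build_network(adata)
an.layout_network(adata)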

an.correct_basal_expression(adata, basal_genes=['ACTB', 'GAPDH'],
                            reduction_key='action',
                            corrected_key='action_basal_corrected')

Correct for basal/housekeeping gene expression.

Imputation

an.impute_features(adata, features=['GENE1', 'GENE2'],
                   network_key='actionet', alpha=0.85)

Impute gene expression using network diffusion.

an.impute_from_archetypes(adata, features=['GENE1', 'GENE2'],
                          H_key='H_merged')

Impute expression from archetype profiles.

an.smooth_kernel(adata, reduction_key='action',
                 smoothed_key='action_smoothed', alpha=0.85)

Smooth reduced representation using network diffusion.

AnnData Structure

ACTIONet stores results in standard AnnData slots:

  • adata.obsm: Dimensionality reductions and archetype matrices

    • action: Reduced kernel representation
    • H_stacked: Stacked archetype matrix (all scales)
    • H_merged: Merged archetype matrix (unified)
    • X_umap: UMAP coordinates
  • adata.obsp: Cell-cell networks

    • actionet: ACTIONet graph adjacency matrix
  • adata.obs: Cell metadata

    • assigned_archetype: Discrete archetype assignments
  • adata.varm: Gene/feature metadata

    • specificity_upper: Marker gene scores (upper-tail)
    • specificity_lower: Marker gene scores (lower-tail)
  • adata.uns: Parameters and auxiliary data

    • action_params: Kernel reduction parameters
    • action_results: ACTION decomposition results (C matrices)
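
After running the Quick Start pipeline, these slots can be read back with standard AnnData accessors. A minimal sketch:

import scipy.sparse as sp

# Number of cells per discrete archetype assignment
print(adata.obs['assigned_archetype'].value_counts())

# Reduced kernel and merged archetype footprints (cells × components)
reduction = adata.obsm['action']
footprints = adata.obsm['H_merged']

# ACTIONet graph as a sparse cell-cell adjacency matrix
graph = sp.csr_matrix(adata.obsp['actionet'])
print(graph.shape, graph.nnz)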

R → Python API Mapping

R Function | Python Function | Notes
reduce() | an.reduce_kernel() | Kernel reduction
runACTION() | an.run_action() | ACTION decomposition
buildNetwork() | an.build_network() | Network construction
computeNetworkDiffusion() | an.compute_network_diffusion() | Network smoothing
compute_archetype_feature_specificity() | an.compute_feature_specificity() | Marker genes
layoutNetwork() | an.layout_network() | UMAP/t-SNE layout
runSVD() | an.run_svd() | SVD decomposition
orthogonalizeBatchEffect() | an.correct_batch_effect() | Batch correction
orthogonalizeBasal() | an.correct_basal_expression() | Basal correction
imputeFeatures() | an.impute_features() | Network diffusion imputation
imputeFromArchetypes() | an.impute_from_archetypes() | Archetype-based imputation
smoothKernel() | an.smooth_kernel() | Kernel smoothing
colMaps(ace) | adata.obsm | Cell-level embeddings
colNets(ace) | adata.obsp | Cell-level networks
metadata(ace) | adata.obs | Cell annotations
rowMaps(ace) | adata.varm | Gene-level annotations

Data Container Translation

R (ACTIONetExperiment) | Python (AnnData)
assays(ace)$counts | adata.X or adata.layers['counts']
assays(ace)$logcounts | adata.layers['logcounts']
colMaps(ace)$ACTION | adata.obsm['action']
colMaps(ace)$H_stacked | adata.obsm['H_stacked']
colNets(ace)$ACTIONet | adata.obsp['actionet']
metadata(ace)$assigned_archetype | adata.obs['assigned_archetype']
rowMaps(ace)$specificity | adata.varm['specificity_upper']
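
When porting an R workflow, it helps to stash the raw counts in a layer before normalization so that both the counts and logcounts rows above have a Python counterpart. A small sketch, assuming scanpy is imported as sc as in the Quick Start:

# Preserve raw counts before normalization, mirroring assays(ace)$counts
adata.layers['counts'] = adata.X.copy()

sc.pp.normalize_total(adata, target_sum=1e4)
sc.pp.log1p(adata)

# The normalized, log-transformed matrix mirrors assays(ace)$logcounts
adata.layers['logcounts'] = adata.X.copy()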

Examples

See examples/ directory for complete workflows:

  • 01_basic_pipeline.py: End-to-end ACTIONet analysis
  • 02_graph_building.py: Network construction strategies
  • 03_integration_with_scanpy.py: Integration with scanpy workflows
  • 04_batch_correction_imputation.py: Batch correction and imputation workflows

Building From Source

Build Configuration

The build system uses scikit-build-core with CMake. Key options:

# Standard build
pip install .

# Enable architecture-specific optimizations (Linux only)
pip install . -C cmake.define.ACTIONET_ENABLE_OPTIMIZED=ON

# Verbose build output
pip install . -v

Platform-Specific Notes

macOS:

  • Default deployment target: macOS 11.0
  • Builds native architecture (x86_64 or arm64)
  • Uses Accelerate framework for BLAS/LAPACK
  • Set CMAKE_OSX_ARCHITECTURES for cross-compilation

Linux:

  • Targets manylinux2014 (glibc ≥ 2.17)
  • OpenMP runtime defaults to AUTO (compiler-selected); override with -C cmake.define.LIBACTIONET_OPENMP_RUNTIME=GNU|INTEL|LLVM|OFF
  • For best performance, consider building with Intel MKL (see installation section above)

Troubleshooting

Submodule not initialized:

git submodule update --init --recursive

Missing Armadillo: Armadillo headers are bundled in libactionet/include/extern. If CMake can't find them, check submodule status.

OpenMP warnings: OpenMP is optional. If it is unavailable, the package builds with single-threaded C++ code; parallelism can still be achieved at the Python level (e.g., via multiprocessing). If you use an MKL-backed numpy (e.g., from conda), avoid mixing OpenMP runtimes: set MKL_THREADING_LAYER=GNU when using GNU OpenMP, or select Intel OpenMP when building with an Intel toolchain.

Examples:

# Force GNU OpenMP and align MKL
MKL_THREADING_LAYER=GNU pip install . -C cmake.define.LIBACTIONET_OPENMP_RUNTIME=GNU

# Use Intel OpenMP (Intel/IntelLLVM toolchains)
pip install . -C cmake.define.LIBACTIONET_OPENMP_RUNTIME=INTEL

Linking errors on macOS: Ensure Xcode Command Line Tools are installed and up to date.

Development

Running Tests

pip install -e ".[dev]"
pytest tests/ -v

Code Formatting

black src/actionet examples tests
ruff check src/actionet examples tests

Omissions Report

The following R package components are not implemented in this Python translation:

Omitted Components

  1. R-specific visualization helpers (plots.R, r_visualization.R, utils_plotting_*.R)

    • Reason: Python ecosystem has mature alternatives.
    • Alternative: Use scanpy.pl.* functions for visualization.
  2. R-specific data I/O (data.R)

    • Reason: AnnData provides native I/O; scanpy handles format conversion.
    • Alternative: scanpy.read_*() and adata.write_h5ad().
  3. Parallel backend utilities (utils_parallel.R)

    • Reason: Different parallelization approach. C++ threading via OpenMP is preserved.
    • Alternative: Use n_threads parameter in functions.
  4. Enrichment database utilities (enrichment.R)

    • Reason: Python has dedicated packages.
    • Alternative: Use gseapy, gprofiler-official, or decoupler.
  5. Projection (projection.R)

    • Reason: Requires R-specific reference dataset handling.
    • Alternative: Future work or use scvi-tools for reference mapping.
  6. Pseudobulk DGE (pseudobulk_DGE.R)

    • Reason: Wrapper around R statistics packages.
    • Alternative: Use scanpy.tl.rank_genes_groups(), pydeseq2, or call R via rpy2.
  7. Marker detection helpers (marker_detection.R)

    • Reason: High-level R wrappers.
    • Alternative: an.compute_feature_specificity() + post-processing.
  8. Alignment (alignment.R)

    • Reason: Multi-dataset alignment utilities specific to R workflows.
    • Alternative: Use scanpy integration tools or scvi-tools.
  9. Filter ACE (filter_ace.R)

    • Reason: ACTIONetExperiment-specific filtering.
    • Alternative: Standard AnnData filtering: adata[adata.obs['column'] > threshold, :].
  10. Autocorrelation statistics (Moran's I, Geary's C)

    • Reason: Low usage; available via squidpy for spatial data.
    • Alternative: squidpy.gr.spatial_autocorr().
  11. Maximum-weight matching (MWM)

    • Reason: Utility for batch alignment (omitted feature).
    • Alternative: scipy.optimize.linear_sum_assignment().
  12. XICOR correlation

    • Reason: Specialized rank-based correlation; niche use case.
    • Alternative: scipy.stats.spearmanr() or pandas correlation methods.

Implemented Core Modules

  • Action decomposition: All archetypal analysis functions
  • Network construction: Full graph building pipeline
  • Network diffusion: Smoothing and propagation
  • Feature specificity: Marker gene identification
  • SVD/Kernel reduction: Dimensionality reduction
  • Visualization layouts: UMAP/t-SNE via C++ backend
  • Matrix operations: Aggregation, normalization, transforms

License

GPL-3.0 (same as R package)
