A production-ready CLI framework for privacy-preserving machine learning benchmarking. Converts notebook-based experiments into a modular, reproducible command-line interface.
# Install UV (if not already installed)
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install PrivacyBench in development mode
pip install -e .
# Verify installation
privacybench --help

# List available experiments, datasets, models
privacybench list all
# Run CNN baseline experiment
privacybench run --experiment cnn_baseline --dataset alzheimer
# Run federated learning experiment
privacybench run --experiment fl_cnn --dataset alzheimer
# Validate configuration file
privacybench validate --config configs/experiments/baselines/cnn_alzheimer.yaml
# Dry run (validate without execution)
privacybench run --experiment vit_baseline --dataset skin_lesions --dry-run

`privacybench run`: Execute privacy-preserving ML experiments.
Options:
- `--experiment`: Experiment type (cnn_baseline, vit_baseline, fl_cnn, dp_cnn, etc.)
- `--dataset`: Dataset to use (alzheimer, skin_lesions)
- `--config`: Custom YAML configuration file
- `--output`: Output directory for results (default: ./results)
- `--dry-run`: Validate configuration without execution
- `--seed`: Random seed for reproducibility (default: 42)
Examples:
privacybench run --experiment cnn_baseline --dataset alzheimer
privacybench run --experiment fl_dp_cnn --dataset skin_lesions --output ./my_results
privacybench run --config experiments/custom_config.yaml

`privacybench list`: Display available components.
Options:
- `experiments`: List available experiments
- `datasets`: List available datasets
- `models`: List available model architectures
- `privacy`: List available privacy techniques
- `all`: List everything (default)
Examples:
privacybench list experiments
privacybench list datasets
privacybench list all

`privacybench validate`: Validate experiment configurations.
Options:
- `--config`: YAML configuration file to validate (required)
- `--verbose`: Show detailed validation information
Examples:
privacybench validate --config configs/experiments/baselines/cnn_alzheimer.yaml
privacybench validate --config my_experiment.yaml --verbose

| Experiment | Model | Privacy | Dataset Support | Expected Accuracy |
|---|---|---|---|---|
| cnn_baseline | ResNet18 | None | alzheimer, skin_lesions | 97.9% (alzheimer) |
| vit_baseline | ViT-Base/16 | None | alzheimer, skin_lesions | 99.0% (alzheimer) |
| fl_cnn | ResNet18 | Federated Learning | alzheimer, skin_lesions | ~98.0% (alzheimer) |
| fl_vit | ViT-Base/16 | Federated Learning | alzheimer, skin_lesions | TBD |
| dp_cnn | ResNet18 | Differential Privacy | alzheimer, skin_lesions | TBD |
| dp_vit | ViT-Base/16 | Differential Privacy | alzheimer, skin_lesions | TBD |
| fl_dp_cnn | ResNet18 | FL + DP | alzheimer, skin_lesions | TBD |
| smpc_cnn | ResNet18 | SMPC | alzheimer, skin_lesions | TBD |
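The matrix above can also be captured programmatically, e.g. for scripting experiment sweeps. A minimal sketch — the `EXPERIMENTS` table below is transcribed from the README table and is illustrative, not a PrivacyBench API:

```python
# Experiment matrix transcribed from the table above; keys are the values
# accepted by `privacybench run --experiment`. Illustrative only.
EXPERIMENTS = {
    "cnn_baseline": ("ResNet18", None),
    "vit_baseline": ("ViT-Base/16", None),
    "fl_cnn": ("ResNet18", "federated_learning"),
    "fl_vit": ("ViT-Base/16", "federated_learning"),
    "dp_cnn": ("ResNet18", "differential_privacy"),
    "dp_vit": ("ViT-Base/16", "differential_privacy"),
    "fl_dp_cnn": ("ResNet18", "fl+dp"),
    "smpc_cnn": ("ResNet18", "smpc"),
}

def experiments_for(privacy):
    """Names of experiments using the given privacy technique (None = baseline)."""
    return [name for name, (_, p) in EXPERIMENTS.items() if p == privacy]
```

For example, `experiments_for(None)` yields the two baseline experiments, which could then be passed to `privacybench run` in a loop.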
Alzheimer:
- Classes: 4 (NonDemented, VeryMildDemented, MildDemented, ModerateDemented)
- Size: ~6,400 images
- Type: Medical imaging
- Usage: `--dataset alzheimer`

Skin Lesions:
- Classes: 8 skin lesion types
- Size: ~10,000 images
- Type: Medical imaging
- Usage: `--dataset skin_lesions`
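The two datasets can be summarised in code when driving parameterised sweeps. The figures below are transcribed from the descriptions above; the catalogue itself is a hypothetical helper, not a PrivacyBench API:

```python
# Dataset catalogue transcribed from the descriptions above; counts are
# approximate README figures, not an actual PrivacyBench API.
DATASETS = {
    "alzheimer": {"num_classes": 4, "approx_images": 6_400},
    "skin_lesions": {"num_classes": 8, "approx_images": 10_000},
}

# Rough average images per class (real class balance may differ),
# useful when sizing per-client splits for federated experiments.
for name, info in DATASETS.items():
    print(name, info["approx_images"] // info["num_classes"])
```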
No Privacy (Baseline):
- Standard training without privacy constraints
- Fastest training, best accuracy
- Use: `cnn_baseline`, `vit_baseline`

Federated Learning (FL):
- Distributed training without sharing raw data
- Moderate privacy, good utility
- Use: `fl_cnn`, `fl_vit`

Differential Privacy (DP):
- Mathematical privacy guarantees via noise injection
- Strong privacy, reduced utility
- Use: `dp_cnn`, `dp_vit`

Secure Multi-Party Computation (SMPC):
- Cryptographic secure aggregation
- Very strong privacy, significant overhead
- Use: `smpc_cnn`, `smpc_vit`

Hybrid Approaches:
- FL + DP: Combined federated training with differential privacy
- FL + SMPC: Federated training with cryptographic security
- Use: `fl_dp_cnn`, `fl_smpc_cnn`
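The naming scheme above is regular: privacy techniques prefix the model family, and baselines use a suffix. A hypothetical helper that derives the `--experiment` name from a chosen set of techniques (illustrative, not part of the CLI):

```python
# Hypothetical helper: derive the --experiment name from a model family
# ("cnn" or "vit") and a set of privacy techniques. Not a PrivacyBench API.
def experiment_name(model, techniques):
    order = ["fl", "dp", "smpc"]  # prefix order used by the naming scheme
    if not techniques:
        return f"{model}_baseline"  # e.g. cnn_baseline
    prefix = "_".join(sorted(techniques, key=order.index))
    return f"{prefix}_{model}"      # e.g. fl_dp_cnn
```

For instance, `experiment_name("cnn", ["dp", "fl"])` normalises the order and returns `fl_dp_cnn`, matching the table of experiments.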
privacybench/
├── cli/        # CLI interface and commands
├── core/       # Component registry and wrappers (Phase 2)
├── execution/  # Experiment execution engine (Phase 3)
├── output/     # Results collection and export (Phase 4)
├── utils/      # Utilities and helpers
├── legacy/     # Preserved existing code
├── configs/    # YAML configuration files
├── tests/      # Test suite (Phase 5)
└── examples/   # Usage examples
metadata:
  name: "cnn_baseline_alzheimer"
  experiment_type: "cnn_baseline"
  dataset: "alzheimer"

dataset:
  name: "alzheimer"
  config:
    augmentation: true
    test_split: 0.08
    validation_split: 0.1

model:
  architecture: "cnn"
  config:
    pretrained: true
    num_classes: 4
    dropout: 0.1

privacy:
  techniques: []  # No privacy for baseline

training:
  epochs: 50
  batch_size: 32
  learning_rate: 0.0002
  optimizer: "adam"
  tolerance: 7

output:
  directory: "./results"
  save_model: true
  export_formats: ["json", "csv"]

Federated Learning:
privacy:
  techniques:
    - name: "federated_learning"
      config:
        num_clients: 3
        num_rounds: 5
        strategy: "FedAvg"

Differential Privacy:
privacy:
  techniques:
    - name: "differential_privacy"
      config:
        epsilon: 1.0
        delta: 1e-5
        noise_multiplier: 1.0
        max_grad_norm: 1.0

Combined FL + DP:
privacy:
  techniques:
    - name: "federated_learning"
      config:
        num_clients: 3
        num_rounds: 5
    - name: "differential_privacy"
      config:
        epsilon: 1.0
        delta: 1e-5

EXPERIMENT COMPLETED SUCCESSFULLY
========================================
Experiment: cnn_baseline_alzheimer
Dataset: alzheimer
Model: cnn
Privacy: None (Baseline)
Duration: 588.0 seconds

PERFORMANCE METRICS:
• Accuracy: 97.90%
• F1 Score: 0.9785
• ROC AUC: 0.9958

RESOURCE CONSUMPTION:
• Training Time: 588.0 seconds
• Peak GPU Memory: 1.20 GB
• Energy Consumed: 0.026000 kWh
• CO2 Emissions: 0.011830 kg

Results saved to: ./results/exp_20250131_123456
results/exp_20250131_123456/
├── results.json   # Complete results
├── metrics.csv    # Key metrics table
├── summary.md     # Human-readable summary
├── config.yaml    # Experiment configuration
└── emissions.csv  # CodeCarbon energy data
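Because every run exports machine-readable files, downstream analysis can be scripted. A sketch of post-processing the metrics, assuming a hypothetical `results.json` shape that mirrors the console summary above (the real schema may differ):

```python
import json

# Assumed, hypothetical shape of results.json, mirroring the console
# summary above; the real schema produced by PrivacyBench may differ.
results = json.loads("""{
  "experiment": "cnn_baseline_alzheimer",
  "metrics": {"accuracy": 0.979, "f1_score": 0.9785, "roc_auc": 0.9958},
  "resources": {"train_seconds": 588.0, "peak_gpu_gb": 1.2,
                "energy_kwh": 0.026, "co2_kg": 0.01183}
}""")

# Example derived figure of merit: energy spent per accuracy point,
# useful when comparing privacy techniques on a utility/cost axis.
kwh_per_point = results["resources"]["energy_kwh"] / (results["metrics"]["accuracy"] * 100)
print(f"{kwh_per_point:.6f} kWh per accuracy point")
```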
Phase 1 (Current): CLI Foundation & Configuration
- CLI entry point and command structure
- YAML configuration parsing and validation
- Integration with existing experiments.yaml
- Commands: `privacybench list`, `privacybench validate`, `privacybench run --dry-run`

Phase 2 (Next): Component System & Wrappers
- Component registry for datasets, models, privacy techniques
- Wrapper implementations around existing code
- Command: full `privacybench run` without execution

Phase 3 (Future): Execution Engine
- Complete experiment orchestration
- Integration with existing training code
- Command: full experiment execution

Phase 4 (Future): Results & Export System
- Results collection and formatting
- Multiple export formats and comparisons

Phase 5 (Future): Testing & Production Polish
- Comprehensive test suite
- Production-ready packaging
# Clone and install in development mode
git clone <repository>
cd privacybench
pip install -e .
# Run tests (Phase 5)
pytest tests/
# Validate installation
privacybench --help
privacybench list all

Adding a new experiment:
- Add experiment to `legacy/experiments.yaml`
- Map CLI name in `cli/parser.py`
- Add to choices in `cli/main.py`
- Test: `privacybench run --experiment new_experiment --dry-run`

Adding a new dataset:
- Create wrapper in `core/datasets/` (Phase 2)
- Register in component registry
- Add validation rules
- Test with existing experiments
- Quick Start: This README
- API Reference: Coming in Phase 5
- Configuration Guide: `examples/` directory
- Development Guide: Coming in Phase 5
MIT License - see LICENSE file for details.
Built on top of existing PrivacyBench research codebase with:
- PyTorch Lightning for training infrastructure
- Flower for federated learning
- Opacus for differential privacy
- CodeCarbon for energy tracking
- Weights & Biases for experiment logging
Note: This is Phase 1 implementation focused on CLI foundation and configuration. Actual experiment execution will be added in Phase 3.