Cub

Work ahead of your AI coding agents, then let them run.

Cub is for developers who are already running AI coding CLIs (Claude Code, Codex, OpenCode) in autonomous mode and want more structure. If you're juggling multiple agent sessions, manually routing work to different models, or finding that fully hands-off agents tend to run amok—Cub helps you work ahead of execution so you can be more hands-off during execution.

Quick Install

curl -LsSf https://install.cub.tools | bash

Then restart your shell (or open a new terminal) and run:

cub init --global

(Already installed? Run pipx upgrade cub or re-run the installer.)

The Problem

AI coding agents in 2026 are powerful. They can operate for hours, produce working code, run tests, and iterate toward production quality. But there's a gap:

Too hands-on: Sitting in an IDE, approving every tool call, staying close to the work
Too hands-off: Letting agents run wild with vague instructions, hoping for the best

Cub finds the balance. You invest time before code starts flying—breaking work into agent-sized tasks, routing complexity to the right models, reviewing the plan—then step back and let execution happen more seamlessly.

Two Main Steps: Prep and Run

Step 1. `cub prep`: Go From a Vision to Structured Tasks

Bring your ideas (a sentence, a spec, a whole design doc) and go through a structured interview to generate clear tasks for an LLM:

Triage — What are we trying to accomplish? What are the goals?
Architect — What makes sense technically? What's the implementation approach?
Plan — Break it into agent-sized chunks with clear acceptance criteria
Bootstrap — Write tasks to your chosen backend (beads or JSON)

The goal: observable, reviewable work before any code is written. No gaps in understanding slip through.

cub prep                    # Run full pipeline
cub triage                  # Or run stages individually
cub architect
cub plan
cub bootstrap

Step 2: `cub run`: Turn Tasks Into Code

Once you have structured tasks, Cub runs the Ralph Wiggum loop—picking ready tasks, generating prompts, invoking your chosen AI harness, and iterating until done or budget exhausted.

cub run                     # Run until complete
cub run --once              # Single iteration
cub run --epic my-feature   # Target specific work

The execution loop handles dependency ordering, failure recovery, git commits, and structured logging. You can watch it stream or check in later.

Key Features

Right Model for the Task

Not everything needs Opus. Cub supports per-task model selection:

bd label add cub-abc model:haiku     # Simple rename, use fast model
bd label add cub-xyz model:sonnet    # Medium complexity
bd label add cub-123 model:opus      # Complex architecture work

Route simple refactoring to Haiku, medium tasks to Sonnet, reserve Opus for planning and complex work. Manage tokens as a resource.

Multi-Harness Flexibility

Cub abstracts across multiple AI coding CLIs:

Claude Code — General coding, complex refactoring (default)
OpenAI Codex — Quick fixes, OpenAI ecosystem
Google Gemini — Alternative perspective
OpenCode — Open-source option

Each harness evolves rapidly. New capabilities emerge in one that may not exist in others. Cub lets you use the right tool without vendor lock-in.

cub run --harness claude    # Explicit selection
cub run --harness codex

Deterministic Control Layer

Building outside any single harness means the core loop—task selection, success/failure detection, retry logic, state transitions—runs as traditional software, not LLM inference. This enables:

Reliable hooks: Email when a task completes, not "hopefully the agent remembers"
Consistent logging: Structured JSONL, not scattered console output
Predictable budgets: Hard limits that actually stop execution

Features

Autonomous Loop: Runs until all tasks complete or budget exhausted
Dependency Tracking: Respects task dependencies, picks ready tasks
Priority Scheduling: P0-P4 priority-based task selection
Epic/Label Filtering: Target specific epics or labeled tasks
Budget Management: Token tracking with configurable limits and warnings
Guardrails: Iteration limits, secret redaction, safety controls
Failure Handling: Configurable modes (stop, move-on, retry, triage)
Session Management: Named sessions, artifact bundles per task
Git Workflow: Auto-branching, commit per task, clean state enforcement
Hooks System: Custom scripts at 5 lifecycle points
Structured Logging: JSONL logs with timestamps, durations, git SHAs
Dual Task Backend: Use beads CLI or simple JSON file
Streaming Output: Watch agent activity in real-time

Prerequisites

Python 3.10+ (required)
Harness (Claude and others):
- Claude Code CLI (claude) - Required for cub prep, recommended for cub run.
- OpenAI Codex CLI (codex)
- Google Gemini CLI (gemini)
- OpenCode CLI (opencode)
Task Backend (optional):
- beads CLI (bd) - For advanced task management

Installation

One-Liner (Recommended)

curl -LsSf https://install.cub.tools | bash

This will:

Install cub via pipx (installing pipx if needed)
Add cub to your PATH
Run cub init --global to set up config directories

Restart your shell after installation.

Alternative Methods

Using pipx manually

pipx install git+https://github.com/lavallee/cub.git
cub init --global

Using uv

uv tool install git+https://github.com/lavallee/cub.git
cub init --global

From source (for development)

git clone https://github.com/lavallee/cub ~/tools/cub
cd ~/tools/cub
uv sync  # or: python3.10 -m venv .venv && source .venv/bin/activate && pip install -e .
export PATH="$HOME/tools/cub/.venv/bin:$PATH"
cub init --global

Add the PATH export to your ~/.bashrc or ~/.zshrc.

Quick Start

cd my-project
cub init                    # Initialize project

Path A: Start with Prep (Recommended)

When starting new work, use the prep pipeline to turn your ideas into structured tasks:

# Run the full prep interview
cub prep

# Or run stages individually for more control
cub triage      # Clarify goals and requirements
cub architect   # Design technical approach
cub plan        # Break into agent-sized tasks
cub bootstrap   # Write tasks to backend

Path B: Create Tasks Directly

If you already know what needs doing:

# Using beads (recommended)
bd init
bd create "Implement user authentication" --type feature --priority 2
bd create "Add login form" --type task --priority 2

# Or edit prd.json directly (JSON backend)

Run the Loop

cub status              # Check what's ready
cub run                 # Run until complete
cub run --once          # Single iteration
cub run --epic my-epic  # Target specific work
cub run --stream        # Watch in real-time

Upgrading from v0.20 (Bash)? See UPGRADING.md for migration guide.

Usage

Prep Commands (Vision → Tasks)

# Full prep pipeline
cub prep                    # Run triage → architect → plan → bootstrap

# Individual stages (for more control)
cub triage                  # Clarify requirements and goals
cub architect               # Design technical implementation
cub plan                    # Break into agent-sized tasks
cub bootstrap               # Write tasks to backend

# Manage prep sessions
cub sessions                # List prep sessions
cub interview <task-id>     # Deep-dive on a specific task
cub interview --all --auto  # Batch interview all open tasks

Run Commands (Tasks → Code)

# Execute the loop
cub run                     # Run until all tasks complete
cub run --once              # Single iteration
cub run --ready             # Show ready (unblocked) tasks
cub run --plan              # Run planning mode
cub run --name myname       # Custom session name

# Filtering
cub run --epic <id>         # Target tasks within a specific epic
cub run --label <name>      # Target tasks with a specific label
cub run --epic cub-1gq --label phase-1  # Combine filters

# Harness selection
cub run --harness claude    # Use Claude Code (default)
cub run --harness codex     # Use OpenAI Codex CLI
cub run --harness gemini    # Use Google Gemini
cub run --harness opencode  # Use OpenCode

# Output modes
cub run --stream            # Stream harness activity in real-time
cub run --debug             # Enable verbose debug logging

Other Commands

# Setup
cub init                    # Initialize current project
cub init --global           # Set up global config

# Status and inspection
cub status                  # Show task progress
cub status --json           # JSON output for scripting
cub explain <task-id>       # Show full task details
cub artifacts               # List task outputs

# Git workflow
cub branch <epic-id>        # Create branch bound to epic
cub branches                # List branch-epic bindings
cub pr <epic-id>            # Create pull request for epic

# Utilities
cub validate                # Check beads state and config
cub doctor                  # Diagnose configuration issues
cub --migrate-to-beads      # Migrate prd.json to beads

Note: Running cub without a subcommand defaults to cub run.

Project Structure

After running cub init, your project will have:

my-project/
├── prd.json        # Task backlog (beads-style format)
├── PROMPT.md       # Loop prompt template (system instructions)
├── AGENT.md        # Build/run instructions for the agent
├── AGENTS.md       # Symlink to AGENT.md (for Codex compatibility)
├── progress.txt    # Session learnings (agent appends)
├── fix_plan.md     # Discovered issues and plans
├── specs/          # Detailed specifications
└── .cub/          # Cub runtime data (created during runs)
    ├── hooks/      # Project-specific hooks
    └── runs/       # Run artifacts and task outputs

Artifacts Directory

Each cub run creates artifacts in .cub/runs/{session-id}/:

.cub/runs/porcupine-20260111-114543/
├── run.json                    # Run metadata and config
└── tasks/
    └── cub-abc/
        ├── task.json           # Task execution details
        ├── summary.md          # AI-generated summary
        └── changes.patch       # Git diff of changes

View artifacts with:

cub artifacts                  # List all artifacts
cub artifacts cub-abc         # Show specific task artifacts

Task Backends

Cub supports two task management backends:

JSON Backend (Default)

Simple file-based task management using prd.json:

{
  "projectName": "my-project",
  "prefix": "myproj",
  "tasks": [
    {
      "id": "myproj-a1b2",
      "type": "feature",
      "title": "User authentication",
      "description": "Implement login functionality",
      "acceptanceCriteria": ["Login form renders", "Tests pass"],
      "priority": "P1",
      "status": "open",
      "dependsOn": [],
      "notes": ""
    }
  ]
}

Beads Backend

For projects using the beads CLI:

# Install beads
brew install steveyegge/beads/bd

# Initialize in project
bd init

# Cub auto-detects .beads/ directory
cub status  # Uses beads backend automatically

Task Fields

Field	Description
`id`	Unique identifier (prefix + hash, e.g., `prd-a1b2`)
`type`	`epic`, `feature`, `task`, `bug`, `chore`
`title`	Short description
`description`	Full details, can use user story format
`acceptanceCriteria`	Array of verifiable conditions
`priority`	P0 (critical) to P4 (backlog)
`status`	`open`, `in_progress`, `closed`
`dependsOn`	Array of task IDs that must be closed first
`parent`	(Optional) Parent epic ID
`labels`	(Optional) Array of labels for filtering and model selection
`notes`	Agent-maintained notes

Per-Task Model Selection

Tasks can specify which Claude model to use via a model: label:

# In beads:
bd label add cub-abc model:haiku     # Use fast model for simple tasks
bd label add cub-xyz model:sonnet    # Use balanced model for complex tasks
bd label add cub-123 model:opus-4.5  # Use most capable model for hard tasks

In JSON backend, add labels to the task:

{
  "id": "prd-abc",
  "title": "Quick fix",
  "labels": ["model:haiku", "phase-1"]
}

When cub picks up a task with a model: label, it automatically sets CUB_MODEL to pass to the Claude harness.

Task Selection Algorithm

Find tasks where status == "open"
Filter to tasks where all dependsOn items are closed
Sort by priority (P0 first)
Pick the first one

AI Harnesses

Cub abstracts the AI coding CLI into a "harness" layer, supporting multiple backends.

For detailed capability matrix and technical reference, see docs/HARNESSES.md.

Claude Code (Default)

cub --harness claude
# or
export HARNESS=claude

Uses Claude Code's --append-system-prompt for clean prompt separation.

OpenAI Codex

cub --harness codex
# or
export HARNESS=codex

Uses Codex's --full-auto mode with combined prompts.

Google Gemini

cub --harness gemini
# or
export HARNESS=gemini

Uses Gemini CLI's -y (YOLO mode) for autonomous operation.

OpenCode

cub --harness opencode
# or
export HARNESS=opencode

Uses OpenCode's run subcommand with JSON output for token tracking.

Auto-Detection

By default, cub auto-detects available harnesses using this priority order:

Explicit HARNESS setting (CLI flag --harness or env var HARNESS)
Config priority array (harness.priority in config file)
Default detection order: claude > opencode > codex > gemini

Configuration Example

You can customize the harness priority in .cub.json or global config:

{
  "harness": {
    "priority": ["gemini", "claude", "codex", "opencode"]
  }
}

Cub will try each harness in order and use the first one available. If none are found, it falls back to the default order.

Budget Management

Cub provides token budget tracking to control AI API costs and prevent runaway spending.

How It Works

Cub tracks token usage across all tasks and enforces budget limits:

Per-task tracking: Each harness reports tokens used (where available)
Cumulative tracking: Total tokens tracked per session in logs
Warning threshold: Alert when budget usage reaches a configurable percentage
Hard limit: Loop exits when budget is exceeded

Budget Configuration

Set budget in your config file or via environment variable:

Global config (~/.config/cub/config.json):

{
  "budget": {
    "default": 1000000,
    "warn_at": 0.8
  }
}

Project override (.cub.json):

{
  "budget": {
    "default": 500000,
    "warn_at": 0.75
  }
}

Environment variable:

export CUB_BUDGET=2000000  # Overrides both config files
cub

Budget Parameters

Parameter	Default	Description
`budget.default`	1,000,000	Token budget limit per session
`budget.warn_at`	0.8	Warn when usage reaches this % (0.0-1.0)

Common Budget Examples

For development/testing (small projects):

export CUB_BUDGET=100000  # 100k tokens
cub

For medium projects (most use cases):

export CUB_BUDGET=1000000  # 1M tokens (default)
cub

For large projects (extensive refactoring):

export CUB_BUDGET=5000000  # 5M tokens
cub

For multi-day sessions:

# Set higher budget if running multiple iterations
export CUB_BUDGET=10000000  # 10M tokens
cub --max-iterations 200

Monitoring Budget Usage

Check token usage in structured logs:

# View all budget warnings
jq 'select(.event_type=="budget_warning")' ~/.local/share/cub/logs/myproject/*.jsonl

# Track total tokens per session
jq -s '[.[].data.tokens_used // 0] | add' ~/.local/share/cub/logs/myproject/*.jsonl

# Find high-cost tasks
jq 'select(.data.tokens_used > 10000)' ~/.local/share/cub/logs/myproject/*.jsonl

Guardrails

Cub includes safety guardrails to prevent runaway loops and protect sensitive information.

Iteration Limits

{
  "guardrails": {
    "max_task_iterations": 3,    // Max retries per task
    "max_run_iterations": 50,    // Max total iterations per run
    "iteration_warning_threshold": 0.8  // Warn at 80% of limit
  }
}

When a task exceeds max_task_iterations, it's marked as failed and skipped. When a run exceeds max_run_iterations, the entire run stops.

Secret Redaction

Cub automatically redacts sensitive patterns in logs and debug output:

{
  "guardrails": {
    "secret_patterns": [
      "api[_-]?key",
      "password",
      "token",
      "secret",
      "authorization",
      "credentials"
    ]
  }
}

Add custom patterns for project-specific secrets.

Failure Handling

Cub provides configurable failure handling modes:

{
  "failure": {
    "mode": "retry",
    "max_retries": 3
  }
}

Failure Modes

Mode	Behavior
`stop`	Stop immediately on first failure
`move-on`	Mark task failed, continue to next task
`retry`	Retry task with failure context (up to max_retries)
`triage`	(Future) Human-in-the-loop intervention

Failure Context

When using retry mode, subsequent attempts include context about what failed:

## Previous Attempt Failed
Exit code: 1
Error: Test failures in auth_test.py

Please fix the issues and try again.

Session Management

Each cub run creates a unique session with an auto-generated animal name:

# Auto-generated session name
cub run    # Creates: porcupine-20260111-114543

# Custom session name
cub run --name release-1.0    # Creates: release-1.0-20260111-114543

Session names are used for:

Git branch naming: cub/{session}/{timestamp}
Artifact directories: .cub/runs/{session}/
Log identification

Session Assignment

Tasks can be assigned to specific sessions (useful for parallel work):

# With beads backend
bd assign cub-abc porcupine

# View task assignment
bd show cub-abc | grep Assignee

Git Workflow

Cub follows a disciplined git workflow:

Branch Per Run

Each run creates a feature branch (when using the auto-branch hook):

main
└── cub/porcupine/20260111-114543

Commit Per Task

The AI commits after each completed task with a structured message:

task(cub-abc): Implement user authentication

- Added login form component
- Created auth API endpoints
- Added tests for auth flow

Task-Id: cub-abc
Co-Authored-By: Claude Sonnet <noreply@anthropic.com>

Clean State Enforcement

Cub verifies clean git state before and after tasks:

{
  "clean_state": {
    "require_commit": true,   // Require all changes committed
    "require_tests": false    // Optionally require tests pass
  }
}

Override via CLI:

cub run --require-clean      # Force clean state check
cub run --no-require-clean   # Disable clean state check

Environment Variables

Variable	Default	Description
`CUB_PROJECT_DIR`	`$(pwd)`	Project directory
`CUB_MAX_ITERATIONS`	`100`	Max loop iterations
`CUB_DEBUG`	`false`	Enable debug mode
`CUB_STREAM`	`false`	Enable streaming output
`CUB_BACKEND`	`auto`	Task backend: `auto`, `beads`, `json`
`CUB_EPIC`		Filter to tasks within this epic ID
`CUB_LABEL`		Filter to tasks with this label
`CUB_MODEL`		Override model for Claude harness
`CUB_BUDGET`		Override token budget (overrides config)
`HARNESS`	`auto`	AI harness: `auto`, `claude`, `codex`, `opencode`, `gemini`
`CLAUDE_FLAGS`		Extra flags for Claude Code
`CODEX_FLAGS`		Extra flags for Codex CLI
`GEMINI_FLAGS`		Extra flags for Gemini CLI
`OPENCODE_FLAGS`		Extra flags for OpenCode CLI

Configuration

Cub uses XDG-compliant configuration with global and project-level overrides.

For a complete reference of all configuration options, see docs/CONFIG.md.

Global Setup

cub init --global

Creates:

~/.config/cub/config.json - Global configuration
~/.config/cub/hooks/ - Hook directories
~/.local/share/cub/logs/ - Log storage
~/.cache/cub/ - Cache directory

Configuration Precedence

CLI flags (highest priority)
Environment variables
Project config (.cub.json in project root)
Global config (~/.config/cub/config.json)
Hardcoded defaults (lowest priority)

Config File Format

{
  "harness": {
    "default": "auto",
    "priority": ["claude", "codex"]
  },
  "budget": {
    "default": 1000000,
    "warn_at": 0.8
  },
  "loop": {
    "max_iterations": 100
  },
  "clean_state": {
    "require_commit": true,
    "require_tests": false
  },
  "hooks": {
    "enabled": true
  }
}

Project Override

Create .cub.json in your project root to override global settings:

{
  "budget": {
    "default": 500000
  },
  "loop": {
    "max_iterations": 50
  }
}

Structured Logging

Cub logs all task executions in JSONL format for debugging and analytics.

Log Location

~/.local/share/cub/logs/{project}/{session}.jsonl

Session ID format: YYYYMMDD-HHMMSS (e.g., 20260109-214858)

Log Events

Each task produces structured events:

{"timestamp":"2026-01-09T21:48:58Z","event_type":"task_start","data":{"task_id":"cub-abc","task_title":"Fix bug","harness":"claude"}}
{"timestamp":"2026-01-09T21:52:30Z","event_type":"task_end","data":{"task_id":"cub-abc","exit_code":0,"duration":212,"tokens_used":0,"git_sha":"abc123..."}}

Querying Logs

# Find all task starts
jq 'select(.event_type=="task_start")' ~/.local/share/cub/logs/myproject/*.jsonl

# Find failed tasks
jq 'select(.event_type=="task_end" and .data.exit_code != 0)' logs/*.jsonl

# Calculate total duration
jq -s '[.[].data.duration // 0] | add' logs/*.jsonl

Hooks

Cub provides a flexible hook system to integrate with external services and tools. Hooks are executable scripts that run at specific points in the cub lifecycle.

Hook Lifecycle

The hook execution flow through a typical cub session:

┌─────────────────────────────────────────────────┐
│                   cub Start                     │
└──────────────────┬──────────────────────────────┘
                   │
                   ▼
            ┌──────────────┐
            │ pre-loop ✓   │  (setup, initialization)
            └──────────────┘
                   │
                   ▼
        ┌──────────────────────┐
        │  Main Loop Starts    │
        └──────┬───────────────┘
               │
        ┌──────▼──────────┐
        │ pre-task ✓      │  (for each task)
        └────────┬────────┘
                 │
                 ▼
          ┌─────────────────┐
          │ Execute Task    │
          │  (harness)      │
          └────────┬────────┘
                   │
        ┌──────────┴──────────┐
        │                     │
        ▼                     ▼
   ┌──────────┐         ┌──────────┐
   │ Success  │         │ Failure  │
   └────┬─────┘         └────┬─────┘
        │                    │
        │              ┌─────▼──────┐
        │              │ on-error ✓ │  (alert, logs)
        │              └─────┬──────┘
        │                    │
        └────────┬───────────┘
                 │
                 ▼
           ┌────────────────┐
           │ post-task ✓    │  (metrics, notify)
           └────────┬───────┘
                    │
        ┌───────────┴──────────┐
        │                      │
        ▼                      ▼
    ┌────────┐           ┌──────────┐
    │ More   │           │ All Done │
    │ Tasks? │           └──────┬───┘
    └───┬────┘                  │
        │ yes                   │
        ▼                       │
   (Loop Back)                  │
        │                       │
        └───────────────────────┘
                   │
                   ▼
            ┌──────────────┐
            │ post-loop ✓  │  (cleanup, reports)
            └──────────────┘
                   │
                   ▼
            ┌──────────────┐
            │  Exit Loop   │
            └──────────────┘

Hook Points

Cub supports five hook points:

Hook	When It Runs	Use Cases
`pre-loop`	Before starting the main loop	Setup, initialization, cleanup from previous run
`pre-task`	Before each task execution	Prepare environment, start timers
`post-task`	After each task (success or failure)	Notifications, metrics, logging
`on-error`	When a task fails	Alerts, incident creation, diagnostics
`post-loop`	After the main loop completes	Cleanup, final notifications, reports

Hook Locations

Hooks are discovered from two locations (in order):

Global hooks: ~/.config/cub/hooks/{hook-name}.d/ - Available to all projects
Project hooks: ./.cub/hooks/{hook-name}.d/ - Specific to a project

All executable files in these directories are run in sorted order (alphabetically).

Context Variables

All hooks receive context via environment variables:

Variable	Available In	Description
`CUB_HOOK_NAME`	All	Name of the hook being executed
`CUB_PROJECT_DIR`	All	Project directory
`CUB_SESSION_ID`	pre-loop, post-loop	Unique session identifier
`CUB_HARNESS`	pre-loop, post-loop	Harness in use (claude, codex, etc.)
`CUB_TASK_ID`	pre-task, post-task, on-error	ID of the current task
`CUB_TASK_TITLE`	pre-task, post-task, on-error	Title of the current task
`CUB_EXIT_CODE`	post-task, on-error	Exit code from task execution (0 = success)

Example Hooks

Cub includes example hooks for common integrations:

examples/hooks/post-task/slack-notify.sh - Posts task completion to Slack
examples/hooks/post-loop/datadog-metric.sh - Sends metrics to Datadog
examples/hooks/on-error/pagerduty-alert.sh - Creates PagerDuty incidents on failure

To install an example hook:

# Copy to global hooks directory
mkdir -p ~/.config/cub/hooks/{post-task,post-loop,on-error}.d
cp examples/hooks/post-task/slack-notify.sh ~/.config/cub/hooks/post-task.d/01-slack.sh
chmod +x ~/.config/cub/hooks/post-task.d/01-slack.sh

# Or to project-specific hooks
mkdir -p .cub/hooks/post-task.d
cp examples/hooks/post-task/slack-notify.sh .cub/hooks/post-task.d/01-slack.sh
chmod +x .cub/hooks/post-task.d/01-slack.sh

Each example script includes detailed installation and configuration instructions.

Writing Custom Hooks

Creating a hook is simple - just write a bash script:

#!/usr/bin/env bash
# Example hook script

# Hooks receive context as environment variables
echo "Task $CUB_TASK_ID completed with exit code $CUB_EXIT_CODE"

# Exit with 0 for success, non-zero for failure
exit 0

Requirements:

Script must be executable (chmod +x)
Script must exit with status 0 (success) or non-zero (failure)
Script should handle missing environment variables gracefully
Hook failures are logged but don't stop the loop by default (unless hooks.fail_fast is enabled in config)

Configuration

Hook behavior is controlled in your config file:

{
  "hooks": {
    "enabled": true,
    "fail_fast": false
  }
}

Option	Default	Description
`hooks.enabled`	`true`	Enable/disable all hooks
`hooks.fail_fast`	`false`	Stop loop if a hook fails (true) or continue (false)

How It Works

The Loop

┌──────────────────────────────────────────┐
│                 cub                      │
│                                           │
│  Tasks ────▶ Find Ready Task             │
│                     │                     │
│                     ▼                     │
│              Generate Prompt              │
│                     │                     │
│                     ▼                     │
│           AI Harness (claude/codex)       │
│                     │                     │
│                     ▼                     │
│              Task Complete?               │
│                /        \                 │
│               ▼          ▼                │
│            Loop        Done               │
└──────────────────────────────────────────┘

Prompt Structure

Cub generates two prompts for each iteration:

System Prompt (from PROMPT.md): Static instructions about workflow, rules, and completion signals
Task Prompt: Current task details including ID, description, and acceptance criteria

Feedback Loops

The agent runs these before committing:

Type checking (tsc, mypy, etc.)
Tests (jest, pytest, etc.)
Linting (eslint, ruff, etc.)
Build (if applicable)

If any fail, the agent must fix before proceeding.

Completion Signal

When all tasks have status: "closed", the agent outputs:

<promise>COMPLETE</promise>

This signals cub to exit the loop.

Advanced Usage

Streaming Mode

Watch agent activity in real-time:

cub run --stream

Shows tool calls, responses, and costs as they happen.

Debug Mode

Get verbose output for troubleshooting:

cub run --debug --once

Includes:

Full prompts being sent
Full harness command line (for copy-paste debugging)
Task selection details
Timing information
Acceptance criteria logging
Saves prompts to temp files

Planning Mode

Analyze codebase and update fix_plan.md:

cub run --plan

Uses parallel subagents to study code, find TODOs, and document issues.

Migrating to Beads

Convert existing prd.json to beads format:

# Preview what would happen
cub --migrate-to-beads-dry-run

# Perform migration
cub --migrate-to-beads

Tips

Task Sizing

Keep tasks small enough to complete in one iteration (~one context window). If a task feels big, break it into subtasks.

Specifications

The more detailed your specs, the better the output. Put them in specs/ and reference them in task descriptions.

Progress Memory

The agent appends to progress.txt after each task. This creates memory across iterations - patterns discovered, gotchas encountered.

Recovery

If the codebase gets into a broken state:

git reset --hard HEAD~1  # Undo last commit
cub                      # Restart loop

Choosing a Harness

Harness	Best For
Claude Code	General coding, complex refactoring, multi-file changes
Codex	Quick fixes, OpenAI ecosystem projects

Source Code Reference

Module	Purpose
`src/cub/cli/`	Typer CLI subcommands (run, status, init, prep commands)
`src/cub/core/config.py`	Configuration loading and merging
`src/cub/core/models.py`	Pydantic data models (Task, Config, etc.)
`src/cub/core/tasks/`	Task backends (beads, JSON)
`src/cub/core/harness/`	AI harness backends (Claude, Codex, Gemini, OpenCode)
`src/cub/core/logger.py`	Structured JSONL logging
`templates/PROMPT.md`	Default system prompt
`templates/AGENT.md`	Default agent instructions

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 600 Commits
.beads		.beads
.claude/commands		.claude/commands
.cub		.cub
.github/workflows		.github/workflows
docker		docker
docs		docs
examples/hooks		examples/hooks
scripts		scripts
specs		specs
src/cub		src/cub
templates		templates
tests		tests
.coverage		.coverage
.dockerignore		.dockerignore
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
AGENT.md		AGENT.md
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
PROMPT.md		PROMPT.md
README.md		README.md
ROADMAP.md		ROADMAP.md
coverage.json		coverage.json
plan_review.md		plan_review.md
progress.txt		progress.txt
pyproject.toml		pyproject.toml
uv.lock		uv.lock

License

lavallee/cub

Folders and files

Latest commit

History

Repository files navigation

Cub

Quick Install

The Problem

Two Main Steps: Prep and Run

Step 1. cub prep: Go From a Vision to Structured Tasks

Step 2: cub run: Turn Tasks Into Code

Key Features

Right Model for the Task

Multi-Harness Flexibility

Deterministic Control Layer

Features

Prerequisites

Installation

One-Liner (Recommended)

Alternative Methods

Quick Start

Path A: Start with Prep (Recommended)

Path B: Create Tasks Directly

Run the Loop

Usage

Prep Commands (Vision → Tasks)

Run Commands (Tasks → Code)

Other Commands

Project Structure

Artifacts Directory

Task Backends

JSON Backend (Default)

Beads Backend

Task Fields

Per-Task Model Selection

Task Selection Algorithm

AI Harnesses

Claude Code (Default)

OpenAI Codex

Google Gemini

OpenCode

Auto-Detection

Configuration Example

Budget Management

How It Works

Budget Configuration

Budget Parameters

Common Budget Examples

Monitoring Budget Usage

Guardrails

Iteration Limits

Secret Redaction

Failure Handling

Failure Modes

Failure Context

Session Management

Session Assignment

Git Workflow

Branch Per Run

Commit Per Task

Clean State Enforcement

Environment Variables

Configuration

Global Setup

Configuration Precedence

Config File Format

Project Override

Structured Logging

Log Location

Log Events

Querying Logs

Hooks

Hook Lifecycle

Hook Points

Hook Locations

Context Variables

Example Hooks

Writing Custom Hooks

Configuration

Step 1. `cub prep`: Go From a Vision to Structured Tasks

Step 2: `cub run`: Turn Tasks Into Code

Packages