🦞 RedPincer

AI/LLM Red Team Suite

Point RedPincer at any LLM API endpoint, select your attack modules, and run automated red team assessments with real-time streaming results, heuristic analysis, and exportable reports.

Warning

RedPincer is designed for authorized security testing and research only. Use it to audit AI systems you own or have explicit permission to test. Do not use this tool against systems without authorization.

✨ Features

🎯 Attack Engine

160+ Attack Payloads across 4 categories
Model-Specific Attacks for GPT, Claude, Llama
20 Variant Transforms (unicode, encoding, case, etc.)
Attack Chaining with template variables
AI-Powered Payload Generation via target LLM
Stop/Cancel running attacks instantly

📊 Analysis & Reporting

Heuristic Response Classifier with context-aware analysis
Vulnerability Heatmap — visual category x severity matrix
Custom Scoring Rubrics with weighted grades (A+ to F)
Verbose Pen-Test Reports with 10 sections + appendices
Multi-Target Comparison — side-by-side profiles
Regression Testing — track fixes over time

Core Capabilities

Category	Payloads	Description
💉 Prompt Injection	40	Instruction override, delimiter confusion, indirect injection, payload smuggling
🔓 Jailbreak	40	Persona splitting, gradual escalation, hypothetical framing, roleplay exploitation
🔍 Data Extraction	40	System prompt theft, training data probing, membership inference, embedding extraction
🛡️ Guardrail Bypass	40	Output filter evasion, multi-language bypass, homoglyph tricks, context overflow

Multi-Provider Support

OpenAI  ·  Anthropic  ·  OpenRouter  ·  Any OpenAI-compatible endpoint

🚀 What's New in v0.3

Bug Fixes

Auto-fetch models — Select from available models via dropdown after entering API key
Edit/delete targets — Full CRUD on saved LLM targets
Reduced false positives — Context-aware analysis detects "explain then refuse" patterns
Stop button — Cancel running attacks with AbortController
Verbose reports — 10-section professional pen-test quality reports

New Features

✨ AI Payload Generation — Use the target LLM to generate novel attack payloads
🧠 Adaptive Attack Engine — Analyzes weaknesses and suggests targeted follow-ups
📈 Multi-Target Comparison — Run same payloads against multiple models
🗺️ Vulnerability Heatmap — Visual matrix of success rates
🔁 Regression Testing — Save baselines, detect patched/new vulnerabilities
✏️ Custom Scoring Rubrics — Weighted criteria with letter grades
60 new payloads — Now 160 total (40 per category)

⚡ Quick Start

# Clone the repository
git clone https://github.com/rustyorb/pincer.git
cd pincer

# Install dependencies
npm ci

# Start development server
npm run dev

Open http://localhost:3000 to access the dashboard.

Build for Production

npm run build
npm start

🎮 Usage

Getting Started

graph LR
    A[Configure Target] --> B[Select Categories]
    B --> C[Run Attack]
    C --> D[Review Results]
    D --> E[Generate Report]
    D --> F[Run Adaptive Follow-up]
    E --> G[Export Markdown]

Configure a Target — Add an LLM endpoint with provider, API key, and model (auto-fetched)
Select Attack Categories — Check the categories to test
Run Attack — Hit RUN to stream attacks; hit STOP to cancel anytime
Review Results — Analyze with heuristic classification, severity scores, and leaked data highlights
Generate Report — Export comprehensive findings as Markdown

Advanced Tools

Tool	Description
Compare	Run same payloads against 2-4 targets simultaneously
Adaptive	Analyze weaknesses from a run, generate targeted follow-ups
Heatmap	Visual matrix of vulnerability rates by category and severity
Regression	Save baseline results, re-run later to detect fixes or regressions
Scoring	Define custom rubrics with weighted category/severity/classification scores
Chains	Build multi-step attacks with `{{previous_response}}` template variables
Payload Editor	Create custom payloads with syntax highlighting + AI generation

🏗️ Architecture

Data Flow

Target Config ──> POST /api/attack ──> NDJSON Stream ──> Heuristic Analysis ──> Zustand Store
                                                                                     │
                                                                              localStorage

All components are client-side ("use client") — no server components
Single-page layout — page.tsx switches views based on store.view
NDJSON streaming — real-time results from API routes
Heuristic analysis — pattern-matching classifier (no LLM-based grading)
Zustand + persist — state synced to localStorage

API Routes

Route	Method	Description
`/api/attack`	POST	Streams attack results as NDJSON
`/api/chain`	POST	Executes multi-step attack chains
`/api/test-connection`	POST	Validates endpoint connectivity
`/api/models`	POST	Fetches available models from provider
`/api/generate-payload`	POST	AI-powered payload generation

📂 Project Structure

src/
├── app/
│   ├── page.tsx                       # Main app with 12-view routing
│   ├── layout.tsx                     # Root layout + fonts
│   ├── globals.css                    # Tailwind + OKLCH color tokens
│   └── api/
│       ├── attack/route.ts            # Attack streaming (NDJSON)
│       ├── chain/route.ts             # Chain execution
│       ├── test-connection/route.ts   # Connection validation
│       ├── models/route.ts            # Model list fetching
│       └── generate-payload/route.ts  # AI payload generation
├── components/
│   ├── sidebar.tsx                    # Navigation + targets + run/stop
│   ├── target-config.tsx              # Target CRUD + model dropdown
│   ├── attack-modules.tsx             # Payload browser
│   ├── results-dashboard.tsx          # Results + analysis display
│   ├── report-generator.tsx           # Verbose report export
│   ├── chain-builder.tsx              # Multi-step chain editor
│   ├── session-manager.tsx            # Export/import sessions
│   ├── payload-editor.tsx             # Custom payloads + AI generation
│   ├── comparison-dashboard.tsx       # Multi-target comparison
│   ├── adaptive-runner.tsx            # Adaptive follow-up attacks
│   ├── vulnerability-heatmap.tsx      # Category × severity heatmap
│   ├── regression-runner.tsx          # Baseline regression testing
│   ├── scoring-config.tsx             # Custom scoring rubrics
│   └── ui/                            # shadcn/ui components
└── lib/
    ├── store.ts                       # Zustand store (persisted)
    ├── types.ts                       # TypeScript interfaces
    ├── llm-client.ts                  # Multi-provider LLM client
    ├── analysis.ts                    # Context-aware heuristic engine
    ├── adaptive.ts                    # Weakness analysis + follow-ups
    ├── scoring.ts                     # Custom scoring rubric engine
    ├── chains.ts                      # Attack chain definitions
    ├── variants.ts                    # 20 payload transforms
    ├── persistence.ts                 # Session export/import
    └── attacks/
        ├── index.ts                   # Payload aggregation + queries
        ├── injection.ts               # 40 prompt injection payloads
        ├── jailbreak.ts               # 40 jailbreak payloads
        ├── extraction.ts              # 40 data extraction payloads
        └── bypass.ts                  # 40 guardrail bypass payloads

🛠️ Tech Stack

Layer	Technology
Framework	Next.js 16 (App Router + Turbopack)
UI	React 19 + Tailwind CSS 4 + shadcn/ui
Language	TypeScript (strict mode)
State	Zustand 5 with persist middleware
Icons	Lucide React
Toasts	Sonner
Theme	Dark mode with custom OKLCH color tokens

📄 License

MIT — see LICENSE file for details.

Built for authorized AI security research and red teaming.

🦞 RedPincer — crack open those guardrails

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
public		public
src		src
.dockerignore		.dockerignore
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
RELEASE_NOTES.md		RELEASE_NOTES.md
components.json		components.json
eslint.config.mjs		eslint.config.mjs
next.config.ts		next.config.ts
package-lock.json		package-lock.json
package.json		package.json
postcss.config.mjs		postcss.config.mjs
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🦞 RedPincer

Table of Contents

✨ Features

🎯 Attack Engine

📊 Analysis & Reporting

Core Capabilities

Multi-Provider Support

🚀 What's New in v0.3

⚡ Quick Start

Build for Production

🎮 Usage

Getting Started

Advanced Tools

🏗️ Architecture

Data Flow

API Routes

📂 Project Structure

🛠️ Tech Stack

📄 License

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

License

rustyorb/pincer

Folders and files

Latest commit

History

Repository files navigation

🦞 RedPincer

Table of Contents

✨ Features

🎯 Attack Engine

📊 Analysis & Reporting

Core Capabilities

Multi-Provider Support

🚀 What's New in v0.3

⚡ Quick Start

Build for Production

🎮 Usage

Getting Started

Advanced Tools

🏗️ Architecture

Data Flow

API Routes

📂 Project Structure

🛠️ Tech Stack

📄 License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages