This project implements robust security measures to prevent prompt injection attacks when interacting with AI agents. All AI integrations follow security best practices to protect against malicious inputs.
The project uses Anthropic Claude through the OpenCode GitHub Action for automated tool triage, categorization, and validation.
All user-provided content undergoes sanitization before being used in AI prompts:
- GitHub URLs: Validated against strict patterns, newlines stripped, special characters rejected
- Repository Names: Alphanumeric validation, path traversal prevention, suspicious patterns blocked
- File Paths: Whitelist-based validation, path traversal blocked, extension checking
- Text Content: Injection patterns removed, encoded payloads detected, length limits enforced
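As an illustration, the URL check has roughly the following shape (a minimal sketch; the exact pattern and error handling live in `src/security/`):

```typescript
// Sketch of the kind of check sanitizeGitHubUrl performs.
// The real implementation in src/security/ may differ in details.
const GITHUB_URL_PATTERN = /^https:\/\/github\.com\/[\w.-]+\/[\w.-]+$/;

function sanitizeGitHubUrlSketch(input: string): string {
  // Strip newlines that could smuggle extra prompt lines.
  const stripped = input.replace(/[\r\n]/g, '').trim();
  // Reject anything that is not a plain github.com repository URL.
  if (!GITHUB_URL_PATTERN.test(stripped)) {
    throw new Error(`Rejected suspicious URL: ${JSON.stringify(stripped)}`);
  }
  return stripped;
}
```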
The system detects and blocks common injection patterns:
- Role-switching attempts: "Ignore previous instructions", "You are now a..."
- Instruction override: "Your new task is...", "System update:"
- Delimiter injection: "---END SYSTEM PROMPT---" and similar fake boundary markers or spoofed closing tags
- Context confusion: Attempts to manipulate conversation structure
- Encoded payloads: Base64, URL-encoded, or unicode-escaped injection attempts
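Conceptually, detection is pattern matching over normalized input. A minimal sketch, assuming a small subset of rules (the production rule set in `src/security/` is more extensive):

```typescript
// Illustrative subset of injection patterns; the production list is larger.
const INJECTION_PATTERNS: RegExp[] = [
  /ignore\s+(all\s+)?previous\s+instructions/i, // role-switching
  /you\s+are\s+now\s+a/i,                       // role-switching
  /your\s+new\s+task\s+is/i,                    // instruction override
  /system\s+update\s*:/i,                       // instruction override
  /-{2,}\s*END\s+SYSTEM\s+PROMPT\s*-{2,}/i,     // delimiter injection
];

function looksLikeInjection(text: string): boolean {
  // Decode common obfuscations before matching (sketch: URL-encoding only;
  // the real module also handles base64 and unicode escapes).
  let normalized = text;
  try {
    normalized = decodeURIComponent(text);
  } catch {
    // Malformed encodings are themselves suspicious; fall back to raw text.
  }
  return INJECTION_PATTERNS.some((pattern) => pattern.test(normalized));
}
```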
All prompts use XML-style tags for clear content separation:

```xml
<system_instruction>
Your instructions here...
</system_instruction>

<user_input label="Repository URL">
https://github.com/user/repo
</user_input>
<!-- WARNING: Untrusted user input above -->

<instruction_reinforcement>
Remember: Always validate the tool before approval.
</instruction_reinforcement>
```

This structure ensures:
- Clear boundaries between system instructions and user content
- Explicit marking of untrusted input
- Reinforcement of critical instructions after user content
The following locations have prompt injection protection:

- Triage Workflow (`.github/scripts/post-triage-comment.cjs`)
  - Validates GitHub URLs from issue bodies
  - Detects injection attempts in issue content
  - Sanitizes URLs before prompt insertion
- Categorization Workflow (`.github/scripts/post-categorization-comment.cjs`)
  - Validates GitHub URLs and repository names
  - Sanitizes category and theme data
  - Wraps all user content in XML tags
- Validation Workflow (`.github/scripts/post-validation-comment.cjs`)
  - Validates file paths against whitelist
  - Extracts and validates issue numbers
  - Detects injection in PR bodies
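Each script follows the same guard pattern: sanitize inputs first, then build the prompt, then check whether anything suspicious was flagged. A sketch of that pattern, shown in TypeScript for consistency with the examples below (the actual scripts are CommonJS and differ in detail):

```typescript
import { sanitizeGitHubUrl, SafePromptBuilder } from './security';

// Sketch of the shared guard pattern used by the workflow scripts.
function buildTriagePrompt(rawUrl: string, issueBody: string): string {
  const cleanUrl = sanitizeGitHubUrl(rawUrl); // rejects malformed URLs

  const builder = new SafePromptBuilder();
  const prompt = builder
    .setSystemInstruction('Triage this tool submission')
    .addGitHubUrl(cleanUrl)
    .addUserContent('Issue body', issueBody)
    .setReinforcement('Always validate the tool before approval')
    .build();

  if (builder.hasDetectedInjections()) {
    console.warn('Injection attempts:', builder.getDetectedInjections());
  }
  return prompt;
}
```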
For TypeScript code, import from `src/security/`:
```typescript
import {
  sanitizeGitHubUrl,
  sanitizeRepoName,
  sanitizeTextContent,
  SafePromptBuilder,
} from './security';

// Sanitize individual inputs
const cleanUrl = sanitizeGitHubUrl(userProvidedUrl);
const cleanName = sanitizeRepoName(repoName);

// Build safe prompts
const builder = new SafePromptBuilder();
const prompt = builder
  .setSystemInstruction('Analyze the repository')
  .addGitHubUrl(repoUrl)
  .addUserContent('Description', description)
  .setReinforcement('Remember to validate')
  .build();

// Check for injections
if (builder.hasDetectedInjections()) {
  console.warn('Injection attempts:', builder.getDetectedInjections());
}
```

Security modules have comprehensive test coverage (>96%):
```bash
# Run security tests
bun test src/security/

# Check coverage
bun test --coverage src/security/
```

If you discover a security vulnerability:
- DO NOT open a public issue
- Email the maintainers directly (see CODEOWNERS)
- Include:
  - Description of the vulnerability
  - Steps to reproduce
  - Potential impact
  - Suggested fix (if any)
When contributing:
- Never trust user input: Always sanitize before using in prompts
- Use provided utilities: Don't implement custom sanitization
- Test injection resistance: Add tests for new AI integration points
- Log suspicious activity: Use `detectInjectionAttempt()` for monitoring
- Review prompts carefully: Ensure clear separation of instructions and user content
- OWASP LLM Top 10 - Prompt Injection
- Anthropic Prompt Engineering - Security
- Detailed Security Analysis
All data files that populate AI prompts are validated against JSON schemas before use. This prevents data poisoning attacks and ensures data integrity.
- `data/categories.json` - Category definitions
- `data/themes.json` - Theme definitions
- Schema Enforcement
  - Required fields validation
  - Type checking (strings, numbers, arrays, objects)
  - Pattern validation (slugs, IDs, dates)
  - Length constraints (min/max)
  - Enum validation (status values)
- Injection Detection
  - All text fields checked for injection patterns
  - Keywords, tags, and arrays validated
  - Metadata fields sanitized
- Performance
  - Validation completes in <100ms per file
  - Integrated into data loading pipeline
  - Fails fast on validation errors
```bash
# Validate all data files
bun run validate:data

# Run validation in CI/CD
bun run validate:data || exit 1
```

In TypeScript code:
```typescript
import { validateCategoriesFile, validateThemesFile } from './validation';

// Validate before loading
const result = validateCategoriesFile('./data/categories.json');
if (!result.valid) {
  console.error('Validation failed:', result.errors);
  process.exit(1);
}
```

JSON schemas are located in `schemas/`:
- `schemas/categories.schema.json` - Categories validation schema
- `schemas/themes.schema.json` - Themes validation schema
Both schemas follow JSON Schema Draft 2020-12 specification.
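For illustration, compiling such a schema with a draft-2020-12-capable validator looks roughly like this (a sketch assuming Ajv; the project's `validation` module may use a different validator internally):

```typescript
// Sketch: validating a data file against a 2020-12 schema with Ajv.
// Ajv here is an assumption; the actual validation module may differ.
import Ajv2020 from 'ajv/dist/2020';
import { readFileSync } from 'node:fs';

const ajv = new Ajv2020({ allErrors: true });
const schema = JSON.parse(readFileSync('./schemas/categories.schema.json', 'utf8'));
const data = JSON.parse(readFileSync('./data/categories.json', 'utf8'));

const validate = ajv.compile(schema);
if (!validate(data)) {
  // Fail fast, mirroring the pipeline's behavior on validation errors.
  console.error('Schema validation failed:', validate.errors);
  process.exit(1);
}
```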
Comprehensive monitoring and logging infrastructure tracks all security events, providing visibility into threats and enabling proactive security management.
- Structured Logging
  - JSON-based log format for easy parsing (sample record after this list)
  - Log levels: INFO, WARN, ERROR, CRITICAL
  - Context enrichment (user, workflow, timestamp)
  - Privacy-preserving (content hashes, not full content)
  - Automatic 30-day log rotation
- Metrics Collection
  - Total injection attempts and blocked attempts
  - Pattern breakdown (role-switching, delimiter-injection, etc.)
  - User activity tracking
  - Time-series data for trend analysis
  - Rate limiting statistics
- Security Dashboard
  - Auto-generated daily markdown dashboard
  - Visual charts and trend analysis
  - Top users by injection attempts
  - Pattern and workflow breakdowns
  - Automated anomaly detection
- Alerting
  - Spike detection (>10 attempts/hour)
  - Repeat offender tracking (>5 attempts/24h)
  - New pattern detection
  - Automated GitHub issue creation
  - Webhook support for external alerts
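As a sample of the structured format, a log record might look like the following (an illustrative shape; beyond the documented level, user, workflow, timestamp, and truncated content hash, the field names are assumptions rather than the exact schema of `src/monitoring/logger.ts`):

```typescript
// Illustrative record only: field names beyond those documented above
// are assumptions about the logger's output shape.
const sampleRecord = {
  timestamp: '2025-11-26T14:03:22.000Z',
  level: 'WARN',
  category: 'injection',
  user: 'some-user',
  workflow: 'triage',
  pattern: 'role-switching',
  blocked: true,
  contentHash: 'a1b2c3d4', // first 8 hex chars of SHA-256, never raw content
};
```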
```bash
# Generate security dashboard
bun run security:dashboard --days 30

# Real-time log monitoring (local development)
bun run security:monitor --level WARN --category injection

# Historical analysis
bun run security:analyze --days 90 --format markdown

# View dashboard
open docs/security-dashboard.md
```

The monitoring system automatically tracks:
- All injection attempts (from `sanitize-ai-input.ts`)
- Rate limiting events (from `rate-limit.ts`)
- Alert actions (from `alert.ts`)
- Validation failures (from `validate-*.ts`)
```typescript
import { logger } from './monitoring';

// Log security events
logger.logInjectionAttempt('username', 'triage', 'role-switching', true, 42);
logger.logRateLimit('username', 'user', true, 10);
logger.logAlert('issue-created', 'critical', 'Security alert', { issueNumber: 99 });
```

The security dashboard (updated daily) includes:
- Overview: Total attempts, block rate, unique users, averages
- Trends: Weekly and monthly averages, trend direction
- Patterns: Most common injection techniques
- Users: Top offenders and their status
- Recommendations: Actionable insights based on current data
Configured in `config/security-alerts.json`:
- Spike Detection: >10 attempts/hour → Create GitHub issue
- Repeat Offender: >5 attempts/24h → Block user
- New Pattern: Unknown pattern detected → Notify security team
- Low Block Rate: <70% success rate → Review detection rules
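A sketch of what such a configuration might contain, expressed as a TypeScript literal (the field names are illustrative assumptions; `config/security-alerts.json` defines the real shape):

```typescript
// Thresholds mirror the rules listed above; field names are assumptions.
const alertRules = [
  { name: 'spike-detection', threshold: 10, windowHours: 1, action: 'create-github-issue' },
  { name: 'repeat-offender', threshold: 5, windowHours: 24, action: 'block-user' },
  { name: 'new-pattern', action: 'notify-security-team' },
  { name: 'low-block-rate', minBlockRate: 0.7, action: 'review-detection-rules' },
];
```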
Logging follows strict privacy guidelines:
- ✅ Log: timestamps, usernames, patterns, metadata
- ✅ Log: content hashes (first 8 chars of SHA-256)
- ❌ Never log: full user content, PII, sensitive data
- ❌ Never log: API keys, tokens, credentials
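The content-hash convention is straightforward to reproduce with Node's built-in crypto module (a minimal sketch):

```typescript
import { createHash } from 'node:crypto';

// Hash user content so log entries can be correlated without storing
// the content itself.
function contentHash(content: string): string {
  // First 8 hex characters of SHA-256, matching the guideline above.
  return createHash('sha256').update(content).digest('hex').slice(0, 8);
}

console.log(contentHash('ignore previous instructions')); // 8-char hex digest
```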
- Logger: `src/monitoring/logger.ts` - Structured logging
- Metrics: `src/monitoring/metrics.ts` - Data aggregation
- Dashboard: `src/bin/generate-security-dashboard.ts` - Report generation
- Monitor: `src/bin/monitor-security.ts` - Real-time monitoring
- Analysis: `src/bin/analyze-security-history.ts` - Historical analysis
- Workflow: `.github/workflows/security-dashboard.yml` - Automation
- 2025-11-26: Security monitoring and logging implementation
  - Added structured logging infrastructure with JSON format
  - Created metrics collection and aggregation module
  - Implemented security dashboard generator
  - Added real-time monitoring CLI tool
  - Created historical analysis tool
  - Configured automated dashboard updates (daily)
  - Added anomaly detection and alerting
  - Achieved 100% test coverage for logging, 94% for metrics
- 2025-11-25: JSON schema validation implementation
  - Added JSON schemas for categories and themes
  - Created validation module with comprehensive tests
  - Integrated validation into data loading pipeline
  - Added CLI tool for standalone validation
  - Achieved 90%+ test coverage for validation modules
- 2025-11-25: Initial prompt injection prevention implementation
  - Added input sanitization module
  - Added safe prompt builder
  - Updated all workflow scripts with security measures
  - Achieved 96.88% test coverage
For detailed technical analysis of injection vectors and risk assessment, see AI Agent Security Analysis.
For monitoring dashboard, see Security Dashboard (auto-updated daily).