PromptShield

A modular Python middleware system that intercepts user prompts before they reach large language models (LLMs) and intelligently filters, classifies, and routes queries to reduce wasted compute resources and costs.

Features

Prompt Classification: Hybrid rules + ML model approach to classify prompts as nonsense, spam, repeat, low-cost, or valuable
Query Routing: Intelligent routing to block, cache, or send to appropriate model based on classification
Cost and Usage Analytics: Track metrics on blocked prompts, model usage, and estimated cost savings
Flexible Deployment: Works with both closed API models and self-hosted open-weight models
Developer Integrations: Python SDK and CLI tools for easy integration

Supported Models

Closed API Models

OpenAI GPT-3.5/4
Anthropic Claude

Self-hosted Open-weight Models

LLaMA via Ollama
vLLM/TGI

Installation

pip install promptshield

Or install from source:

git clone https://github.com/harshadindigal/PromptGuard.git
cd PromptGuard
pip install -e .

Configuration

PromptShield uses a YAML configuration file. Here's a sample configuration:

routing:
  rules:
    - if: "label == 'nonsense' or label == 'spam'"
      action: "block"
    - if: "label == 'repeat'"
      action: "cache"
    - if: "label == 'low_cost'"
      model: "cheap_model"
    - if: "label == 'valuable'"
      model: "default_model"

models:
  openai:
    default_model: "gpt-4"
    cheap_model: "gpt-3.5-turbo"
  ollama:
    default_model: "llama3-70b"
    cheap_model: "mistral-instruct"
  anthropic:
    default_model: "claude-v1"
    cheap_model: "claude-haiku"

Usage

API Server

Start the API server:

uvicorn promptshield.main:app --host 0.0.0.0 --port 8080

Send a request to the API:

import requests

response = requests.post(
    "http://localhost:8080/chat",
    json={
        "prompt": "What is the capital of France?",
        "session_id": "user123",
        "config": {
            "source": "openai",
            "default_model": "gpt-4",
            "cheap_model": "gpt-3.5-turbo"
        }
    }
)

print(response.json())

Python SDK

from promptshield.sdk import PromptShieldSDK

# Initialize the SDK
sdk = PromptShieldSDK(api_url="http://localhost:8080")

# Classify a prompt
classification = sdk.classify_prompt("What is the capital of France?")
print(classification)

# Route a prompt
response = sdk.route_prompt(
    prompt="What is the capital of France?",
    session_id="user123",
    source="openai",
    default_model="gpt-4",
    cheap_model="gpt-3.5-turbo"
)
print(response)

CLI

# Classify a prompt
promptshield classify "What is the capital of France?"

# Route a prompt
promptshield route "What is the capital of France?" --source openai --default-model gpt-4 --cheap-model gpt-3.5-turbo

# Process a file of prompts
promptshield process prompts.json --output results.json

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
logs		logs
promptshield		promptshield
tests		tests
.cache		.cache
.config		.config
.npm		.npm
.npmrc		.npmrc
README.md		README.md
__init__.py		__init__.py
api.log		api.log
cache.py		cache.py
cache_factory.py		cache_factory.py
chat_history.xml		chat_history.xml
classifier.py		classifier.py
cli.py		cli.py
code_execution_output_turn_25.log		code_execution_output_turn_25.log
config.yaml		config.yaml
create_readme.py		create_readme.py
enhance_implementation.py		enhance_implementation.py
final_summary.py		final_summary.py
main.py		main.py
nltk_data		nltk_data
node_modules		node_modules
original_user_query.txt		original_user_query.txt
project_summary.py		project_summary.py
prompt_logs.jsonl		prompt_logs.jsonl
promptshield.log		promptshield.log
push_to_github.py		push_to_github.py
push_to_github_fixed.py		push_to_github_fixed.py
redis_cache.py		redis_cache.py
requirements.txt		requirements.txt
router.py		router.py
sdk.py		sdk.py
setup.py		setup.py
setup_project_structure.py		setup_project_structure.py
test_cache.py		test_cache.py
test_classifier.py		test_classifier.py
test_prompts.json		test_prompts.json
test_router.py		test_router.py
tiktoken_cache		tiktoken_cache
tmp_code_092b08cd58a00506b486331ba153e887.bash		tmp_code_092b08cd58a00506b486331ba153e887.bash
tmp_code_2159958a9f4b4f1befc65d6a8c482a79.bash		tmp_code_2159958a9f4b4f1befc65d6a8c482a79.bash
tmp_code_2396db868835baef130b6ed075265abf.bash		tmp_code_2396db868835baef130b6ed075265abf.bash
tmp_code_6488cef7fcb65b400285ef991ac95fa5.bash		tmp_code_6488cef7fcb65b400285ef991ac95fa5.bash
tmp_code_7e0a0359a45d3591d67daa18db889969.bash		tmp_code_7e0a0359a45d3591d67daa18db889969.bash
tmp_code_816dc52cd85b63e14fdc1bfbd422f060.py		tmp_code_816dc52cd85b63e14fdc1bfbd422f060.py
verify_implementation.py		verify_implementation.py
verify_setup.py		verify_setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PromptShield

Features

Supported Models

Closed API Models

Self-hosted Open-weight Models

Installation

Configuration

Usage

API Server

Python SDK

CLI

License

About

Uh oh!

Releases

Packages

Languages

harshadindigal/PromptGuard

Folders and files

Latest commit

History

Repository files navigation

PromptShield

Features

Supported Models

Closed API Models

Self-hosted Open-weight Models

Installation

Configuration

Usage

API Server

Python SDK

CLI

License

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages