
Pardon_Me is an AI Agents GitHub repository interpreter that turns complex codebases into clear, conversational explanations. It analyzes user intent, fetches live repo data, detects jargon, and delivers beginner-friendly insights like a teammate who already understands the entire project.


Pardon_Me

A GitHub Repo Insight Agent

Problem • Solution • Journey • Multi-Agent Architecture • High-Level Overview • Setup • Design • Future • Resources

1. Problem

Large GitHub repositories often read like dense technical forests: layers of code, scattered documentation, and enough jargon to overwhelm even experienced developers. The result is slow onboarding, unnecessary confusion, and a constant sense of "Where do I even start?"

I can see it affecting all kinds of people, including:

  • New contributors struggling to understand project structure
  • Technical writers needing to document unfamiliar systems
  • Product managers seeking to grasp implementation details
  • Educators looking to explain real-world code to students

The bottleneck isn't just about missing documentation; it's about the gap between how code is written and how humans naturally understand systems.

2. Solution

This project grew out of that gapβ€”a desire to make repositories a little less intimidating and a lot more readable.

Pardon_Me is a focused multi-agent system that understands user intent, analyzes the repo via GitHub MCP, identifies jargon, and produces a clear, accessible interpretation, while still allowing users to engage with it conversationally. They can ask follow-up questions, clarify unclear sections, or dive deeper into specific components, much like consulting a well-informed teammate who already understands the entire repository.

3. Project Journey and Evolution

Pardon_Me started off as an idea built around parallel execution, where each sub-agent would analyse the code from a different stakeholder's POV (designer, product manager, QA engineer, etc.) and then provide the user with an aggregated insight. The idea seemed solid, but I had to pivot pretty quickly due to GitHub constraints and timeouts when I called the MCP for each sub-agent simultaneously. One workaround was to run the agents sequentially, but that defeated the entire purpose of parallel execution.

Additionally, I tried using them as an 'AgentTeam', where a parent LlmAgent would dynamically decide which sub-agents to execute in parallel, but Google ADK doesn't seem to have that feature yet (I could be wrong, please let me know if you know a way).

Finally, I settled on a SequentialAgent solution where each agent is dedicated to one step in the sequence: analyse the user query -> extract essential code features -> decompose technical jargon -> summarise for easy understanding. That said, I came across a couple of hurdles while building this solution as well, such as:

  1. Configuring the remote GitHub MCP server
  2. AgentTool's output isn't a valid ADK event, so when I tried to retrieve it for context compaction, it returned format errors. (I wasn't able to fix this; it remains an open issue.)

4. Agent Architecture/Pipeline

image

Once the user inputs a query, the backend system triggers the root_agent, which then uses five specialized agents in sequence:

  1. User Analyst

    • Interprets user requests and extracts true intent
    • Identifies target files, components, and expected output format
  2. Code Analyst

    • Retrieves and analyzes code using GitHub MCP toolset
    • Explains purpose, behavior, logic, dependencies, and execution flow

The GitHub MCP toolset provides secure, read-only access to repositories:

# Import paths shown here follow google-adk's MCP tool module and may vary slightly between ADK versions.
import os

from google.adk.tools.mcp_tool.mcp_toolset import McpToolset, StreamableHTTPServerParams

GITHUB_TOKEN = os.getenv("GITHUB_MCP_PAT")  # GitHub Personal Access Token loaded from .env

github_mcp_tool = McpToolset(
    connection_params=StreamableHTTPServerParams(
        url="https://api.githubcopilot.com/mcp/",
        headers={
            "Authorization": f"Bearer {GITHUB_TOKEN}",
            "X-MCP-Toolsets": "all",    # expose the full set of GitHub MCP toolsets
            "X-MCP-Readonly": "true"    # restrict the agent to read-only repository access
        },
        timeout=30,
    ),
)

This enables the Code Analyst agent to fetch live repository data without manual file uploads.

  3. Jargon Detector

    • Scans technical analysis for confusing terminology
    • Identifies framework-specific language, acronyms, and advanced concepts
  4. Readability Rewriter

    • Transforms technical explanations into beginner-friendly language while maintaining correctness
    • Removes or defines jargon identified in the previous step
  5. Aggregator 📋

    • Merges technical depth with accessibility
    • Produces the final polished report aligned with user intent

Each of these agents has an output_key, whose value is propagated to the next agent in the sequence:

# Each agent outputs to a specific key
user_analyst          → output_key="user_intent"
code_analyst          → output_key="code_analysis"
jargon_detector       → output_key="jargon_list"
readability_rewriter  → output_key="readable_code_analysis"
aggregator            → output_key="final_report"
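As a minimal sketch of how this chaining can be expressed in ADK terms (agent names, the model string, and the instructions below are illustrative; the real definitions live in agent/pipeline_agents.py and agent/root_agent.py), each agent writes its result to session state under its output_key, and a later agent can read it back through a placeholder in its instruction:

from google.adk.agents import LlmAgent, SequentialAgent

user_analyst = LlmAgent(
    name="user_analyst",
    model="gemini-2.5-flash-lite",
    instruction="Interpret the user's request: extract the true intent, "
                "target files/components, and the expected output format.",
    output_key="user_intent",                  # saved to session state
)

code_analyst = LlmAgent(
    name="code_analyst",
    model="gemini-2.5-flash-lite",
    # {user_intent} is injected from session state, i.e. the previous agent's output
    instruction="Fetch and analyze the code that answers this intent: {user_intent}",
    tools=[github_mcp_tool],                   # the GitHub MCP toolset defined above
    output_key="code_analysis",
)

# jargon_detector, readability_rewriter, and aggregator follow the same pattern...

pipeline = SequentialAgent(
    name="pardon_me_pipeline",
    sub_agents=[user_analyst, code_analyst],   # plus the remaining three agents
)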

4.1 Session Persistence

Conversations are stored in SQLite, allowing users to:

  • Continue previous conversations
  • Reference earlier explanations
  • Build understanding incrementally
session_service = DatabaseSessionService(db_url="sqlite+aiosqlite:///data/sessions.db")
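
For context, here is a hedged sketch of how that service might be used to resume or start a conversation (the helper name, the app_name value, and the exact logic are assumptions; the repo's own session handling lives in utils.py and app.py):

from typing import Optional

# Uses the session_service defined above (DatabaseSessionService over SQLite).
async def get_or_create_session(user_id: str, session_id: Optional[str] = None):
    # Reuse an existing conversation when the client sends back its session id...
    if session_id:
        existing = await session_service.get_session(
            app_name="pardon_me", user_id=user_id, session_id=session_id
        )
        if existing:
            return existing
    # ...otherwise start a fresh session, persisted automatically to data/sessions.db.
    return await session_service.create_session(app_name="pardon_me", user_id=user_id)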

4.2 Observability

To keep the whole pipeline transparent, I added an observability layer using LoggingPlugin(). It captures every step each agent takes: tools invoked, intermediate outputs, decisions, and even small reasoning breadcrumbs. Instead of leaving you guessing what the system is doing behind the curtain, the logs stream in real time on your terminal. It feels a bit like peeking into the brain of the agent chain as it collaborates, making it far easier to debug, track bottlenecks, or simply understand how the multi-agent pipeline behaves when processing a query.
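
Concretely, the plugin is handed to the ADK Runner alongside the pipeline and the session service. A rough sketch, assuming the plugins parameter available in recent google-adk releases (the LoggingPlugin import path may also differ between versions):

from google.adk.runners import Runner
from google.adk.plugins.logging_plugin import LoggingPlugin  # import path assumed

runner = Runner(
    agent=root_agent,                  # the SequentialAgent pipeline wrapper
    app_name="pardon_me",
    session_service=session_service,   # DatabaseSessionService from section 4.1
    plugins=[LoggingPlugin()],          # logs tool calls, agent outputs, and events to the terminal
)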

4.3 Summary of Features

  • Multi-layered Analysis / Multi-Agent System: Combines technical depth with beginner-friendly explanations through a pipeline of sequential agents
  • Conversational Understanding: Users can ask follow-up questions, clarify unclear sections, or dive deeper into specific components, much like consulting a well-informed teammate who already understands the entire repository
  • Session Persistence: Conversations are stored in SQLite, enabling follow-up questions and context retention
  • Direct GitHub Integration: Uses GitHub MCP to access live repository data
  • Streaming Responses: Real-time agent output processing (see the sketch after this list)
  • Modular Design: Each agent has a single, well-defined responsibility
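
As a rough illustration of the streaming piece (assuming the runner constructed above and the standard ADK run_async event stream; the actual handling in app.py may differ), the backend can forward each event's text to the client as it arrives:

from google.genai import types

async def stream_answer(runner, user_id: str, session_id: str, text: str):
    # Wrap the raw user text in the Content/Part structure the runner expects.
    message = types.Content(role="user", parts=[types.Part(text=text)])
    async for event in runner.run_async(
        user_id=user_id, session_id=session_id, new_message=message
    ):
        # Intermediate agents emit events too; the aggregator's final_report arrives last.
        if event.content and event.content.parts:
            for part in event.content.parts:
                if part.text:
                    yield part.text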

5. High Level Overview & Project Structure

image
  • Framework: FastAPI for REST API backend
  • AI Engine: Google Gemini 2.5 Flash Lite via ADK
  • GitHub Integration: GitHub MCP (Model Context Protocol) for repository access
  • Session Management: SQLite database for persistent conversations
  • Frontend: Vanilla HTML/CSS/JavaScript (served statically)
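
To make the wiring concrete, here is a minimal sketch of how app.py could tie these layers together (the endpoint path, request model, and handler body are illustrative assumptions, not the repo's exact code):

from typing import Optional

from fastapi import FastAPI
from fastapi.staticfiles import StaticFiles
from pydantic import BaseModel

app = FastAPI(title="Pardon_Me")

class ChatRequest(BaseModel):
    user_id: str
    session_id: Optional[str] = None
    message: str

@app.post("/chat")
async def chat(req: ChatRequest):
    # 1. Look up or create the session in SQLite.
    # 2. Run the sequential agent pipeline through the ADK Runner.
    # 3. Stream or return the aggregator's final_report.
    ...

# Serve the vanilla HTML/CSS/JS frontend from the static/ folder.
app.mount("/", StaticFiles(directory="static", html=True), name="static")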

5.1 Project Structure

pardon_me/
├── agent/
│   ├── github_mcp_tool.py       # GitHub MCP toolset configuration
│   ├── pipeline_agents.py       # Five specialized agents
│   └── root_agent.py            # Root agent and pipeline wrapper
├── data/
│   └── sessions.db              # SQLite session storage (auto-created)
├── static/
│   ├── index.html               # Frontend UI
│   ├── script.js                # Client-side logic
│   └── style.css                # Styling
├── .env                         # Environment variables (create this)
├── .gitignore                   # Git ignore rules
├── app.py                       # FastAPI application entry point
├── requirements.txt             # Python dependencies
├── README.md                    # This file
└── utils.py                     # Session management utilities

6. Instructions for Setup

Prerequisites

  • Python 3.8 or higher
  • GitHub Personal Access Token (PAT) with repository read access
  • Google Gemini API Key

Step 1: Clone the Repository

git clone https://github.com/yourusername/pardon_me.git
cd pardon_me

Step 2: Install Dependencies

pip install -r requirements.txt

Required packages include:

  • fastapi - Web framework
  • uvicorn - ASGI server
  • python-dotenv - Environment variable management
  • google-adk - Google Agent Development Kit
  • google-genai - Google Generative AI SDK
  • pydantic - Data validation
  • aiosqlite - Async SQLite support

Step 3: Configure Environment Variables

Create a .env file in the project root:

touch .env

Add your API credentials:

GOOGLE_API_KEY=your_google_gemini_api_key_here
GITHUB_MCP_PAT=your_github_personal_access_token_here
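
These variables are presumably loaded at startup via python-dotenv (listed in the requirements); a minimal sketch of the loading step:

import os
from dotenv import load_dotenv

load_dotenv()  # reads the .env file in the project root

GOOGLE_API_KEY = os.getenv("GOOGLE_API_KEY")  # consumed by the Gemini model via ADK
GITHUB_TOKEN = os.getenv("GITHUB_MCP_PAT")    # forwarded to the remote GitHub MCP server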

Step 4: Run the Application

uvicorn app:app --reload

Step 5: Access the UI

Open your browser and navigate to:

http://localhost:8000

You'll see the Pardon_Me chat interface where you can start asking questions about GitHub repositories.

image

Example Queries

Try asking:

  • "Explain the main authentication flow in [repo-owner/repo-name]"
  • "What does the database schema look like in this project?"
  • "How does the API routing work in app.py?"
  • "Break down the agent pipeline in simple terms"
image

You can see the logs emitted by LoggingPlugin() (for observability) show up in your terminal like this:

image

7. Title and design

I titled the project Pardon_Me after the many times I've had to ask someone to repeat themselves because they seemed to be speaking another language made up entirely of technical jargon! I find myself muttering it while scrambling through a GitHub repo I've just opened, too.

As for the design, I just wanted something warm to work with while I practice. This was the initial prototype:

image

8. Future Enhancements

  • Enhanced UI: Rich code syntax highlighting and interactive diagrams
  • Advanced Analysis: Detect code smells, security issues, and architectural patterns
  • Advanced Features: Allow users with non-technical backgrounds to edit code using simple commands
  • Deployment (obviously 🥹): Cloud deployment using Google Cloud Run or a similar platform

9. List of Resources

  • Kaggle: 5-Day AI Agents Intensive Course with Google
  • Google ADK Docs
  • Remote GitHub MCP Server
  • FastAPI
  • Excalidraw (for diagrams)

Built with ❤️ - Jo
