Test/verify suite template #51

Conversation
Walkthrough

Adds a new test module implementing a mock FAQ-adapter workflow: a data carrier, an iterable mock dataset, a mock agent, an adapter with prepare/execute, and a test that runs the flow, asserting successful outcomes and expected outputs.
Sequence Diagram(s)

```mermaid
sequenceDiagram
participant Test as Test (pytest)
participant Dataset as MockDataset
participant Adapter as FAQAdapter
participant Agent as MockAgent
Note over Test,Adapter: Setup
Test->>Adapter: instantiate
Test->>Adapter: Adapter.prepare()
Adapter->>Agent: connect()
Agent-->>Adapter: connected
Note over Test,Dataset: Execution loop
Test->>Dataset: for item in dataset
Dataset-->>Test: FAQItem(question, answer)
Test->>Adapter: Adapter.execute(FAQItem)
Adapter->>Agent: answer(question)
Agent-->>Adapter: response
Adapter->>Test: AdapterOutcome(success, output)
    Test->>Test: assert outcome.success and output == expected
```
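Read as code, the diagram corresponds to roughly this driver loop — a sketch using the class names from the walkthrough; the exact signatures are assumptions, not confirmed framework API:

```python
# Sketch of the diagrammed flow; FAQAdapter/MockDataset are the PR's mocks (assumed shapes).
adapter = FAQAdapter()               # Test -> Adapter: instantiate
adapter.prepare()                    # Adapter -> Agent: connect()

for item in MockDataset():           # Dataset yields FAQItem(question, answer)
    outcome = adapter.execute(item)  # Adapter -> Agent: answer(question)
    assert outcome.success           # AdapterOutcome(success, output)
    assert outcome.output == item.answer
```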
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~8 minutes
🚥 Pre-merge checks: ✅ 1 passed | ❌ 2 failed (1 warning, 1 inconclusive)
Actionable comments posted: 2
🤖 Fix all issues with AI agents
In @tests/test_suite_template.py:
- Around line 37-44: The execute method signature is incompatible with
BaseAdapter: change the method signature of execute to match BaseAdapter (def
execute(self, case: DatasetCase, trace: TraceLog) -> AdapterOutcome), extract
the FAQItem from the provided case (e.g., faq = case.input or case.data) and use
faq.question/faq.answer for the agent call and comparison, properly handle or
forward the trace parameter (at minimum accept it; optionally record events on
trace if needed), return AdapterOutcome(success=success, output=response) with
the missing space after the comma fixed, and ensure references to
AdapterOutcome, DatasetCase, TraceLog, and FAQItem remain correct.
- Around line 46-61: The test incorrectly calls adapter.execute(item) with a
single argument while BaseAdapter.execute requires two parameters (case:
DatasetCase and trace: TraceLog), and it should use the agentunit framework test
runner; fix by replacing the manual loop with construction of a Scenario using
Scenario(name="demo", adapter=adapter, dataset=dataset) and call
run_suite([scenario]) to execute tests, ensuring any call sites or mocks conform
to BaseAdapter.execute(case, trace) signature (use a DatasetCase instance and a
TraceLog when invoking directly if needed).
🧹 Nitpick comments (1)
tests/test_suite_template.py (1)
4-4: Remove unused import. The Scenario class is imported but never used in this test file.

♻️ Remove unused import

```diff
-from agentunit import Scenario
```
📜 Review details
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
📒 Files selected for processing (1)
tests/test_suite_template.py
🧰 Additional context used
🧬 Code graph analysis (1)
tests/test_suite_template.py (1)
src/agentunit/adapters/base.py (2)
AdapterOutcome (18-25), BaseAdapter (28-69)
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (2)
- GitHub Check: Test (Python 3.12)
- GitHub Check: Test (Python 3.10)
🔇 Additional comments (3)
tests/test_suite_template.py (3)
7-10: LGTM! The FAQItem dataclass is well-defined and appropriate for modeling FAQ data.

12-16: LGTM! The MockDataset correctly implements the iterator protocol and provides appropriate test fixtures.

18-27: LGTM! The MockAgent provides a simple, deterministic mock for testing the adapter flow.
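The fixture code itself is not quoted in this review excerpt; below is a plausible reconstruction consistent with the three comments above. The field names come from the walkthrough, while the canned question/answer pair is invented for illustration:

```python
from dataclasses import dataclass

@dataclass
class FAQItem:
    question: str
    answer: str

class MockDataset:
    """Iterable fixture yielding FAQItem test cases."""
    def __iter__(self):
        yield FAQItem("What is AgentUnit?", "A testing framework for agents.")

class MockAgent:
    """Deterministic stand-in for a real agent."""
    def connect(self) -> None:
        self.connected = True

    def answer(self, question: str) -> str:
        # Always returns the canned answer so assertions stay deterministic
        return "A testing framework for agents."
```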
```python
    def execute(self, input_data: FAQItem) -> AdapterOutcome:
        # The core logic we want to test
        response = self.agent.answer(input_data.question)

        # Simple exact match check
        success = (response == input_data.answer)

        return AdapterOutcome(success=success,output=response)
```
Critical: execute signature doesn't match BaseAdapter interface.
The execute method signature is incompatible with BaseAdapter. According to the base class definition, execute must accept (self, case: DatasetCase, trace: TraceLog) -> AdapterOutcome, but this implementation only takes input_data: FAQItem. This adapter won't work with the agentunit framework and violates the Liskov Substitution Principle.
Additionally, Line 44 has a minor formatting issue (missing space after comma).
🔧 Proposed fix

```diff
-    def execute(self, input_data: FAQItem) -> AdapterOutcome:
+    def execute(self, case: FAQItem, trace: Any) -> AdapterOutcome:
         # The core logic we want to test
-        response = self.agent.answer(input_data.question)
+        response = self.agent.answer(case.question)

         # Simple exact match check
-        success = (response == input_data.answer)
+        success = (response == case.answer)

-        return AdapterOutcome(success=success,output=response)
+        return AdapterOutcome(success=success, output=response)
```

Note: You may need to properly handle the trace parameter if tracing is required for this adapter.
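If the adapter is ever invoked directly rather than through the runner, the call would have to supply both parameters. A minimal sketch, assuming DatasetCase wraps the payload and TraceLog has a no-argument constructor — the import path and both constructors are guesses, not verified against the codebase:

```python
# Hypothetical direct invocation; check src/agentunit for the real class locations.
from agentunit.core import DatasetCase, TraceLog  # assumed import path

case = DatasetCase(input=FAQItem("What is AgentUnit?", "A testing framework for agents."))
trace = TraceLog()

outcome = adapter.execute(case, trace)  # matches BaseAdapter's (case, trace) signature
assert outcome.success
```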
Run the following script to verify how other adapters in the codebase implement the BaseAdapter.execute method:

```bash
#!/bin/bash
# Description: Find all adapter implementations to verify the correct execute signature

# Search for classes that inherit from BaseAdapter
ast-grep --pattern $'class $_ (BaseAdapter):
  $$$
'

# Search for execute method implementations in adapter files
rg -n -A 5 'def execute' --type=py -g '*adapter*.py'
```
tests/test_suite_template.py (outdated)
```python
def test_suite_template_flow():
    """
    Verifies that the FAQAdapter logic from the documentation template
    works correctly with a mock agent.
    """
    # 1. Setup the scenario with our adapter and dataset
    dataset = MockDataset()
    adapter = FAQAdapter()
    adapter.prepare()

    for item in dataset:
        outcome = adapter.execute(item)

        # 3. Assertions
        assert outcome.success is True
        assert outcome.output == item.answer
```
🧩 Analysis chain

🏁 Scripts executed:

```bash
# Find test files using Scenario class
rg -n -B 2 -A 5 'Scenario' --type=py -g 'test_*.py' -g '*_test.py'

# Also check for Scenario class definition
fd -e py -x rg -l 'class Scenario'

# Check the BaseAdapter class and its execute method signature
rg -n -A 10 'class BaseAdapter' --type=py

# Check the current test file to understand its imports and structure
cat -n tests/test_suite_template.py

# Check the complete BaseAdapter execute method signature
rg -n -A 15 'def execute' src/agentunit/adapters/base.py

# Check how execute is called in the actual runner/framework code
rg -n 'execute(' src/agentunit/core/runner.py -B 2 -A 2

# Check how execute is called in the runner
rg -n 'adapter.execute' src/agentunit/core/ -B 2 -A 2

# Check the Scenario class definition and how it uses adapters
rg -n -A 20 'class Scenario' src/agentunit/core/scenario.py | head -60

# Look at test_runner.py to see how scenarios are properly used with adapters
head -100 tests/test_runner.py
```
Fix the execute() signature and use the agentunit framework's testing infrastructure.

The test calls adapter.execute(item) with a single argument, but BaseAdapter.execute() requires two parameters: case (DatasetCase) and trace (TraceLog). Additionally, the test should use the Scenario class (already imported) with the framework's run_suite() instead of manually iterating the dataset. This demonstrates proper framework integration, which is essential for a template that documents recommended patterns.

See tests/test_runner.py for the correct approach:

```python
scenario = Scenario(name="demo", adapter=adapter, dataset=dataset)
result = run_suite([scenario])
```
Codecov Report: ✅ All modified and coverable lines are covered by tests.
Description
This PR adds a unit test (tests/test_suite_template.py) that implements the FAQAdapter pattern described in docs/templates/suite_template.py. It verifies that the documentation example correctly implements the BaseAdapter interface (using prepare and execute) and produces valid AdapterOutcome objects.

Changes Made

Added tests/test_suite_template.py, which implements FAQAdapter inheriting from BaseAdapter.

Testing

Ran the test locally (poetry run pytest tests/test_suite_template.py)