
Conversation

Contributor

Copilot AI commented Dec 9, 2025

Description

QAEvaluator was not forwarding the is_reasoning_model parameter to its underlying evaluators, causing failures with reasoning models (o1/o3) that don't support max_tokens:

# Before: fails with reasoning models
qa_eval = QAEvaluator(model_config=config, is_reasoning_model=True)
# Error: "Unsupported parameter: 'max_tokens' is not supported with this model"
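
The root cause is an OpenAI API incompatibility: reasoning models reject max_tokens and require max_completion_tokens instead, so each sub-evaluator that issues an LLM call has to know which parameter to send. A minimal illustration using the OpenAI Python SDK (model names are placeholders, not part of this PR; is_reasoning_model exists so the evaluators can pick the right parameter):

from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set

# Standard chat models accept max_tokens:
client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "hi"}],
    max_tokens=64,
)

# Reasoning models reject max_tokens and require max_completion_tokens:
client.chat.completions.create(
    model="o3-mini",
    messages=[{"role": "user", "content": "hi"}],
    max_completion_tokens=64,
)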

Changes

  • QAEvaluator: Added is_reasoning_model: bool = False parameter and passed it through to all model-based evaluators (GroundednessEvaluator, RelevanceEvaluator, CoherenceEvaluator, FluencyEvaluator, SimilarityEvaluator)
  • Tests: Added unit tests verifying parameter propagation with both default and explicit values
  • Formatting: Applied black formatting to evaluator instantiations and removed trailing whitespace

F1ScoreEvaluator is unchanged, as it performs local computation without LLM calls.
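
For reference, the pass-through pattern looks roughly like this (a simplified sketch, not the actual _qa.py source, whose __init__ takes additional arguments):

from azure.ai.evaluation import (
    CoherenceEvaluator,
    F1ScoreEvaluator,
    FluencyEvaluator,
    GroundednessEvaluator,
    RelevanceEvaluator,
    SimilarityEvaluator,
)

class QAEvaluator:
    def __init__(self, model_config, is_reasoning_model: bool = False, **kwargs):
        self._evaluators = [
            GroundednessEvaluator(model_config, is_reasoning_model=is_reasoning_model),
            RelevanceEvaluator(model_config, is_reasoning_model=is_reasoning_model),
            CoherenceEvaluator(model_config, is_reasoning_model=is_reasoning_model),
            FluencyEvaluator(model_config, is_reasoning_model=is_reasoning_model),
            SimilarityEvaluator(model_config, is_reasoning_model=is_reasoning_model),
            F1ScoreEvaluator(),  # local computation, no LLM call, so no flag
        ]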

Usage

qa_eval = QAEvaluator(
    model_config=model_config,
    is_reasoning_model=True  # Now properly forwarded to sub-evaluators
)
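
For context, model_config is typically an AzureOpenAIModelConfiguration pointing at a reasoning-model deployment. A hedged end-to-end sketch (endpoint, key, and deployment names are placeholders):

from azure.ai.evaluation import AzureOpenAIModelConfiguration, QAEvaluator

model_config = AzureOpenAIModelConfiguration(
    azure_endpoint="https://<your-resource>.openai.azure.com",  # placeholder
    api_key="<api-key>",  # placeholder
    azure_deployment="o3-mini",  # placeholder reasoning-model deployment
)

qa_eval = QAEvaluator(model_config=model_config, is_reasoning_model=True)
result = qa_eval(
    query="What is the capital of France?",
    response="Paris.",
    context="France's capital city is Paris.",
    ground_truth="Paris",
)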

All SDK Contribution checklist:

  • The pull request does not introduce breaking changes.
  • CHANGELOG is updated for new features, bug fixes or other significant changes.
  • I have read the contribution guidelines.

General Guidelines and Best Practices

  • Title of the pull request is clear and informative.
  • There are a small number of commits, each of which has an informative message. This means that previously merged commits do not appear in the history of the PR. For more information on cleaning up the commits in your PR, see this page.

Testing Guidelines

  • Pull request includes test coverage for the included changes.
Original prompt

This section details the original issue you should resolve

<issue_title>QAEvaluator does not pass through is_reasoning_model</issue_title>
<issue_description>I was trying to use the QAEvaluator with a reasoning model and the is_reasoning_model: true parameter, and ran into the following error:

openai.BadRequestError: Error code: 400 - {'error': {'message': "Unsupported parameter: 'max_tokens' is not supported with this model. Use 'max_completion_tokens' instead.", 'type': 'invalid_request_error', 'param': 'max_tokens', 'code': 'unsupported_parameter'}}

This doesn't happen for the individual evaluators that the QAEvaluator makes use of. It seems to be because the QAEvaluator does not pass is_reasoning_model through to the evaluators under the hood; see https://github.com/Azure/azure-sdk-for-python/blob/main/sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_qa/_qa.py#L104-L111</issue_description>

Comments on the Issue (you are @copilot in this section)



Co-authored-by: nagkumar91 <4727422+nagkumar91@users.noreply.github.com>
Copilot AI changed the title [WIP] Fix QAEvaluator to pass through is_reasoning_model parameter Fix QAEvaluator to pass through is_reasoning_model parameter Dec 9, 2025
Copilot AI requested a review from nagkumar91 December 9, 2025 17:38
@nagkumar91 nagkumar91 marked this pull request as ready for review December 9, 2025 22:51
@nagkumar91 nagkumar91 requested a review from a team as a code owner December 9, 2025 22:51
Copilot AI review requested due to automatic review settings December 9, 2025 22:51
@nagkumar91 nagkumar91 added the Evaluation Issues related to the client library for Azure AI Evaluation label Dec 9, 2025
Contributor

Copilot AI left a comment


Pull request overview

This PR fixes a bug where QAEvaluator was not forwarding the is_reasoning_model parameter to its underlying model-based evaluators, causing failures when used with reasoning models (o1/o3) that don't support the max_tokens parameter.

Key changes:

  • Added is_reasoning_model parameter to QAEvaluator.__init__() with default value False
  • Forwarded the parameter to all five model-based sub-evaluators (GroundednessEvaluator, RelevanceEvaluator, CoherenceEvaluator, FluencyEvaluator, SimilarityEvaluator)
  • Added unit tests verifying proper parameter propagation for both default and explicit values

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 2 comments.

File: sdk/evaluation/azure-ai-evaluation/azure/ai/evaluation/_evaluators/_qa/_qa.py
Description: Added the is_reasoning_model parameter and forwarded it to all model-based sub-evaluators, excluding F1ScoreEvaluator, which performs local computation.

File: sdk/evaluation/azure-ai-evaluation/tests/unittests/test_built_in_evaluator.py
Description: Added two test cases verifying proper parameter propagation with default (False) and explicit (True) values.
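
The new tests themselves are not reproduced in this view. A propagation test of this kind typically patches the sub-evaluator classes where _qa.py resolves them and asserts on the constructor kwargs; the patch targets below are assumptions about the module layout, not the actual test code:

import contextlib
from unittest.mock import MagicMock, patch

import pytest

from azure.ai.evaluation import QAEvaluator

_SUB_EVALUATORS = [
    "azure.ai.evaluation._evaluators._qa._qa.GroundednessEvaluator",
    "azure.ai.evaluation._evaluators._qa._qa.RelevanceEvaluator",
    "azure.ai.evaluation._evaluators._qa._qa.CoherenceEvaluator",
    "azure.ai.evaluation._evaluators._qa._qa.FluencyEvaluator",
    "azure.ai.evaluation._evaluators._qa._qa.SimilarityEvaluator",
]

@pytest.mark.parametrize("flag", [False, True])
def test_qa_evaluator_forwards_is_reasoning_model(flag):
    # Patch every model-based sub-evaluator so no real client is built.
    with contextlib.ExitStack() as stack:
        mocks = [stack.enter_context(patch(name)) for name in _SUB_EVALUATORS]
        QAEvaluator(model_config=MagicMock(), is_reasoning_model=flag)
    # Each sub-evaluator should have been constructed with the same flag.
    for mock_cls in mocks:
        _, kwargs = mock_cls.call_args
        assert kwargs.get("is_reasoning_model") is flag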

@github-actions

github-actions bot commented Dec 9, 2025

API Change Check

APIView identified API-level changes in this PR and created the following API reviews:

azure-ai-evaluation

@nagkumar91
Member

@copilot 2025-12-09 23:16:52,740 [INFO] azure-sdk-tools: The package azure-ai-evaluation needs reformat. Run black locally to reformat.

nagkumar91 and others added 3 commits December 9, 2025 15:24
…in_evaluator.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Co-authored-by: nagkumar91 <4727422+nagkumar91@users.noreply.github.com>
Contributor Author

Copilot AI commented Dec 9, 2025

@copilot 2025-12-09 23:16:52,740 [INFO] azure-sdk-tools: The package azure-ai-evaluation needs reformat. Run black locally to reformat.

Fixed in commit 363bec4. Applied black formatting to the long lines in the evaluators list by splitting them across multiple lines.
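
For illustration, black's reformatting of an over-long evaluator list looks roughly like this (hypothetical before/after, not the actual diff):

# Before: a single line exceeding the configured line length
self._evaluators = [GroundednessEvaluator(model_config, is_reasoning_model=is_reasoning_model), RelevanceEvaluator(model_config, is_reasoning_model=is_reasoning_model)]

# After black: one element per line, with a magic trailing comma
self._evaluators = [
    GroundednessEvaluator(model_config, is_reasoning_model=is_reasoning_model),
    RelevanceEvaluator(model_config, is_reasoning_model=is_reasoning_model),
]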

Warning

Firewall rules blocked me from connecting to one or more addresses.

I tried to connect to the following addresses, but was blocked by firewall rules:

  • pypi.org
    • Triggering command: /home/REDACTED/work/azure-sdk-for-python/azure-sdk-for-python/.venv/bin/pip pip install black (dns block)
  • scanning-api.github.com
    • Triggering command: /home/REDACTED/work/_temp/ghcca-node/node/bin/node /home/REDACTED/work/_temp/ghcca-node/node/bin/node --enable-source-maps /home/REDACTED/work/_temp/copilot-developer-action-main/dist/index.js (dns block)

If you need me to access, download, or install something from one of these locations, you can either:



Development

Successfully merging this pull request may close these issues.

QAEvaluator does not pass through is_reasoning_model
