Skip to content

Conversation

@poshinchen
Copy link
Contributor

Description

  1. Add ResponseRelevanceEvaluator to assess the relevance of the LLM response to the question, in other words, how focused the LLM response is on the given question.
  2. Implement 5-level scoring system (Not At All, Not Generally, Neutral/Mixed, Generally Yes, Completely Yes)

Related Issues

#100

Documentation PR

WIP

Type of Change

  1. New feature ("ResponseRelevance Evaulator")

Notes

  • Need to run evaluations via agentcore online evaluator and compare the scores.

Testing

How have you tested the change? Verify that the changes do not break functionality or introduce warnings in consuming repositories: agents-docs, agents-tools, agents-cli

  • I ran hatch run prepare

Checklist

  • I have read the CONTRIBUTING document
  • I have added any necessary tests that prove my fix is effective or my feature works
  • I have updated the documentation accordingly
  • I have added an appropriate example to the documentation to outline the feature, or no new docs are needed
  • My changes generate no new warnings
  • Any dependent changes have been merged and published

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@poshinchen poshinchen changed the title (WIP) feat: added ResponseRelevanceEvaluator feat: added ResponseRelevanceEvaluator Jan 30, 2026
@poshinchen poshinchen force-pushed the feat/response-relevance-evaluator branch from f616f3d to 190c984 Compare February 3, 2026 17:22
@poshinchen poshinchen force-pushed the feat/response-relevance-evaluator branch from 190c984 to c51f857 Compare February 3, 2026 19:24
@poshinchen poshinchen merged commit 41ff8cc into strands-agents:main Feb 3, 2026
12 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants