[V2][RFC][wip] make room for batch eval #1007

jonathanlastmileai · 2024-01-23T23:01:12Z

[V2][RFC][wip] make room for batch eval

This moves the existing eval library to "test_suite_eval" and starts the equivalent
for batch runs. Also makes the interface a little clearer.

Essentially, the differences are:

each metric runs on a list of inputs, not just one
each input can be paired with a reference. This is possible in the "test suite"
setup, but it is clunkier.

This moves the existing eval library to "test_suite_eval" and starts the equivalent for batch runs. Also makes the interface a little clearer. Essentially, the differences are: - each metric runs on a _list_ of inputs, not just one - each input can be paired with a reference. This is possible in the "test suite" setup, but it is clunkier.

jonathanlastmileai force-pushed the pr1007 branch from a62f90f to 402d73d Compare January 23, 2024 23:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[V2][RFC][wip] make room for batch eval #1007

[V2][RFC][wip] make room for batch eval #1007

Uh oh!

jonathanlastmileai commented Jan 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[V2][RFC][wip] make room for batch eval #1007

Are you sure you want to change the base?

[V2][RFC][wip] make room for batch eval #1007

Uh oh!

Conversation

jonathanlastmileai commented Jan 23, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants