-
Notifications
You must be signed in to change notification settings - Fork 2.9k
Description
Problem (one or two sentences)
When doing something like reading or writing a file, or other less intensive tasks it is useful to use cheaper models. Now we can set profiles, but to be able to dynamically choose based on some constraints could reduce costs by a lot.
Context (who is affected and when)
Everyone. Right now simple mode / profile choices reduces costs, esp with orchestrator. But there can be a lot of other sub-tasks which can be done by an even cheaper model.
Desired behavior (conceptual, not technical)
Auto routing for the model.
Constraints / preferences (optional)
Configurable to some extent so that we can decide what model/temp we want in which use case.
For example we could have some dimensions, like:
- ambiguity of request ( is the ask clear, does it require clarification or investigation)
- size of request (does it need to be broken down to complete)
- determinism of request (debugging, refactoring, might need higher temp; info gathering may be lower size)
- more?
Request checklist
- I've searched existing Issues and Discussions for duplicates
- This describes a specific problem with clear context and impact
Roo Code Task Links (optional)
No response
Acceptance criteria (optional)
GIVEN an agent requires to do a simple task which can easily be done with less thinking
WHEN executing the task
THEN the agent will choose an apt model for the task, thereby reducing costs and improving the end result
Proposed approach (optional)
No response
Trade-offs / risks (optional)
No response