Releases: radlab-dev-group/llm-router
v0.4.4
Immutable release. Only release title and notes can be modified.
What's Changed
- Validate unique provider identifiers.
- Store all hosts with keep‑alive configured in Redis.
- UtilsPlugin pipeline with a simple LangChain-based RAG plugin (extends the GenAI context with a locally built database).
- Improve dev tooling and enforce request timeouts.
- Refactor API to use unified /v1/responses endpoint.
Full Changelog: https://github.com/radlab-dev-group/llm-router/commits/v0.4.4
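As an illustration of the "Validate unique provider identifiers" item above, the check could look roughly like the sketch below. This is not the project's code: the function names, the "id" and "host" keys, and the sample entries are all hypothetical and chosen only to show the idea of rejecting a configuration in which two providers share an identifier.

```python
from collections import Counter


def find_duplicate_ids(providers):
    """Return provider ids that occur more than once, in first-seen order."""
    # Assumption: each provider is a dict with an "id" key (hypothetical schema).
    counts = Counter(p["id"] for p in providers)
    return [pid for pid, n in counts.items() if n > 1]


def validate_unique_provider_ids(providers):
    """Raise ValueError if any provider id is repeated."""
    duplicates = find_duplicate_ids(providers)
    if duplicates:
        raise ValueError(f"duplicate provider ids: {', '.join(duplicates)}")


# Hypothetical configuration entries, for illustration only.
providers = [
    {"id": "local-a", "host": "http://10.0.0.1:8000"},
    {"id": "local-b", "host": "http://10.0.0.2:8000"},
]
validate_unique_provider_ids(providers)  # no exception: ids are unique
```

Failing at configuration load time, rather than at request time, keeps an ambiguous identifier from silently routing traffic to the wrong host.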
v0.4.3
What's Changed
- Helm chart in #20
- Add sample auditor log file to logs/auditor/ directory in #21
- Add Prometheus metrics handler with multiprocess support in #22
- Add configurable monitoring intervals to services monitor in #25
- Skip FastMaskerPlugin in LLMRouterServicesMonitor host probing and clean up unused guard‑rail variables in #26
- Add V0 chat handler and integer timestamps to models endpoint in #28
- Add model config docs and tool calling support in #31
- Refactor monitor log prefixes to explicit [*-monitor] tags and comment out Engine del cleanup in #32
- Add fake model provider and fix guardrail streaming in #33
Full Changelog: v0.4.2...v0.4.3
v0.4.2
Full Changelog: v0.4.1...v0.4.2
v0.4.1
Full Changelog: v0.4.0...v0.4.1
v0.4.0
Full Changelog: v0.3.1...v0.4.0
v0.3.1
Full Changelog: v0.3.0...v0.3.1
v0.3.0
Full Changelog: v0.2.3...v0.3.0
v0.2.3
Full Changelog: v0.2.2...v0.2.3
v0.2.2
Full Changelog: v0.2.1...v0.2.2
v0.2.1
Full Changelog: v0.2.0...v0.2.1