🐑 Run Quantized Agents on AWS Lambda for Cheap 🐥
LLMabda is a simple proxy for llama.cpp server that runs on AWS Lambda.
Information about deployment is in deploy/
- Agent (Using ggml-org/Qwen3-1.7B-GGUF)
- source code
- "What is 5 + (5 * 6)? Please shout the final answer."
- Summarize text (Using ggml-org/Qwen3-1.7B-GGUF)
- source code
- "What happened to the Crystal Extractor?"

