|
| 1 | +# Resources |
| 2 | +## Benchmark papers |
| 3 | +* m&m’s: A Benchmark to Evaluate Tool-Use for multi-step multi-modal Tasks |
| 4 | + * multi-modal |
| 5 | +* AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents |
| 6 | +* TOOLLLM: FACILITATING LARGE LANGUAGE MODELS TO MASTER 16000+ REAL-WORLD APIS |
| 7 | + * toolbench |
| 8 | +* Gorilla: Large Language Model Connected with Massive APIs |
| 9 | + * code output |
| 10 | +* AGENTVERSE: FACILITATING MULTI-AGENT COLLABORATION AND EXPLORING EMERGENT BEHAVIORS |
| 11 | +* CRAFT: CUSTOMIZING LLMS BY CREATING AND RETRIEVING FROM SPECIALIZED TOOLSETS |
| 12 | +* α-UMi: Small LLMs Are Weak Tool Learners: A Multi-LLM Agent |
| 13 | +* AnyTool: Self-Reflective, Hierarchical Agents for Large-Scale API Calls |
| 14 | +* WISSNYF: TOOL GROUNDED LLM AGENTS FOR BLACK BOX SETTING |
| 15 | +* τ -bench: A Benchmark for Tool-Agent-User Interaction in Real-World Domains |
| 16 | + |
| 17 | +## Papers |
| 18 | +* GPT4Tools: Teaching Large Language Model to Use Tools via Self-instruction |
| 19 | +* AI Agents That Matter |
| 20 | + |
| 21 | +## Autogen |
| 22 | +* groupchat, nestedchat example |
| 23 | + * https://microsoft.github.io/autogen/docs/tutorial/conversation-patterns/#two-agent-chat-and-chat-result |
| 24 | + * https://microsoft.github.io/autogen/docs/notebooks/agentchat_nestedchat/ |
| 25 | +* Custom llm + function calling |
| 26 | + * https://microsoft.github.io/autogen/docs/topics/non-openai-models/local-litellm-ollama/ |
| 27 | + * https://www.reddit.com/r/ollama/comments/1bacf8c/anyone_had_success_with_function_calling_its/ |
| 28 | + * https://www.reddit.com/r/AutoGenAI/comments/1b57l2t/trying_to_get_autogen_to_work_with_ollama_and/ |
| 29 | + * https://github.com/marklysze/AutoGenCodeTesting/blob/master/function_calling/function_calling_test.py |
| 30 | + * Autogen discord (alt-models channel) - https://discord.com/channels/1153072414184452236/1201369716057440287 |
| 31 | +* Articles on autoen function calling |
| 32 | + * https://medium.com/@sanjuvenky246/-3c15bfa077da |
| 33 | + * https://medium.com/@coldstart_coder/autogen-essentials-function-integration-for-smarter-agents-7c3b4a0fdc12 |
| 34 | + |
| 35 | +## Code & Data |
| 36 | +* [Toolbench](https://github.com/OpenBMB/ToolBench?tab=readme-ov-file#model) |
| 37 | +* [Stable Toolbench](https://github.com/THUNLP-MT/StableToolBench) |
| 38 | +* [APIGen from HF](https://huggingface.co/datasets/Salesforce/xlam-function-calling-60k) |
| 39 | + |
| 40 | +## Survey |
| 41 | +* [LLM tool survey](https://github.com/quchangle1/LLM-Tool-Survey) |
| 42 | + |
| 43 | +## Etc |
| 44 | +* [Openai function calling](https://platform.openai.com/docs/guides/function-calling) |
0 commit comments