A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.
☆221Apr 15, 2025Updated 11 months ago
Alternatives and similar repositories for StableToolBench
Users that are interested in StableToolBench are comparing it to the libraries listed below
Sorting:
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆73May 13, 2025Updated 10 months ago
- [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.☆5,559May 21, 2025Updated 9 months ago
- [ICLR'24] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use