Junjie-Ye / RoTBench

[EMNLP 2024] RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning
13Updated 5 months ago

Alternatives and similar repositories for RoTBench:

Users that are interested in RoTBench are comparing it to the libraries listed below