night-chen / ToolQA

ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.
251Updated last year

Alternatives and similar repositories for ToolQA:

Users that are interested in ToolQA are comparing it to the libraries listed below