sierra-research / tau2-benchView on GitHub
τ²-Bench: Evaluating Conversational Agents in a Dual-Control Environment
800Feb 11, 2026Updated 3 weeks ago

Alternatives and similar repositories for tau2-bench

Users that are interested in tau2-bench are comparing it to the libraries listed below

Sorting:

Are these results useful?