mtbench101 / mt-bench-101View on GitHub
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
142Jul 24, 2024Updated last year

Alternatives and similar repositories for mt-bench-101

Users that are interested in mt-bench-101 are comparing it to the libraries listed below

Sorting:

Are these results useful?