KwanWaiChung / MT-Eval

Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"
31Updated last month

Related projects

Alternatives and complementary repositories for MT-Eval