KwanWaiChung / MT-Eval

Code and data for "MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models"
28Updated 6 months ago

Related projects: