TIGER-AI-Lab / MMLU-Pro

The code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]
129 · Updated this week

Related projects

Alternatives and complementary repositories for MMLU-Pro