open-compass / CriticEval

[NeurIPS 2024] A comprehensive benchmark for evaluating the critique ability of LLMs
29 · Updated last week

Related projects

Alternatives and complementary repositories for CriticEval