open-compass / CriticEval
[NeurIPS 2024] A comprehensive benchmark for evaluating critique ability of LLMs
49 · Nov 29, 2024 · Updated last year

Alternatives and similar repositories for CriticEval

Users who are interested in CriticEval are comparing it to the libraries listed below.
