GAIR-NLP / scaleeval

Scalable Meta-Evaluation of LLMs as Evaluators
39Updated 7 months ago

Related projects: