lfy79001 / S3Eval

[NAACL 2024] A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
33Updated 5 months ago

Related projects

Alternatives and complementary repositories for S3Eval