lfy79001 / S3Eval

A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models
32Updated 4 months ago

Related projects

Alternatives and complementary repositories for S3Eval