Laoyu84 / 4onebench

A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.
17Updated 2 weeks ago

Related projects

Alternatives and complementary repositories for 4onebench