Laoyu84 / 4onebench

A minimalist benchmarking tool designed to test the routine-generation capabilities of LLMs.
15 · Updated last week

Related projects

Alternatives and complementary repositories for 4onebench