ConsequentAI / fneval

Functional Benchmarks and the Reasoning Gap
78Updated last month

Related projects

Alternatives and complementary repositories for fneval