socialfoundations / benchbench

BenchBench is a Python package to evaluate multi-task benchmarks.
12 · Updated 4 months ago

Related projects

Alternatives and complementary repositories for benchbench