TheDuckAI / arb

Advanced Reasoning Benchmark Dataset for LLMs
45 · Updated 10 months ago

Related projects: