mazzzystar / TurtleBenchmark

Benchmark for LLM Reasoning & Understanding with Challenging Tasks from Real Users.
103Updated this week

Related projects: