casmlab / NPHardEval

Repository for NPHardEval, a quantified-dynamic benchmark of LLMs
48 · Updated 7 months ago

Related projects

Alternatives and complementary repositories for NPHardEval