shehper / AC-Solver

A long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for reinforcement learning: A case study".
14Updated 3 weeks ago

Related projects

Alternatives and complementary repositories for AC-Solver