WooooDyy / LLM-Reverse-Curriculum-RL

Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.
β˜†75Updated 9 months ago

Related projects β“˜

Alternatives and complementary repositories for LLM-Reverse-Curriculum-RL