liziniu / ReMax

Code for the paper "ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models"
150 · Updated 10 months ago

Related projects

Alternatives and complementary repositories for ReMax