PRIME-RL / PRIME
Scalable RL solution for advanced reasoning of language models
☆1,410 Updated this week
Alternatives and similar repositories for PRIME:
Users who are interested in PRIME are comparing it to the libraries listed below.
- Large Reasoning Models ☆799 Updated 3 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,475 Updated 2 weeks ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models ☆1,727 Updated 2 months ago
- ☆908 Updated 2 months ago
- A series of technical report on Slow Thinking with LLM ☆581 Updated this week
- Recipes to scale inference-time compute of open models ☆1,041 Updated 3 weeks ago
- ☆1,347 Updated 4 months ago
- O1 Replication Journey ☆1,977 Updated 2 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward ☆850 Updated last month
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data … ☆659 Updated this week
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024) ☆595 Updated 2 months ago
- DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models ☆1,599 Updated last year
- ☆1,011 Updated 3 months ago
- ☆504 Updated 2 months ago