PRIME-RL / TTRLLinks
TTRL: Test-Time Reinforcement Learning
☆704Updated 3 weeks ago
Alternatives and similar repositories for TTRL
Users that are interested in TTRL are comparing it to the libraries listed below
Sorting:
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,438Updated 2 months ago
- ☆585Updated 3 months ago
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,023Updated 2 weeks ago
- Awesome RL Reasoning Recipes ("Triple R")☆745Updated last month
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆323Updated this week
- Large Reasoning Models☆805Updated 7 months ago
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling☆447Updated 2 weeks ago
- A series of technical report on Slow Thinking with LLM☆708Updated last month
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆244Updated 2 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆652Updated 5 months ago
- ☆270Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆267Updated 4 months ago
- Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models☆509Updated 2 weeks ago
- ☆186Updated this week
- [COLM 2025] LIMO: Less is More for Reasoning☆980Updated last week
- Explore the Multimodal “Aha Moment” on 2B Model☆596Updated 3 months ago
- Official Implementation for the paper "d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning"☆240Updated 2 weeks ago
- Awesome RL-based LLM Reasoning☆561Updated 2 months ago
- Official implementation of paper: SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training☆284Updated 2 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆637Updated last month
- ☆824Updated 2 weeks ago
- ☆304Updated last month
- ☆241Updated last month
- SkyRL: A Modular Full-stack RL Library for LLMs☆603Updated this week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆588Updated last month
- Latest Advances on System-2 Reasoning☆1,176Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆228Updated 2 months ago
- Official Repository of "Learning to Reason under Off-Policy Guidance"☆251Updated last month
- repo for paper https://arxiv.org/abs/2504.13837☆173Updated 2 weeks ago
- Official Repo for Open-Reasoner-Zero☆1,990Updated last month