zhentingqi / rStar
☆964Updated 9 months ago
Alternatives and similar repositories for rStar
Users that are interested in rStar are comparing it to the libraries listed below
- Large Reasoning Models☆805Updated 10 months ago
- A series of technical reports on Slow Thinking with LLM☆744Updated 2 months ago
- ☆981Updated 3 months ago
- [ICLR 2025] Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. Your efficient and high-quality synthetic data …☆782Updated 7 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆674Updated 9 months ago
- OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models☆1,823Updated 9 months ago
- [COLM 2025] LIMO: Less is More for Reasoning☆1,038Updated 3 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆924Updated 8 months ago
- Code for Quiet-STaR☆739Updated last year
- ☆548Updated 10 months ago
- Scalable RL solution for advanced reasoning of language models☆1,755Updated 7 months ago
- O1 Replication Journey☆2,003Updated 9 months ago
- An Open Large Reasoning Model for Real-World Solutions☆1,524Updated 5 months ago
- ☆1,349Updated 11 months ago
- AN O1 REPLICATION FOR CODING☆336Updated 10 months ago
- Recipes to scale inference-time compute of open models☆1,114Updated 5 months ago
- FuseAI Project☆583Updated 9 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆467Updated 4 months ago
- ☆1,035Updated 10 months ago
- LongBench v2 and LongBench (ACL 25'&24')☆1,005Updated 9 months ago
- OLMoE: Open Mixture-of-Experts Language Models☆898Updated last month
- ☆1,320Updated last month
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆649Updated 2 months ago
- ☆342Updated 4 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆631Updated 2 weeks ago
- [NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models☆668Updated 4 months ago
- RewardBench: the first evaluation tool for reward models.☆646Updated 4 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux Series - ReasonFlux, ReasonFlux-PRM and ReasonFlux-Coder☆494Updated last month
- ☆749Updated last month
- Understanding R1-Zero-Like Training: A Critical Perspective☆1,132Updated 2 months ago