ChenxinAn-fdu / POLARISLinks
Scaling RL on advanced reasoning models
☆632Updated last month
Alternatives and similar repositories for POLARIS
Users that are interested in POLARIS are comparing it to the libraries listed below
Sorting:
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆451Updated 6 months ago
- [NeurIPS 2025 Spotlight] ReasonFlux (long-CoT), ReasonFlux-PRM (process reward model) and ReasonFlux-Coder (code generation)☆501Updated last month
- An Open-Source Large-Scale Reinforcement Learning Project for Search Agents☆492Updated last month
- [NeurIPS 2025] Reinforcement Learning for Reasoning in Large Language Models with One Training Example☆376Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆274Updated 9 months ago
- ☆326Updated 5 months ago
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆273Updated last month
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆257Updated 6 months ago
- ☆268Updated 2 months ago
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆785Updated 3 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen…☆495Updated 2 months ago
- Code for the paper: "Learning to Reason without External Rewards"☆375Updated 4 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆186Updated 4 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆250Updated 6 months ago
- ☆817Updated 5 months ago
- General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]☆202Updated 3 weeks ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆268Updated last month
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.☆490Updated 2 months ago
- ☆301Updated 5 months ago
- ☆231Updated 3 months ago
- A version of verl to support diverse tool use☆701Updated this week
- Tina: Tiny Reasoning Models via LoRA☆305Updated 2 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆318Updated 2 months ago
- ☆172Updated 6 months ago
- ☆341Updated 3 months ago
- ☆212Updated 9 months ago
- 📖 This is a repository for organizing papers, codes, and other resources related to Latent Reasoning.☆277Updated 2 weeks ago
- Official implementation of the NeurIPS 2025 paper "Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space"☆277Updated last week
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆282Updated last month
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆477Updated 5 months ago