8421BCD / Agentic-RLinks
☆53Updated last week
Alternatives and similar repositories for Agentic-R
Users that are interested in Agentic-R are comparing it to the libraries listed below
Sorting:
- When Reasoning Meets Its Laws☆35Updated last month
- Open-source Agentic RL for LLMs — RLAnything & DemyAgent☆223Updated this week
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last week
- ☆50Updated 11 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆124Updated 7 months ago
- SSRL: Self-Search Reinforcement Learning☆206Updated 5 months ago
- A holistic benchmark for LLM abstention☆69Updated 5 months ago
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆29Updated 3 months ago
- qqr is an RL training framework for open-ended agents.☆205Updated 2 weeks ago
- [NeurIPS'25] Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning☆115Updated last month
- [ACL 2025] Knowledge Unlearning for Large Language Models☆48Updated 4 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆112Updated this week
- ☆134Updated last week
- Official implementation of paper "Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models"☆65Updated 3 weeks ago
- ☆43Updated 8 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆59Updated 3 months ago
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated last month
- [EMNLP 2025 Main] AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time☆89Updated 7 months ago
- The official repository of paper "Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models''☆110Updated 5 months ago
- [NeurIPS 2025@FoRLM] R1-Compress: Long Chain-of-Thought Compression via Chunk Compression and Search☆17Updated 2 weeks ago
- Process Reward Models That Think☆78Updated 2 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆44Updated last year
- Verifiers for LLM Reinforcement Learning☆80Updated 9 months ago
- ☆144Updated 9 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆71Updated 8 months ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆88Updated 7 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 3 months ago
- Official Code Release for "Training a Generally Curious Agent"☆44Updated 8 months ago
- Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge☆104Updated last week
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 8 months ago