8421BCD / Agentic-RLinks
☆36Updated this week
Alternatives and similar repositories for Agentic-R
Users that are interested in Agentic-R are comparing it to the libraries listed below
Sorting:
- Official repository for ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use☆26Updated 2 months ago
- ☆47Updated 3 months ago
- ☆70Updated 7 months ago
- Official Repository of Native Parallel Reasoner☆98Updated last week
- SSRL: Self-Search Reinforcement Learning☆204Updated 5 months ago
- A Recipe for Building LLM Reasoners to Solve Complex Instructions☆29Updated 3 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆124Updated 7 months ago
- JudgeLRM: Large Reasoning Models as a Judge☆40Updated last month
- Klear-Reasoner: Advancing Reasoning Capability via Gradient-Preserving Clipping Policy Optimization☆81Updated last month
- [NeurIPS 2025 D&B (Spotlight🌟)] TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenario☆29Updated 3 months ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆56Updated last month
- ☆50Updated 11 months ago
- [ACL 2025] A Generalizable and Purely Unsupervised Self-Training Framework☆71Updated 7 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant☆43Updated last year
- From Word to World: Can Large Language Models be Implicit Text-based World Models?☆36Updated last month
- ☆45Updated 7 months ago
- Official Repo for SwS: A Weakness-driven Problem Synthesis Framework in RL for LLM Reasoning☆42Updated 2 months ago
- ☆71Updated 3 months ago
- Process Reward Models That Think☆77Updated last month
- [NeurIPS 2025] Let LRMs Break Free from Overthinking via Self-Braking Tuning. https://arxiv.org/abs/2505.14604☆55Updated 2 months ago
- Scaling Agentic Environments Automatically.☆45Updated last month
- ☆43Updated 5 months ago
- Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"☆56Updated 3 weeks ago
- ☆57Updated 2 weeks ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆70Updated 8 months ago
- Code for Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation (EVOL-RL).☆42Updated 3 months ago
- Revisiting Mid-training in the Era of Reinforcement Learning Scaling☆182Updated 6 months ago
- Emergent Hierarchical Reasoning in LLMs/VLMs through Reinforcement Learning☆57Updated 3 months ago
- [NeurIPS 2025] The official repo of SynLogic: Synthesizing Verifiable Reasoning Data at Scale for Learning Logical Reasoning and Beyond☆190Updated 6 months ago
- MegaScience: Pushing the Frontiers of Post-Training Datasets for Science Reasoning☆111Updated 2 months ago