☆223Jun 2, 2025Updated 9 months ago
Alternatives and similar repositories for Tool-N1
Users that are interested in Tool-N1 are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Jan 9, 2024Updated 2 years ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆263May 5, 2025Updated 10 months ago
- ☆293Aug 12, 2025Updated 6 months ago
- ☆449Oct 16, 2025Updated 4 months ago
- ☆335May 24, 2025Updated 9 months ago
- Regularly Truncated M-estimators for Learning with Noisy Labels☆11Apr 24, 2024Updated last year
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,338May 16, 2025Updated 9 months ago
- A version of verl to support diverse tool use☆889Updated this week
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- ICCV'2023: Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples☆12Oct 16, 2023Updated 2 years ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 8 months ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 10 months ago
- Scalable toolkit for efficient model reinforcement☆1,372Updated this week
- ☆72Jun 10, 2025Updated 8 months ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆358Jan 12, 2026Updated last month
- Optimizing Anytime Reasoning via Budget Relative Policy Optimization☆52Jul 15, 2025Updated 7 months ago
- ☆16Sep 27, 2023Updated 2 years ago
- [CVPR 2025] VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning☆14Jun 7, 2025Updated 9 months ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆98Apr 9, 2025Updated 10 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Apr 11, 2025Updated 10 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆106Sep 18, 2025Updated 5 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- ☆104Dec 6, 2024Updated last year
- ☆265May 14, 2025Updated 9 months ago
- ☆320Sep 18, 2024Updated last year
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆49Updated this week
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 6 months ago
- Official Implementation for the paper "Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base"☆27Sep 2, 2025Updated 6 months ago
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆19Aug 14, 2024Updated last year
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,527Feb 27, 2026Updated last week
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,656Updated this week
- [ICLR 2026] Efficient Agent Training for Computer Use☆138Sep 5, 2025Updated 6 months ago
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Jan 28, 2023Updated 3 years ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 9 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- ☆813Jun 9, 2025Updated 8 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆709Oct 15, 2025Updated 4 months ago