NVlabs / Tool-N1View external linksLinks
☆219Jun 2, 2025Updated 8 months ago
Alternatives and similar repositories for Tool-N1
Users that are interested in Tool-N1 are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…☆21Jan 9, 2024Updated 2 years ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆261May 5, 2025Updated 9 months ago
- ☆432Oct 16, 2025Updated 3 months ago
- ☆334May 24, 2025Updated 8 months ago
- ☆283Aug 12, 2025Updated 6 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Updated this week
- Regularly Truncated M-estimators for Learning with Noisy Labels☆11Apr 24, 2024Updated last year
- A version of verl to support diverse tool use☆868Jan 6, 2026Updated last month
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,317May 16, 2025Updated 8 months ago
- ICCV'2023: Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples☆12Oct 16, 2023Updated 2 years ago
- RM-R1: Unleashing the Reasoning Potential of Reward Models☆159Jun 26, 2025Updated 7 months ago
- Verifiers for LLM Reinforcement Learning☆80Apr 15, 2025Updated 9 months ago
- ☆72Jun 10, 2025Updated 8 months ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆354Jan 12, 2026Updated last month
- ☆16Sep 27, 2023Updated 2 years ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25]☆96Apr 9, 2025Updated 10 months ago
- [EMNLP'25 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆68Apr 11, 2025Updated 10 months ago
- [NeurIPS 2025] NoisyRollout: Reinforcing Visual Reasoning with Data Augmentation☆104Sep 18, 2025Updated 4 months ago
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆19Oct 14, 2024Updated last year
- The original Shared Recurrent Memory Transformer implementation☆33Jul 11, 2025Updated 7 months ago
- ☆104Dec 6, 2024Updated last year
- ☆263May 14, 2025Updated 9 months ago
- ☆321Sep 18, 2024Updated last year
- [CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"☆19Aug 14, 2024Updated last year
- Official Implementation for the paper "Discovering Knowledge Deficiencies of Language Models on Massive Knowledge Base"☆27Sep 2, 2025Updated 5 months ago
- UQ: Assessing Language Models on Unsolved Questions☆30Aug 26, 2025Updated 5 months ago
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Oct 3, 2024Updated last year
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,571Updated this week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,511Jan 25, 2026Updated 3 weeks ago
- [ICLR 2026] Efficient Agent Training for Computer Use☆137Sep 5, 2025Updated 5 months ago
- NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models☆14Jan 28, 2023Updated 3 years ago
- ☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models☆19Jun 4, 2025Updated 8 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆699Oct 15, 2025Updated 3 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Aug 9, 2023Updated 2 years ago
- ☆813Jun 9, 2025Updated 8 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆125Jun 11, 2025Updated 8 months ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆120May 6, 2025Updated 9 months ago
- Evaluation framework for paper "VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?"☆63Oct 19, 2024Updated last year
- Training VLM agents with multi-turn reinforcement learning☆404Updated this week