NVlabs/Tool-N1

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVlabs/Tool-N1)

NVlabs / Tool-N1

☆231

Alternatives and similar repositories for Tool-N1

Users that are interested in Tool-N1 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

qiancheng0 / ToolRL
View on GitHub
☆514Oct 16, 2025Updated 9 months ago
xiaoboxia / PICMM
View on GitHub
NeurIPS'2022: Pluralistic Image Completion with Gaussian Mixture Models
☆14Jan 28, 2023Updated 3 years ago
eraseai / erase
View on GitHub
[CIKM-2024] Official code for work "ERASE: Error-Resilient Representation Learning on Graphs for Label Noise Tolerance"
☆20Aug 14, 2024Updated last year
xiaoboxia / CoDis
View on GitHub
ICCV'2023: Combating Noisy Labels with Sample Selection by Mining High-Discrepancy Examples
☆12Oct 16, 2023Updated 2 years ago
ltzheng / SimpleTIR
View on GitHub
[ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning
☆401Mar 30, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TIGER-AI-Lab / verl-tool
View on GitHub
A version of verl to support diverse tool use [TMLR 2026]
☆1,026Jul 15, 2026Updated last week
GAIR-NLP / ToRL
View on GitHub
☆352May 24, 2025Updated last year
Hao-tian-Zheng / ATOL
View on GitHub
☆17Dec 7, 2023Updated 2 years ago
GAIR-NLP / OctoThinker
View on GitHub
Revisiting Mid-training in the Era of Reinforcement Learning Scaling
☆189Jul 23, 2025Updated last year
chenchen0103 / ACEBench
View on GitHub
☆188Oct 29, 2025Updated 9 months ago
bespokelabsai / verifiers
View on GitHub
Verifiers for LLM Reinforcement Learning
☆81Jul 17, 2026Updated last week
HKUNLP / critic-rl
View on GitHub
[ICML 2025] Teaching Language Models to Critique via Reinforcement Learning
☆127May 6, 2025Updated last year
Agent-RL / ReCall
View on GitHub
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…
☆1,425May 16, 2025Updated last year
facebookresearch / sweet_rl
View on GitHub
Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks
☆271May 5, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mll-lab-nu / RAGEN
View on GitHub
RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.
☆2,757Updated this week
ritikamangla / QSalience
View on GitHub
https://arxiv.org/abs/2404.10917
☆14Mar 18, 2025Updated last year
ChenxinAn-fdu / POLARIS
View on GitHub
Scaling RL on advanced reasoning models
☆692Oct 20, 2025Updated 9 months ago
xiaoboxia / RTM_LNL
View on GitHub
Regularly Truncated M-estimators for Learning with Noisy Labels
☆11Apr 24, 2024Updated 2 years ago
SalesforceAIResearch / PretrainRL-pipeline
View on GitHub
An automated data pipeline scaling RL to pretraining levels
☆76Jun 2, 2026Updated last month
yongchao98 / R1-Code-Interpreter
View on GitHub
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning
☆45Feb 9, 2026Updated 5 months ago
inclusionAI / AWorld-RL
View on GitHub
Agentic Learning Powered by AWorld
☆119Jun 18, 2026Updated last month
ReTool-RL / ReTool
View on GitHub
☆387Aug 12, 2025Updated 11 months ago
RM-R1-UIUC / RM-R1
View on GitHub
[ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models
☆167Jun 26, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
NovaSky-AI / SkyRL
View on GitHub
SkyRL: A Modular Full-stack RL Library for LLMs
☆2,102Updated this week
Simple-Efficient / RL-Factory
View on GitHub
Train your Agent model via our easy and efficient framework
☆1,773Dec 5, 2025Updated 7 months ago
PeterGriffinJin / Search-R1
View on GitHub
Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL
☆5,170Nov 13, 2025Updated 8 months ago
tmlr-group / RGIB
View on GitHub
[NeurIPS 2023] "Combating Bilateral Edge Noise for Robust Link Prediction"
☆11Nov 3, 2023Updated 2 years ago
NVIDIA-NeMo / RL
View on GitHub
Scalable toolkit for efficient model reinforcement
☆1,855Updated this week
RUC-NLPIR / ARPO
View on GitHub
[ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)
☆1,093Jul 13, 2026Updated 2 weeks ago
GAIR-NLP / DeepResearcher
View on GitHub
Scaling Deep Research via Reinforcement Learning in Real-world Environments.
☆784May 10, 2026Updated 2 months ago
TIGER-AI-Lab / General-Reasoner
View on GitHub
General Reasoner: Advancing LLM Reasoning Across All Domains [NeurIPS25]
☆229Nov 27, 2025Updated 8 months ago
ChengpengLi1003 / CoRT
View on GitHub
☆72Oct 23, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
langfengQ / verl-agent
View on GitHub
verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…
☆2,158Jun 9, 2026Updated last month
skzhang1 / IDEAL
View on GitHub
IDEAL: Influence-Driven Selective Annotations Empower In-Context Learners in Large Language Models
☆59Jan 19, 2024Updated 2 years ago
ByteDance-Seed / Seed-1.8
View on GitHub
☆219Dec 19, 2025Updated 7 months ago
llyx97 / sparse-and-robust-PLM
View on GitHub
[NeurIPS 2022] "A Win-win Deal: Towards Sparse and Robust Pre-trained Language Models", Yuanxin Liu, Fandong Meng, Zheng Lin, Jiangnan Li…
☆21Jan 9, 2024Updated 2 years ago
zorazrw / agent-skill-induction
View on GitHub
Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks"
☆42Apr 24, 2025Updated last year
huggingface / Math-Verify
View on GitHub
☆1,172Jan 10, 2026Updated 6 months ago
test-time-interaction / TTI
View on GitHub
☆76Jun 10, 2025Updated last year