☆287Aug 12, 2025Updated 6 months ago
Alternatives and similar repositories for ReTool
Users that are interested in ReTool are comparing it to the libraries listed below
Sorting:
- ☆444Oct 16, 2025Updated 4 months ago
- ☆335May 24, 2025Updated 9 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,328May 16, 2025Updated 9 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- ☆223Jun 2, 2025Updated 9 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,085Nov 13, 2025Updated 3 months ago
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆319Jan 3, 2026Updated 2 months ago
- ☆115Jun 11, 2025Updated 8 months ago
- A version of verl to support diverse tool use☆879Feb 19, 2026Updated last week
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,522Updated this week
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆358Jan 12, 2026Updated last month
- ☆932Dec 11, 2025Updated 2 months ago
- a survey on deep research☆47Sep 9, 2025Updated 5 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆689Aug 5, 2025Updated 6 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,246Feb 12, 2026Updated 2 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆705Oct 15, 2025Updated 4 months ago
- R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning☆29Feb 9, 2026Updated 3 weeks ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"☆13Dec 13, 2024Updated last year
- SkyRL: A Modular Full-stack RL Library for LLMs☆1,628Updated this week
- The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.☆421Jul 11, 2025Updated 7 months ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆51Jan 23, 2026Updated last month
- ☆255Jan 3, 2026Updated 2 months ago
- [EMNLP 2025] WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning☆75Nov 4, 2025Updated 3 months ago
- SimpleDeepSearcher: Deep Information Seeking via Web-Powered Reasoning Trajectory Synthesis☆118Jun 3, 2025Updated 9 months ago
- ☆352Jul 29, 2025Updated 7 months ago
- Code for our paper Resources and Evaluations for Multi-Distribution Dense Information Retrieval☆16Jan 16, 2024Updated 2 years ago
- Kaggle AIMO2 solution with token-efficient reasoning LLM recipes☆42Aug 7, 2025Updated 6 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆510Jun 6, 2025Updated 8 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆51Jan 21, 2026Updated last month
- EMNLP MAIN 2025 StepSearch: Igniting LLMs Search Ability via Step-Wise Proximal Policy Optimization☆59Sep 13, 2025Updated 5 months ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆19,519Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆261May 5, 2025Updated 9 months ago
- A Gym for Agentic LLMs☆452Jan 21, 2026Updated last month
- EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL☆4,649Updated this week
- [EMNLP 2025] Code for paper "Table-R1: Inference-Time Scaling for Table Reasoning"☆29Jun 3, 2025Updated 9 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,586Updated this week
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,739May 11, 2025Updated 9 months ago
- Trinity-RFT is a general-purpose, flexible and scalable framework designed for reinforcement fine-tuning (RFT) of large language models (…☆533Updated this week
- [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning☆998Feb 23, 2026Updated last week