RUC-NLPIR / Tool-StarView external linksLinks
🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
☆317Jan 3, 2026Updated last month
Alternatives and similar repositories for Tool-Star
Users that are interested in Tool-Star are comparing it to the libraries listed below
Sorting:
- ☆23Jan 9, 2026Updated last month
- [ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)☆881Jan 28, 2026Updated 2 weeks ago
- ☆16Sep 17, 2024Updated last year
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆35Oct 9, 2025Updated 4 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆32Jul 25, 2025Updated 6 months ago
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆25Jun 27, 2025Updated 7 months ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆25May 29, 2025Updated 8 months ago
- The demo, code and data of FollowRAG☆75Jun 30, 2025Updated 7 months ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆354Jan 12, 2026Updated last month
- A version of verl to support diverse tool use☆868Jan 6, 2026Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆261May 5, 2025Updated 9 months ago
- ☆283Aug 12, 2025Updated 6 months ago
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆63Oct 23, 2025Updated 3 months ago
- ☆67Aug 14, 2025Updated 6 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,317May 16, 2025Updated 8 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 3 months ago
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆76Sep 12, 2025Updated 5 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,511Jan 25, 2026Updated 2 weeks ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 6 months ago
- ☆14Dec 18, 2024Updated last year
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆22May 31, 2025Updated 8 months ago
- ☆23Jul 29, 2025Updated 6 months ago
- ☆19Mar 10, 2025Updated 11 months ago
- ☆138Nov 17, 2025Updated 2 months ago
- ☆58Feb 27, 2025Updated 11 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,164Nov 17, 2025Updated 2 months ago
- ☆17Aug 1, 2025Updated 6 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆3,975Nov 13, 2025Updated 3 months ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- ☆11Aug 13, 2024Updated last year
- ☆21Nov 27, 2025Updated 2 months ago
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆44Jul 10, 2025Updated 7 months ago
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆33Aug 12, 2025Updated 6 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Jan 8, 2026Updated last month
- Official code for DeepSound-V1☆13May 14, 2025Updated 9 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆24Oct 7, 2025Updated 4 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆53Aug 28, 2025Updated 5 months ago
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆64May 21, 2025Updated 8 months ago