🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning
☆320Jan 3, 2026Updated 2 months ago
Alternatives and similar repositories for Tool-Star
Users that are interested in Tool-Star are comparing it to the libraries listed below
Sorting:
- ☆23Jan 9, 2026Updated last month
- [ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)☆892Jan 28, 2026Updated last month
- ☆16Sep 17, 2024Updated last year
- OmniGAIA: Towards Native Omni-Modal AI Agents☆46Updated this week
- HierSearch: A Hierarchical Enterprise Deep Search Framework Integrating Local and Web Searches☆37Oct 9, 2025Updated 4 months ago
- Code repo for "Harnessing Negative Signals: Reinforcement Distillation from Teacher Data for LLM Reasoning"☆33Jul 25, 2025Updated 7 months ago
- [ICML2025] Official Code of From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection☆25Jun 27, 2025Updated 8 months ago
- This repository contains the code for our ICML 2025 paper——LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection🎉☆26May 29, 2025Updated 9 months ago
- The demo, code and data of FollowRAG☆75Jun 30, 2025Updated 8 months ago
- [ICLR 2026] End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning☆358Jan 12, 2026Updated last month
- A version of verl to support diverse tool use☆889Updated this week
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆261May 5, 2025Updated 10 months ago
- RAG methods, benchmarks, and toolkits☆19Nov 28, 2024Updated last year
- ☆293Aug 12, 2025Updated 6 months ago
- The code and data of DPA-RAG, accepted by WWW 2025 main conference.☆63Oct 23, 2025Updated 4 months ago
- ☆67Aug 14, 2025Updated 6 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Rei…☆1,328May 16, 2025Updated 9 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 4 months ago
- 🏆 Official implementation of LangCoop: Collaborative Driving with Natural Language☆77Sep 12, 2025Updated 5 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,522Feb 27, 2026Updated last week
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 6 months ago
- VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection☆25May 31, 2025Updated 9 months ago
- ☆26Jul 29, 2025Updated 7 months ago
- ☆14Dec 18, 2024Updated last year
- ☆19Mar 10, 2025Updated 11 months ago
- ☆139Nov 17, 2025Updated 3 months ago
- ☆58Feb 27, 2025Updated last year
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,178Nov 17, 2025Updated 3 months ago
- ☆17Aug 1, 2025Updated 7 months ago
- Search-R1: An Efficient, Scalable RL Training Framework for Reasoning & Search Engine Calling interleaved LLM based on veRL☆4,085Nov 13, 2025Updated 3 months ago
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- ☆21Nov 27, 2025Updated 3 months ago
- ☆11Aug 13, 2024Updated last year
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i…☆45Jul 10, 2025Updated 7 months ago
- Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments☆48Jan 8, 2026Updated last month
- [ACL'25] UTBoost: Rigorous Evaluation of Coding Agents on SWE-Bench☆35Aug 12, 2025Updated 6 months ago
- Official code for DeepSound-V1☆13May 14, 2025Updated 9 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 4 months ago
- 🔍 Awesome Agentic Search is a curated list of papers, tools, and resources on agentic search—where AI agents plan, search, and reason to…☆54Aug 28, 2025Updated 6 months ago