☆36May 24, 2025Updated last year
Alternatives and similar repositories for StepTool
Users who are interested in StepTool are comparing it to the libraries listed below.
- Unofficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Y…☆16Sep 30, 2024Updated last year
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆30Aug 9, 2025Updated 10 months ago
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆235Apr 15, 2025Updated last year
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆28Mar 2, 2026Updated 3 months ago
- ☆29Jun 5, 2025Updated last year
- arXiv 2024 | ZIP: entropy-law data selection for efficient LLM alignment.☆28Updated this week
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆206Apr 17, 2025Updated last year
- ☆39May 2, 2024Updated 2 years ago
- A dataset for training and evaluating LLMs on decision making about "when (not) to call" functions☆64Apr 29, 2025Updated last year
- A PyTorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆68Oct 18, 2024Updated last year
- ☆15Feb 21, 2024Updated 2 years ago
- ☆36May 24, 2026Updated 3 weeks ago
- [NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…☆25May 2, 2025Updated last year
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆69Jan 28, 2026Updated 4 months ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆84Jan 14, 2025Updated last year
- ☆187Oct 29, 2025Updated 7 months ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆44Mar 14, 2024Updated 2 years ago
- ☆13Jan 14, 2026Updated 5 months ago
- A user-friendly & efficient knowledge distillation framework for LLMs, supporting off-policy, on-policy (OPD), cross-tokenizer, multimoda…☆195Jun 5, 2026Updated last week
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆35Jun 29, 2024Updated last year
- A comprehensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆71Jun 13, 2025Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- Web crawler for the Tsinghua University graduate social practice system☆17Jun 4, 2024Updated 2 years ago
- ☆503Oct 16, 2025Updated 7 months ago
- [NeurIPS 2024 D&B] GTA: A Benchmark for General Tool Agents & [arXiv 2026] GTA-2☆145Apr 20, 2026Updated last month
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆164Oct 30, 2024Updated last year
- ☆32May 8, 2025Updated last year
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆47Dec 17, 2025Updated 5 months ago
- Source code for our paper: "LoGU: Long-form Generation with Uncertainty Expressions".☆19May 27, 2025Updated last year
- ☆13May 13, 2025Updated last year
- ☆922Jul 24, 2024Updated last year
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain …☆19Dec 16, 2022Updated 3 years ago
- VehicleWorld is the first comprehensive multi-device environment for intelligent vehicle interaction that accurately models the complex, …☆22Sep 16, 2025Updated 8 months ago
- This is the repository for the Tool Learning survey.☆484Aug 9, 2025Updated 10 months ago
- ☆29Aug 25, 2024Updated last year
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago