☆32May 24, 2025Updated 9 months ago
Alternatives and similar repositories for StepTool
Users that are interested in StepTool are comparing it to the libraries listed below
Sorting:
- FamilyTool benchmark☆12Sep 10, 2025Updated 5 months ago
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated 9 months ago
- UnOfficial Gradio Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Y…☆16Sep 30, 2024Updated last year
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Jul 2, 2024Updated last year
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- Resources and paper list for 'Scaling Environments for Agents'. This repository accompanies our survey on how environments contribute to …☆62Jan 28, 2026Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆202Apr 17, 2025Updated 10 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆53Jun 6, 2025Updated 8 months ago
- ☆27Jul 11, 2024Updated last year
- ☆28Aug 25, 2024Updated last year
- ☆27Jun 5, 2025Updated 8 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆66Oct 18, 2024Updated last year
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency"☆32Apr 12, 2025Updated 10 months ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- ☆444Oct 16, 2025Updated 4 months ago
- ☆31May 8, 2025Updated 9 months ago
- ☆39May 2, 2024Updated last year
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- Repo for "Centaur: Robust Multimodal Fusion for Human Activity Recognition"☆10Jan 9, 2024Updated 2 years ago
- This is a simple example of how to run the android ADK feature on a basic Arduino Uno with USB Host Shield.☆14May 24, 2011Updated 14 years ago
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆43Mar 14, 2024Updated last year
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆83Jan 14, 2025Updated last year
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆67Sep 13, 2025Updated 5 months ago
- [ICLR'26] Stronger-MAS: A RL Framework for multi LLM agent system☆109Updated this week
- ☆13Nov 5, 2024Updated last year
- the implementation of the algorithm for johnson, cds and neh☆10Nov 28, 2017Updated 8 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆10Nov 19, 2024Updated last year
- ☆13May 13, 2025Updated 9 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆113Jun 13, 2025Updated 8 months ago
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- Lucene Search Module for Magento☆22Oct 10, 2010Updated 15 years ago
- ☆53Feb 19, 2025Updated last year
- ☆171Oct 29, 2025Updated 4 months ago
- ☆36Feb 12, 2026Updated 2 weeks ago
- ☆11Dec 22, 2018Updated 7 years ago
- This is a repository for paper titled, PlaSma: Making Small Language Models Better Procedural Knowledge Models for (Counterfactual) Plann…☆14Nov 3, 2023Updated 2 years ago
- Software stack based on ROS for the AutoMiny model cars☆15Oct 13, 2025Updated 4 months ago
- Android app to take geotagged photos and upload them to GeoCam Share☆21Jan 25, 2011Updated 15 years ago