☆34May 24, 2025Updated 10 months ago
Alternatives and similar repositories for StepTool
Users that are interested in StepTool are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆30Apr 2, 2025Updated 11 months ago
- ☆25Sep 1, 2025Updated 6 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆54Jun 6, 2025Updated 9 months ago
- ☆28Jun 5, 2025Updated 9 months ago
- (ACL 2025) Divide-Then-Aggregate: An Efficient Tool Learning Method via Parallel Tool Invocation☆12May 21, 2025Updated 10 months ago
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆222Apr 15, 2025Updated 11 months ago
- (ACL2025 Findings) Official code for the paper "STeCa: Step-level Trajectory Calibration for LLM Agent Learning"☆26Mar 2, 2026Updated 3 weeks ago
- ☆27Jul 11, 2024Updated last year
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆202Apr 17, 2025Updated 11 months ago
- ☆39May 2, 2024Updated last year
- A Pytorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆66Oct 18, 2024Updated last year
- ☆15Feb 21, 2024Updated 2 years ago
- ☆34Jul 13, 2023Updated 2 years ago
- [NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…☆24May 2, 2025Updated 10 months ago
- Codes for Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback (ACL 2024 Findings)☆16Jul 2, 2024Updated last year
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023)☆18Oct 17, 2023Updated 2 years ago
- ☆172Oct 29, 2025Updated 4 months ago
- [EMNLP 2024] Source code for the paper "Learning Planning-based Reasoning with Trajectory Collection and Process Rewards Synthesizing".☆83Jan 14, 2025Updated last year
- [NAACL 2024] Making Language Models Better Tool Learners with Execution Feedback☆43Mar 14, 2024Updated 2 years ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆31Dec 24, 2025Updated 3 months ago
- The repository for ACL 2024 paper "TimeBench: A Comprehensive Evaluation of Temporal Reasoning Abilities in Large Language Models"☆34Jun 29, 2024Updated last year
- ☆13Jan 14, 2026Updated 2 months ago
- A comrephensive collection of learning from rewards in the post-training and test-time scaling of LLMs, with a focus on both reward model…☆64Jun 13, 2025Updated 9 months ago
- Official code for "MAmmoTH2: Scaling Instructions from the Web" [NeurIPS 2024]☆149Oct 27, 2024Updated last year
- [TMLR] Triple Preference Optimization☆30Feb 19, 2025Updated last year
- ☆459Oct 16, 2025Updated 5 months ago
- Multi-turn RL framework for aligning models to be tutors instead of answerers. EMNLP 2025 Oral☆33Dec 11, 2025Updated 3 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆137Feb 16, 2026Updated last month
- ☆31May 8, 2025Updated 10 months ago
- A curated list of cutting-edge research papers and resources on Long Chain-of-Thought (CoT) Reasoning with Tools.☆46Dec 17, 2025Updated 3 months ago
- ☆13May 13, 2025Updated 10 months ago
- Source code for our paper: "LoGU: Long-form Generation with Uncertainty Expressions".☆17May 27, 2025Updated 9 months ago
- ☆919Jul 24, 2024Updated last year
- Code for Findings of ACL 2021 paper "Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain …☆19Dec 16, 2022Updated 3 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆11Nov 19, 2024Updated last year
- Code repository for the BMVC 2022 paper: Geometry Driven Progressive Warping for One Shot Face Animation☆12Jan 6, 2023Updated 3 years ago
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆76Aug 20, 2025Updated 7 months ago
- using pvanet framework train mobilenet-v2 for objects detection, papaer: https://arxiv.org/abs/1611.08588☆13Feb 13, 2019Updated 7 years ago