thu-coai / SPaR
☆35 Updated 3 weeks ago
Alternatives and similar repositories for SPaR:
Users who are interested in SPaR are comparing it to the libraries listed below.
- ☆61 Updated 2 months ago
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking ☆44 Updated 3 weeks ago
- NaturalCodeBench (Findings of ACL 2024) ☆61 Updated 2 months ago
- ☆48 Updated 10 months ago
- ☆81 Updated 8 months ago
- Automatic prompt optimization framework for multi-step agent tasks. ☆26 Updated last month
- ☆95 Updated last month
- ☆23 Updated 3 months ago
- Source code of "Reasons to Reject? Aligning Language Models with Judgments" ☆57 Updated 10 months ago
- Official implementation of the paper "From Complex to Simple: Enhancing Multi-Constraint Complex Instruction Following Ability of Large L… ☆42 Updated 6 months ago
- ☆25 Updated last month
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models ☆23 Updated 2 months ago
- ☆36 Updated 4 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement) ☆44 Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆29 Updated 7 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025) ☆24 Updated 3 weeks ago
- Minimal implementation of the "Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models" paper (arXiv 2401.01335) ☆29 Updated 10 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling ☆44 Updated last week
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets" ☆50 Updated 7 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs. ☆48 Updated 3 months ago
- ☆84 Updated last month
- Unleashing the Power of Cognitive Dynamics on Large Language Models ☆60 Updated 3 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples ☆52 Updated last week
- Reformatted Alignment ☆113 Updated 3 months ago
- ☆47 Updated 2 months ago
- Natural Language Reinforcement Learning ☆66 Updated 3 weeks ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference) ☆109 Updated 2 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆33 Updated 11 months ago
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents ☆51 Updated 2 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios ☆65 Updated last month