thu-coai / SPaR
☆42Updated 2 months ago
Alternatives and similar repositories for SPaR:
Users that are interested in SPaR are comparing it to the libraries listed below
- Hammer: Robust Function-Calling for On-Device Language Models via Function Masking☆57Updated last month
- ☆81Updated 10 months ago
- ☆23Updated 5 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆51Updated 8 months ago
- ☆98Updated 2 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆116Updated 3 months ago
- Reformatted Alignment☆114Updated 4 months ago
- ☆48Updated 11 months ago
- ☆53Updated 3 months ago
- ☆15Updated 4 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆72Updated last month
- MTU-Bench: A Multi-granularity Tool-Use Benchmark for Large Language Models☆38Updated this week
- ☆89Updated 2 months ago
- The source code and dataset mentioned in the paper Seal-Tools: Self-Instruct Tool Learning Dataset for Agent Tuning and Detailed Benchmar…☆42Updated 3 months ago
- ☆92Updated 3 weeks ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆81Updated 4 months ago
- ☆52Updated 5 months ago
- [ICLR'24 spotlight] Tool-Augmented Reward Modeling☆44Updated last month
- ☆90Updated 2 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆110Updated 2 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆45Updated 2 months ago
- We aim to provide the best references to search, select, and synthesize high-quality and large-quantity data for post-training your LLMs.☆49Updated 4 months ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆43Updated last year
- Open source code of the paper: "OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain"☆48Updated last month
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆18Updated 11 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆83Updated last year
- Toy implementation of Strawberry☆30Updated 4 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆27Updated 3 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆29Updated 8 months ago