zjunlp / AutoAct
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆221Updated 3 months ago
Alternatives and similar repositories for AutoAct:
Users that are interested in AutoAct are comparing it to the libraries listed below
- This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgen…☆275Updated 9 months ago
- FireAct: Toward Language Agent Fine-tuning☆275Updated last year
- A new tool learning benchmark aiming at well-balanced stability and reality, based on ToolBench.☆146Updated 3 weeks ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆139Updated last month
- An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]☆311Updated 11 months ago
- [NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents☆217Updated 3 months ago
- ☆220Updated last year
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆142Updated 11 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆103Updated last month
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆239Updated 11 months ago
- ☆228Updated last year
- ☆279Updated 9 months ago
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆86Updated last year
- ☆143Updated 10 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆244Updated 3 weeks ago
- InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)☆123Updated 4 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆89Updated 2 months ago
- [COLING 2025] ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆65Updated 5 months ago
- ☆180Updated 3 months ago
- ☆121Updated 11 months ago
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆138Updated 6 months ago
- Generative Judge for Evaluating Alignment☆236Updated last year
- ☆132Updated 4 months ago
- ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)☆262Updated last year
- ☆47Updated 4 months ago
- AWM: Agent Workflow Memory☆269Updated 3 months ago
- connecting humans and agents☆83Updated 5 months ago
- ☆94Updated 4 months ago
- [ACL 2024] LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement☆181Updated last year
- Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"☆280Updated 6 months ago