XinyuanWangCS / PromptAgentLinks

This is the official repo for "PromptAgent: Strategic Planning with Language Models Enables Expert-level Prompt Optimization". PromptAgent is a novel automatic prompt optimization method that autonomously crafts prompts equivalent in quality to those handcrafted by experts, i.e., expert-level prompts.

☆341

Alternatives and similar repositories for PromptAgent

Users that are interested in PromptAgent are comparing it to the libraries listed below

Sorting:

zjunlp / AutoAct
[ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning
☆231Updated 10 months ago
CraftJarvis / RAT
Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".
☆249Updated last year
zjunlp / KnowAgent
[NAACL 2025] KnowAgent: Knowledge-Augmented Planning for LLM-Based Agents
☆251Updated 10 months ago
Reason-Wang / ToolGen
[ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"
☆165Updated 8 months ago
thunlp / ChatEval
Codes for our paper "ChatEval: Towards Better LLM-based Evaluators through Multi-Agent Debate"
☆308Updated last year
anchen1011 / FireAct
FireAct: Toward Language Agent Fine-tuning
☆286Updated 2 years ago
ADaM-BJTU / AutoCoA
AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…
☆129Updated 8 months ago
quchangle1 / LLM-Tool-Survey
This is the repository for the Tool Learning survey.
☆457Updated 3 months ago
guosyjlu / DS-Agent
Official implementation of "DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning" in ICML'24
☆219Updated last year
fate-ubw / RAGLAB
[EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
☆310Updated last year
InfiAgent / InfiAgent
InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)
☆160Updated 6 months ago
hkust-nlp / AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents [NeurIPS 2024 Oral]
☆366Updated last year
nuster1128 / LLM_Agent_Memory_Survey
☆427Updated 4 months ago
YangLing0818 / buffer-of-thought-llm
[NeurIPS 2024 Spotlight] Buffer of Thoughts: Thought-Augmented Reasoning with Large Language Models
☆671Updated 5 months ago
Zoeyyao27 / CoT-Igniting-Agent
This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reaso…
☆367Updated 2 years ago
SALT-NLP / DyLAN
Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization
☆181Updated last year
Ayanami0730 / deep_research_bench
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
☆496Updated last week
OSU-NLP-Group / TravelPlanner
[ICML'24 Spotlight] "TravelPlanner: A Benchmark for Real-World Planning with Language Agents"
☆446Updated 3 weeks ago
diagram-of-thought / diagram-of-thought
Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)
☆188Updated 3 months ago
chanchimin / RQ-RAG
Codes for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation"
☆194Updated last year
jxzhangjhu / Awesome-LLM-Prompt-Optimization
Awesome-LLM-Prompt-Optimization: a curated list of advanced prompt optimization and tuning methods in Large Language Models
☆386Updated last year
WooooDyy / AgentGym
Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…
☆658Updated 2 months ago
zhongwanjun / MemoryBank-SiliconFriend
Source code and demo for memory bank and SiliconFriend
☆367Updated 2 years ago
X-PLUG / Multi-LLM-Agent
☆232Updated last year
StonyBrookNLP / appworld
🌍 AppWorld: A Controllable World of Apps and People for Benchmarking Function Calling and Interactive Coding Agent, ACL'24 Best Resource…
☆324Updated 2 weeks ago
liyucheng09 / Selective_Context
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
☆402Updated last year
TIGER-AI-Lab / LongRAG
Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".
☆242Updated last year
OpenBMB / RAGEval
☆203Updated 8 months ago
Skytliang / Multi-Agents-Debate
MAD: The first work to explore Multi-Agent Debate with Large Language Models :D
☆466Updated 10 months ago
Ber666 / ToolkenGPT
ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)
☆264Updated last year