zwq2018 / Agent-Pro
The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
☆94Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for Agent-Pro
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆60Updated last month
- ☆83Updated 7 months ago
- Environments, tools, and benchmarks for general computer agents☆172Updated 3 weeks ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆49Updated this week
- Towards Large Multimodal Models as Visual Foundation Agents☆120Updated this week
- ☆116Updated 5 months ago
- FireAct: Toward Language Agent Fine-tuning☆255Updated last year
- ☆193Updated 6 months ago
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆119Updated last week
- ☆78Updated last month
- ☆89Updated 7 months ago
- ☆73Updated 11 months ago
- ☆54Updated last month
- Offical Repo for "Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale"☆191Updated last month
- Reformatted Alignment☆112Updated last month
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆99Updated 3 weeks ago
- ProAgent: Building Proactive Cooperative Agents with Large Language Models☆60Updated 7 months ago
- ☆89Updated 3 months ago
- 🤠 Agent-as-a-Judge and DevAI dataset☆192Updated this week
- [ACL 2024] AUTOACT: Automatic Agent Learning from Scratch for QA via Self-Planning☆178Updated last month
- ☆29Updated 2 weeks ago
- ☆130Updated 6 months ago
- ToolEyes: Fine-Grained Evaluation for Tool Learning Capabilities of Large Language Models in Real-world Scenarios☆62Updated 7 months ago
- connecting humans and agents☆45Updated 2 weeks ago
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆74Updated 9 months ago
- Official implementation of paper "Cumulative Reasoning With Large Language Models" (https://arxiv.org/abs/2308.04371)☆287Updated 2 months ago
- ☆287Updated 2 months ago
- ☆48Updated 8 months ago
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆46Updated 8 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL☆204Updated this week