zwq2018 / Agent-Pro
The Code Repo for Agent-Pro: Learning to Evolve via Policy-Level Reflection and Optimization
☆109Updated 7 months ago
Alternatives and similar repositories for Agent-Pro:
Users that are interested in Agent-Pro are comparing it to the libraries listed below
- ☆126Updated 3 months ago
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆61Updated 6 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆49Updated last month
- Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)☆134Updated 5 months ago
- ☆47Updated 4 months ago
- ☆121Updated 10 months ago
- connecting humans and agents☆81Updated 4 months ago
- Code and data for OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis☆126Updated 2 weeks ago
- ☆94Updated 4 months ago
- ☆101Updated 4 months ago
- Official Code for "Coser: Coordinating LLM-Based Persona Simulation of Established Roles"☆55Updated last week
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆124Updated 4 months ago
- rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking☆38Updated 3 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆221Updated 3 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆204Updated 2 months ago
- Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆60Updated this week
- On Memorization of Large Language Models in Logical Reasoning☆63Updated 3 weeks ago
- [NeurIPS 2024] Official Implementation for Optimus-1: Hybrid Multimodal Memory Empowered Agents Excel in Long-Horizon Tasks☆71Updated last week
- Official Implementation of Dynamic LLM-Agent Network: An LLM-agent Collaboration Framework with Agent Team Optimization☆136Updated 11 months ago
- ☆131Updated 4 months ago
- Code for NeurIPS 2024 paper "AutoManual: Constructing Instruction Manuals by LLM Agents via Interactive Environmental Learning"☆40Updated 5 months ago
- ☆41Updated 5 months ago
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆171Updated last month
- Implementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" pr…☆98Updated last year
- AAAI24(Oral) ProAgent: Building Proactive Cooperative Agents with Large Language Models☆80Updated last month
- Reformatted Alignment☆115Updated 6 months ago
- ☆143Updated 9 months ago
- [preprint] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆43Updated 3 months ago
- ☆91Updated last year
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆199Updated this week