[Survey] A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Foundation Models and Lifelong Agentic Systems
β2,161May 16, 2026Updated last week
Alternatives and similar repositories for Awesome-Self-Evolving-Agents
Users that are interested in Awesome-Self-Evolving-Agents are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- β1,147Oct 15, 2025Updated 7 months ago
- π EvoAgentX: Building a Self-Evolving Ecosystem of AI Agentsβ3,023May 15, 2026Updated last week
- β1,774Jan 20, 2026Updated 4 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Frameworkβ21,514Updated this week
- Tongyi Deep Research, the Leading Open-source Deep Research Agentβ18,892Feb 27, 2026Updated 2 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits β’ AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Revisiting Mid-training in the Era of Reinforcement Learning Scalingβ188Jul 23, 2025Updated 10 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"β104Apr 21, 2026Updated last month
- Autonomous Agents (LLMs) research papers. Updated Daily.β1,282Updated this week
- Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL.β572Sep 8, 2025Updated 8 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.β2,668Apr 14, 2026Updated last month
- π This is a repository for organizing papers, codes and other resources related to Visual Reinforcement Learning.β445Apr 28, 2026Updated 3 weeks ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningβ74Jul 13, 2025Updated 10 months ago
- [ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)β1,010Apr 13, 2026Updated last month
- β884Aug 30, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- The absolute trainer to light up AI agents.β17,196Apr 29, 2026Updated 3 weeks ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiβ¦β784Sep 11, 2025Updated 8 months ago
- β92Dec 5, 2024Updated last year
- [Up-to-date] Large Language Model Agent: A Survey on Methodology, Applications and Challengesβ2,707Nov 7, 2025Updated 6 months ago
- O1 Replication Journeyβ2,000Jan 14, 2025Updated last year
- The paper list of the 86-page SCIS cover paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et aβ¦β8,134Sep 12, 2025Updated 8 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learningβ1,430May 18, 2026Updated last week
- Latest Advances on System-2 Reasoningβ1,352Jun 8, 2025Updated 11 months ago
- β1,342Feb 12, 2026Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- π« CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.orgβ16,988May 18, 2026Updated last week
- [AAAI 2026] ReCode: Reinforced Code Knowledge Editing for API Updatesβ26Jul 1, 2025Updated 10 months ago
- β337May 31, 2025Updated 11 months ago
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemenβ¦β749Feb 15, 2026Updated 3 months ago
- Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)β71,468Updated this week
- Tablestore for Agent Memoryβ48Dec 19, 2025Updated 5 months ago
- GraphSearch: An Agentic Deep Searching Workflow for Graph Retrieval-Augmented Generationβ98Apr 9, 2026Updated last month
- Official Implementation of wd1β30Sep 25, 2025Updated 8 months ago
- "AutoAgent: Fully-Automated and Zero-Code LLM Agent Framework"β9,327Oct 16, 2025Updated 7 months ago
- AI Agents on DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- β513Oct 11, 2025Updated 7 months ago
- Democratizing Reinforcement Learning for LLMsβ5,548Updated this week
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasonersβ86May 21, 2025Updated last year
- β213Dec 20, 2024Updated last year
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244)β462Aug 26, 2025Updated 8 months ago
- ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning & ReCall: Learning to Reason with Tool Call for LLMs via Reiβ¦β1,385May 16, 2025Updated last year
- A library for advanced large language model reasoningβ2,343Jun 10, 2025Updated 11 months ago