AgentEvolver: Towards Efficient Self-Evolving Agent System
☆1,477Apr 1, 2026Updated 3 months ago
Alternatives and similar repositories for AgentEvolver
Users that are interested in AgentEvolver are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ReMe: Memory Management Kit for Agents - Remember Me, Refine Me.☆3,145Updated this week
- ☆30Jul 7, 2025Updated 11 months ago
- Scaling Agentic Environments Automatically.☆66Mar 26, 2026Updated 3 months ago
- This is the official implementation for MA-LoT.☆19Aug 4, 2025Updated 11 months ago
- RAGEN leverages reinforcement learning to train LLM reasoning agents in interactive, stochastic environments.☆2,734Apr 14, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2026] Agentic Reinforced Policy Optimization (ARPO)☆1,066Apr 13, 2026Updated 2 months ago
- [ICLR'26] MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs☆53Apr 17, 2026Updated 2 months ago
- Code and implementations for the ACL 2025 paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhi…☆806May 30, 2026Updated last month
- RAG-RewardBench: Benchmarking Reward Models in Retrieval Augmented Generation for Preference Alignment☆18Dec 19, 2024Updated last year
- Multi-tenant fine-tuning for LLMs with Tinker-compatible API☆53Updated this week
- ☆130Mar 31, 2026Updated 3 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆331Jun 17, 2026Updated 2 weeks ago
- [AAAI 2026] ✨ TSPO: Temporal Sampling Policy Optimization for Long-form Video Language Understanding☆130Nov 12, 2025Updated 7 months ago
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework☆305Jan 17, 2026Updated 5 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆69Mar 25, 2026Updated 3 months ago
- MrlX: A Multi-Agent Reinforcement Learning Framework☆211Jan 19, 2026Updated 5 months ago
- ☆324Jan 3, 2026Updated 6 months ago
- verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework☆22,173Jun 27, 2026Updated last week
- An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & VLM & TIS & vLLM & Ray & Asy…☆9,710Jun 17, 2026Updated 2 weeks ago
- [ICML'26] MemEvolve & EvolveLab☆248May 5, 2026Updated last month
- ☆28Mar 17, 2026Updated 3 months ago
- verl-agent is an extension of veRL, designed for training LLM/VLM agents via RL. verl-agent is also the official code for paper "Group-in…☆2,067Jun 9, 2026Updated 3 weeks ago
- OpenJudge: A Unified Framework for Holistic Evaluation and Quality Rewards☆689Jun 17, 2026Updated 2 weeks ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,510Updated this week
- ☆24Feb 3, 2026Updated 5 months ago
- [AAAI'25] SPRING: Learning Scalable and Pluggable Virtual Tokens for Retrieval-Augmented Large Language Models☆26Sep 24, 2025Updated 9 months ago
- Agentic RL on Any Harness at Scale☆594Jun 26, 2026Updated last week
- The code for paper: Hierarchical Document Refinement for Long-context Retrieval-augmented Generation [ACL2025 Oral]☆46Aug 25, 2025Updated 10 months ago
- The official implementation of "EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis".☆166Feb 12, 2026Updated 4 months ago
- Official Repo for Open-Reasoner-Zero☆2,096Jun 2, 2025Updated last year
- ToolOrchestra is an end-to-end RL training framework for orchestrating tools and agentic workflows.☆742Mar 25, 2026Updated 3 months ago
- Step-DeepResearch☆566Mar 24, 2026Updated 3 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆24May 24, 2025Updated last year
- Fathom-DeepResearch: Unlocking Long Horizon Information Retrieval And Synthesis For SLMs☆60Oct 7, 2025Updated 8 months ago
- Tongyi Deep Research, the Leading Open-source Deep Research Agent☆19,570Feb 27, 2026Updated 4 months ago
- LLM that can be trained on 1 or more GPUs for research.☆56May 28, 2026Updated last month
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆847May 14, 2025Updated last year
- Marketplace ML experiment - training without backprop☆28Sep 9, 2025Updated 9 months ago
- The official repository for paper Evaluating Financial Relational Graphs: Interpretation Before Prediction☆20Jan 2, 2026Updated 6 months ago