BytedTsinghua-SIA / MemAgent
A MemAgent framework that can be extrapolated to 3.5M-token contexts, along with a framework for RL training of any agent workflow.
☆506 Updated last week
Alternatives and similar repositories for MemAgent
Users who are interested in MemAgent are comparing it to the libraries listed below.
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆532 Updated 3 months ago
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling ☆462 Updated last week
- ☆740 Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆537 Updated 2 months ago
- Scaling RL on advanced reasoning models ☆530 Updated last week
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning ☆603 Updated 2 months ago
- ☆293 Updated last month
- Build, evaluate and train General Multi-Agent Assistance with ease ☆393 Updated this week
- [Up-to-date] Awesome Agentic Deep Research Resources ☆354 Updated 2 weeks ago
- ☆800 Updated last month
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆1,100 Updated 2 months ago
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆571 Updated 4 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso… ☆121 Updated 4 months ago
- AN O1 REPLICATION FOR CODING ☆335 Updated 7 months ago
- The official code of “Agentic Reinforced Policy Optimization”, an agentic RL algorithm optimization. ☆229 Updated this week
- ☆293 Updated 2 months ago
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling". ☆268 Updated 5 months ago
- Benchmark and research code for the paper SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆231 Updated 2 months ago
- ☆166 Updated 3 months ago
- The evaluation benchmark on MCP servers ☆163 Updated 2 months ago
- ☆263 Updated 3 weeks ago
- SkyRL: A Modular Full-stack RL Library for LLMs ☆651 Updated this week
- Awesome Agent Training ☆198 Updated last week
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆424 Updated last month
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning ☆697 Updated last week
- Parallel Scaling Law for Language Model: Beyond Parameter and Inference Time Scaling ☆417 Updated 2 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆153 Updated last month
- ☆287 Updated 2 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents ☆233 Updated last week
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction ☆335 Updated 4 months ago