BytedTsinghua-SIA / MemAgentLinks
A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.
☆157Updated this week
Alternatives and similar repositories for MemAgent
Users that are interested in MemAgent are comparing it to the libraries listed below
Sorting:
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆223Updated 2 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆150Updated last month
- ReasonFlux Series - A family of LLM post-training algorithms focusing on data selection, reinforcement learning, and inference scaling☆447Updated last week
- ☆138Updated 2 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆121Updated 3 months ago
- Parallel Scaling Law for Language Model — Beyond Parameter and Inference Time Scaling☆410Updated last month
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆174Updated 3 weeks ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆197Updated last week
- Code for the paper: "Learning to Reason without External Rewards"☆317Updated this week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆226Updated last month
- ☆183Updated last week
- ☆266Updated last month
- ☆609Updated last month
- ☆154Updated 2 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆503Updated 2 months ago
- Efficient Agent Training for Computer Use☆111Updated last month
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning☆227Updated last month
- official repository for “Reinforcement Learning for Reasoning in Large Language Models with One Training Example”☆315Updated last week
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆104Updated 4 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆220Updated 2 months ago
- A version of verl to support tool use☆288Updated this week
- ☆238Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆268Updated 4 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆244Updated 2 months ago
- ☆270Updated last month
- Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning☆188Updated 3 months ago
- ☆277Updated last month
- ☆205Updated 4 months ago
- A lightweight reproduction of DeepSeek-R1-Zero with indepth analysis of self-reflection behavior.☆244Updated 2 months ago
- ☆318Updated 9 months ago