yansikuan / memory-r1Links
☆69Updated 3 months ago
Alternatives and similar repositories for memory-r1
Users that are interested in memory-r1 are comparing it to the libraries listed below
Sorting:
- ☆94Updated 8 months ago
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents☆238Updated 2 weeks ago
- ☆155Updated 2 months ago
- ☆182Updated last month
- [ICML'25 Oral] Multi-agent Architecture Search via Agentic Supernet☆219Updated last month
- The implementation for ICLR 2025 Oral: From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions.☆51Updated 4 months ago
- A Collection of Papers about Memory for Language Agents☆188Updated 3 weeks ago
- ☆42Updated 2 weeks ago
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆31Updated 6 months ago
- ☆69Updated 5 months ago
- The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"☆116Updated last week
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆294Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆139Updated 9 months ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference☆373Updated 3 weeks ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆80Updated last month
- ☆12Updated 9 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".☆92Updated last month
- ☆95Updated last month
- This is the official GitHub repository for our survey paper "Beyond Single-Turn: A Survey on Multi-Turn Interactions with Large Language …☆150Updated 7 months ago
- A Unified Framework for High-Performance and Extensible LLM Steering☆134Updated last week
- ☆70Updated last year
- The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"☆20Updated last month
- ☆201Updated 4 months ago
- This is the code of MMOA-RAG.☆92Updated 7 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆48Updated 10 months ago
- ResearcherBench: Evaluating Deep AI Research Systems on the Frontiers of Scientific Inquiry☆38Updated 2 months ago
- 🔥🔥🔥 ICLR 2025 Oral. Automating Agentic Workflow Generation.☆352Updated 4 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆66Updated 6 months ago
- ☆22Updated 9 months ago
- Official code for paper "SPA-RL: Reinforcing LLM Agent via Stepwise Progress Attribution"☆56Updated 3 months ago