wangyu-ustc / Mem-alpha
The official implementation of the paper "Mem-α: Learning Memory Construction via Reinforcement Learning"
☆113 Updated last week
Alternatives and similar repositories for Mem-alpha
Users who are interested in Mem-alpha are comparing it to the repositories listed below.
- MemGen: Weaving Generative Latent Memory for Self-Evolving Agents ☆238 Updated 2 weeks ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆91 Updated last month
- ☆319 Updated 6 months ago
- End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning ☆332 Updated 2 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning ☆257 Updated 7 months ago
- ☆182 Updated last month
- [ICLR 2025] Benchmarking Agentic Workflow Generation ☆139 Updated 9 months ago
- A Comprehensive Survey on Long Context Language Modeling ☆213 Updated 2 weeks ago
- xVerify: Efficient Answer Verifier for Reasoning Model Evaluations ☆140 Updated last month
- Generative AI Act II: Test Time Scaling Drives Cognition Engineering ☆209 Updated 7 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning". ☆116 Updated 4 months ago
- ☆245 Updated 4 months ago
- ☆392 Updated last month
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆293 Updated last month
- MiroRL is an MCP-first reinforcement learning framework for deep research agent. ☆182 Updated 3 months ago
- A Collection of Papers about Memory for Language Agents ☆188 Updated 2 weeks ago
- A Framework for LLM-based Multi-Agent Reinforced Training and Inference ☆366 Updated 3 weeks ago
- Towards a Unified View of Large Language Model Post-Training ☆192 Updated 3 months ago
- Open Source Implementation of Alita: Generalist Agent Enabling Scalable Agentic Reasoning with Minimal Predefinition and Maximal Self-Evo… ☆95 Updated 4 months ago
- ☆138 Updated 3 months ago
- Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models ☆42 Updated 2 months ago
- ☆213 Updated 9 months ago
- ☆344 Updated 4 months ago
- ☆171 Updated last week
- Reproducing R1 for Code with Reliable Rewards ☆277 Updated 7 months ago
- Official repository for paper: O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning ☆98 Updated 9 months ago
- ☆173 Updated 7 months ago
- Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework ☆141 Updated 2 weeks ago
- The official repo for "AceCoder: Acing Coder RL via Automated Test-Case Synthesis" [ACL25] ☆95 Updated 8 months ago
- SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks ☆117 Updated 3 weeks ago