wangyu-ustc / MemoryLLM
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"
☆255Updated 3 months ago
Alternatives and similar repositories for MemoryLLM
Users who are interested in MemoryLLM are comparing it to the repositories listed below.
- MiroThinker: open-source agentic models trained for deep research and complex tool-use scenarios.☆704Updated this week
- (ICML'25 Outstanding) CollabLLM: From Passive Responders to Active Collaborators☆256Updated last month
- A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in Large Language Models☆97Updated last week
- The official implementation of Self-Play Preference Optimization (SPPO)☆582Updated 9 months ago
- SSRL: Self-Search Reinforcement Learning☆151Updated 2 months ago
- ☆158Updated 3 weeks ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆263Updated 4 months ago
- ☆90Updated 6 months ago
- ☆502Updated 2 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆161Updated 3 weeks ago
- RL Scaling and Test-Time Scaling (ICML'25)☆112Updated 9 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]☆178Updated 4 months ago
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆83Updated 7 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆118Updated 6 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆280Updated last month
- Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"☆82Updated 5 months ago
- [EMNLP'2025 Industry] Repo for "Z1: Efficient Test-time Scaling with Code"☆66Updated 7 months ago
- Recipes to train the self-rewarding reasoning LLMs.☆227Updated 8 months ago
- ☆323Updated 2 months ago
- WideSearch: Benchmarking Agentic Broad Info-Seeking☆101Updated last month
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL☆204Updated last month
- Codebase for Iterative DPO Using Rule-based Rewards☆261Updated 7 months ago
- Efficient Agent Training for Computer Use☆132Updated 2 months ago
- Code, benchmark and environment for "ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows"☆117Updated 2 months ago
- Test-time preference optimization (ICML 2025).☆169Updated 6 months ago
- [COLM’25] DeepRetrieval — 🔥 The First Search Agent Trained by On-Policy Reinforcement Learning☆671Updated last month
- ☆210Updated 5 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression☆123Updated 7 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization (EMNLP 2025 Findings)☆73Updated 2 months ago
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆65Updated 5 months ago