wangyu-ustc / MemoryLLMLinks
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"
☆163Updated 4 months ago
Alternatives and similar repositories for MemoryLLM
Users that are interested in MemoryLLM are comparing it to the libraries listed below
Sorting:
- The official implementation of Self-Play Preference Optimization (SPPO)☆566Updated 5 months ago
- ☆150Updated this week
- Codebase for Iterative DPO Using Rule-based Rewards☆247Updated 2 months ago
- Recipes to train the self-rewarding reasoning LLMs.☆223Updated 3 months ago
- Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"☆78Updated last month
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,193Updated 3 months ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆169Updated 6 months ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆48Updated 5 months ago
- Benchmarking LLMs via Uncertainty Quantification☆234Updated last year
- (ACL 2025 Main) A Comprehensive Benchmark for Code Information Retrieval.☆101Updated last week
- DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning☆569Updated last week
- A library for generating difficulty-scalable, multi-tool, and verifiable agentic tasks with execution trajectories.☆45Updated last week
- ☆86Updated last month
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆231Updated 3 weeks ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆251Updated 3 weeks ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆65Updated this week
- Unified KV Cache Compression Methods for Auto-Regressive Models☆1,147Updated 5 months ago
- ☆210Updated last month
- adds Sequence Parallelism into LLaMA-Factory☆518Updated last week
- (NeurIPS 2024) AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning☆207Updated 2 weeks ago
- From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery☆186Updated this week
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆222Updated last month
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆136Updated 3 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆258Updated 3 months ago
- [ICLR Workshop 2025] An official source code for paper "GuardReasoner: Towards Reasoning-based LLM Safeguards".☆143Updated last month
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆72Updated 3 months ago
- RL Scaling and Test-Time Scaling (ICML'25)☆106Updated 5 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆137Updated 7 months ago
- s3 - Efficient Yet Effective Search Agent Training via RL for RAG☆282Updated this week
- ☆121Updated last year