wangyu-ustc / MemoryLLM
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and of "M+: Extending MemoryLLM with Scalable Long-Term Memory"
☆159 Updated 3 months ago
Alternatives and similar repositories for MemoryLLM
Users who are interested in MemoryLLM are comparing it to the libraries listed below.
- The official implementation of Self-Play Preference Optimization (SPPO) ☆565 Updated 4 months ago
- Codebase for Iterative DPO Using Rule-based Rewards ☆246 Updated last month
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS ☆1,185 Updated 2 months ago
- Recipes to train the self-rewarding reasoning LLMs. ☆222 Updated 3 months ago
- Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models" ☆74 Updated last week
- DeepRetrieval - 🔥 Training Search Agent with Retrieval Outcomes via Reinforcement Learning ☆537 Updated last week
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin… ☆168 Updated 6 months ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models ☆45 Updated 4 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale ☆248 Updated 3 weeks ago
- MAKGED is the first multi-agent framework for collaborative error detection in knowledge graphs. ☆28 Updated 3 months ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging ☆136 Updated 2 months ago
- Unified KV Cache Compression Methods for Auto-Regressive Models ☆1,108 Updated 5 months ago
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024) ☆201 Updated last month
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framework ☆229 Updated this week
- ☆196 Updated 3 weeks ago
- [ICLR 2025] BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments ☆36 Updated 3 months ago
- [ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models ☆37 Updated 3 weeks ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆135 Updated 6 months ago
- [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2 ☆210 Updated 2 months ago
- Benchmarking LLMs via Uncertainty Quantification ☆230 Updated last year
- Reformatted Alignment ☆114 Updated 8 months ago
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling ☆104 Updated 4 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆143 Updated 8 months ago
- [ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation ☆103 Updated 2 weeks ago
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning ☆98 Updated last month
- A dataset of LLM-generated chain-of-thought steps annotated with mistake location. ☆81 Updated 9 months ago
- ☆67 Updated 2 months ago
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI ☆101 Updated 3 months ago
- R1-like Computer-use Agent ☆74 Updated 2 months ago
- Co-LLM: Learning to Decode Collaboratively with Multiple Language Models ☆114 Updated last year