wangyu-ustc / MemoryLLM
The official implementation of the ICML 2024 paper "MemoryLLM: Towards Self-Updatable Large Language Models" and "M+: Extending MemoryLLM with Scalable Long-Term Memory"
☆153Updated 3 months ago
Alternatives and similar repositories for MemoryLLM
Users that are interested in MemoryLLM are comparing it to the libraries listed below
Sorting:
- The official implementation of Self-Play Preference Optimization (SPPO)☆550Updated 3 months ago
- Codebase for Iterative DPO Using Rule-based Rewards☆245Updated last month
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆167Updated 5 months ago
- Mulberry, an o1-like Reasoning and Reflection MLLM Implemented via Collective MCTS☆1,175Updated last month
- Recipes to train the self-rewarding reasoning LLMs.☆216Updated 2 months ago
- DeepRetrieval - Hacking 🔥Real Search Engines and Retrievers with LLM via RL☆491Updated last week
- Official code of paper "Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning Models"☆24Updated this week
- AvaTaR: Optimizing LLM Agents for Tool Usage via Contrastive Reasoning (NeurIPS 2024)☆199Updated 3 weeks ago
- [NeurIPS2024] Twin-Merging: Dynamic Integration of Modular Expertise in Model Merging☆135Updated 2 months ago
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆44Updated 3 months ago
- ✨ A synthetic dataset generation framework that produces diverse coding questions and verifiable solutions - all in one framwork☆217Updated last month
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆143Updated 7 months ago
- MAKGED is the first multi-agent framework for collaborative error detection in knowledge graphs.☆28Updated 2 months ago
- Benchmarking LLMs via Uncertainty Quantification☆226Updated last year
- This is the official repository for Inheritune.☆111Updated 3 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)☆135Updated 6 months ago
- Unified KV Cache Compression Methods for Auto-Regressive Models☆1,063Updated 4 months ago
- STaRK: Benchmarking LLM Retrieval on Textual and Relational Knowledge Bases (NeurIPS D&B 2024)☆309Updated 4 months ago
- "AnyGraph: Graph Foundation Model in the Wild"☆210Updated 7 months ago
- [ACL 2024] User-friendly evaluation framework: Eval Suite & Benchmarks: UHGEval, HaluEval, HalluQA, etc.☆166Updated 6 months ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆202Updated this week
- Advancing Language Model Reasoning through Reinforcement Learning and Inference Scaling☆101Updated 3 months ago
- [NeurIPS 2024] BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models☆258Updated 2 months ago
- [ICLR 2025🔥] SVD-LLM & [NAACL 2025🔥] SVD-LLM V2☆203Updated last month
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated 2 months ago
- A recipe for online RLHF and online iterative DPO.☆511Updated 4 months ago
- ☆160Updated this week
- [ICLR 2025] BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments☆36Updated 3 months ago
- [ICML 2025] Programming Every Example: Lifting Pre-training Data Quality Like Experts at Scale☆246Updated this week
- FuseAI Project☆86Updated 3 months ago