LUMIA-Group / MemoryDecoder
The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Poster).
☆68 · Updated 4 months ago
Alternatives and similar repositories for MemoryDecoder
Users who are interested in MemoryDecoder are comparing it to the repositories listed below.
- A generalized framework for subspace tuning methods in parameter-efficient fine-tuning. ☆168 · Updated 7 months ago
- LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Training. ☆91 · Updated last year
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning. ☆89 · Updated 11 months ago
- ☆125 · Updated last year
- ☆119 · Updated 4 months ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compression. ☆131 · Updated 9 months ago
- ☆218 · Updated 2 months ago
- ☆46 · Updated 4 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains. ☆71 · Updated 6 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning". ☆95 · Updated 2 months ago
- ☆175 · Updated last year
- [ACL 2025] SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs, and preprint SoftCoT++: Test-Time Scaling with Soft Chain-of… ☆76 · Updated 8 months ago
- ☆137 · Updated last week
- ☆33 · Updated 7 months ago
- ☆47 · Updated 9 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning. ☆164 · Updated 4 months ago
- [EMNLP 2024 Findings 🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context In… ☆103 · Updated last year
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning. ☆70 · Updated 6 months ago
- Paper list, tutorial, and nano code snippet for Diffusion Large Language Models. ☆152 · Updated last week
- Code for ACL 2024 "MELoRA: Mini-Ensemble Low-Rank Adapter for Parameter-Efficient Fine-Tuning". ☆33 · Updated 11 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models. ☆32 · Updated last year
- ☆39 · Updated 6 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning". ☆141 · Updated last year
- Official repository for the paper "DeepCritic: Deliberate Critique with Large Language Models". ☆41 · Updated 7 months ago
- [ICLR 2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping. ☆62 · Updated 8 months ago
- ☆59 · Updated 6 months ago
- [NeurIPS 2025] CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models. ☆172 · Updated 2 months ago
- [ICML 2025] Official code of the paper "Fast Large Language Model Collaborative Decoding via Speculation". ☆28 · Updated 7 months ago
- Extrapolating RLVR to General Domains without Verifiers. ☆191 · Updated 5 months ago
- [ArXiv] V2PE: Improving Multimodal Long-Context Capability of Vision-Language Models with Variable Visual Position Encoding. ☆59 · Updated last year