LUMIA-Group / MemoryDecoderLinks
The official implementation of the paper "Memory Decoder: A Pretrained, Plug-and-Play Memory for Large Language Models" (NeurIPS 2025 Poster).
โ69Updated 4 months ago
Alternatives and similar repositories for MemoryDecoder
Users that are interested in MemoryDecoder are comparing it to the libraries listed below
Sorting:
- ๐ LLaMA-MoE v2: Exploring Sparsity of LLaMA from Perspective of Mixture-of-Experts with Post-Trainingโ91Updated last year
- ACL'2025: SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs. and preprint: SoftCoT++: Test-Time Scaling with Soft Chain-ofโฆโ76Updated 8 months ago
- โ119Updated 4 months ago
- Official repository for paper "DeepCritic: Deliberate Critique with Large Language Models"โ41Updated 7 months ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuningโ89Updated 11 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.โ153Updated 2 weeks ago
- [EMNLP 2025] LightThinker: Thinking Step-by-Step Compressionโ132Updated 9 months ago
- Pre-trained, Scalable, High-performance Reward Models via Policy Discriminative Learning.โ164Updated 4 months ago
- [AAAI 2026] Official codebase for "GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning".โ95Updated 2 months ago
- Test-time preferenece optimization (ICML 2025).โ178Updated 9 months ago
- Code for Heimaโ59Updated 9 months ago
- RM-R1: Unleashing the Reasoning Potential of Reward Modelsโ158Updated 7 months ago
- โ59Updated 6 months ago
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chainsโ74Updated 6 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.โ169Updated last week
- CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models (NeurIPS 2025)โ172Updated 3 months ago
- [ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoningโ70Updated 6 months ago
- Official Repo for SvS: A Self-play with Variational Problem Synthesis strategy for RLVR trainingโ54Updated last month
- โ178Updated 2 months ago
- โ46Updated 4 months ago
- โ47Updated 9 months ago
- instruction-following benchmark for large reasoning modelsโ44Updated 5 months ago
- โ175Updated last year
- Extrapolating RLVR to General Domains without Verifiersโ196Updated 5 months ago
- Large Language Models Can Self-Improve in Long-context Reasoningโ72Updated last year
- Official Repository of LatentSeekโ76Updated 8 months ago
- MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Modelsโ32Updated last year
- [ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shapingโ62Updated 8 months ago
- โ125Updated last year
- [EMNLP 2025] TokenSkip: Controllable Chain-of-Thought Compression in LLMsโ200Updated 2 months ago