deepseek-ai/Engram

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/deepseek-ai/Engram)

deepseek-ai / Engram

Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models

☆3,792

Alternatives and similar repositories for Engram

Users that are interested in Engram are comparing it to the libraries listed below

Sorting:

verl-project / verl
View on GitHub
verl: Volcano Engine Reinforcement Learning for LLMs
☆19,519Updated this week
deepseek-ai / FlashMLA
View on GitHub
FlashMLA: Efficient Multi-head Latent Attention Kernels
☆12,505Feb 6, 2026Updated 3 weeks ago
fla-org / flash-linear-attention
View on GitHub
🚀 Efficient implementations of state-of-the-art linear attention models
☆4,474Updated this week
deepseek-ai / DeepEP
View on GitHub
DeepEP: an efficient expert-parallel communication library
☆9,005Feb 9, 2026Updated 3 weeks ago
deepseek-ai / DualPipe
View on GitHub
A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.
☆2,926Jan 14, 2026Updated last month
sgl-project / sglang
View on GitHub
SGLang is a high-performance serving framework for large language models and multimodal models.
☆23,905Updated this week
Dao-AILab / flash-attention
View on GitHub
Fast and memory-efficient exact attention
☆22,460Updated this week
MoonshotAI / Moonlight
View on GitHub
Muon is Scalable for LLM Training
☆1,440Aug 3, 2025Updated 7 months ago
Dao-AILab / sonic-moe
View on GitHub
Accelerating MoE with IO and Tile-aware Optimizations
☆597Updated this week
MoonshotAI / MoBA
View on GitHub
MoBA: Mixture of Block Attention for Long-Context LLMs
☆2,073Apr 3, 2025Updated 11 months ago
deepseek-ai / DeepGEMM
View on GitHub
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
☆6,206Updated this week
huggingface / open-r1
View on GitHub
Fully open reproduction of DeepSeek-R1
☆25,910Nov 24, 2025Updated 3 months ago
flashinfer-ai / flashinfer
View on GitHub
FlashInfer: Kernel Library for LLM Serving
☆5,057Updated this week
BytedTsinghua-SIA / DAPO
View on GitHub
An Open-source RL System from ByteDance Seed and Tsinghua AIR
☆1,739May 11, 2025Updated 9 months ago
hkust-nlp / simpleRL-reason
View on GitHub
Simple RL training for reasoning
☆3,830Dec 23, 2025Updated 2 months ago
tile-ai / tilelang
View on GitHub
Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels
☆5,284Updated this week
deepseek-ai / open-infra-index
View on GitHub
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
☆7,970May 15, 2025Updated 9 months ago
Jiayi-Pan / TinyZero
View on GitHub
Minimal reproduction of DeepSeek R1-Zero
☆12,853Updated this week
rllm-org / rllm
View on GitHub
Democratizing Reinforcement Learning for LLMs
☆5,167Updated this week
NVIDIA / Megatron-LM
View on GitHub
Ongoing research training transformer models at scale
☆15,461Updated this week
MiniMax-AI / MiniMax-01
View on GitHub
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
☆3,356Jul 7, 2025Updated 7 months ago
vllm-project / vllm
View on GitHub
A high-throughput and memory-efficient inference and serving engine for LLMs
☆71,883Updated this week
kvcache-ai / Mooncake
View on GitHub
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
☆4,843Updated this week
THUDM / slime
View on GitHub
slime is an LLM post-training framework for RL Scaling.
☆4,536Updated this week
mit-han-lab / flash-moba
View on GitHub
☆226Nov 19, 2025Updated 3 months ago
OpenRLHF / OpenRLHF
View on GitHub
An Easy-to-use, Scalable and High-performance Agentic RL Framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL)
☆9,084Updated this week
tokenbender / mHC-manifold-constrained-hyper-connections
View on GitHub
implementations and experimentation on mHC by deepseek - https://arxiv.org/abs/2512.24880
☆317Feb 17, 2026Updated 2 weeks ago
deepseek-ai / EPLB
View on GitHub
Expert Parallelism Load Balancer
☆1,351Mar 24, 2025Updated 11 months ago
deepseek-ai / DeepSeek-V3.2-Exp
View on GitHub
☆1,495Nov 18, 2025Updated 3 months ago
stepfun-ai / Step3
View on GitHub
☆451Aug 10, 2025Updated 6 months ago
GeeeekExplorer / nano-vllm
View on GitHub
Nano vLLM
☆11,920Nov 3, 2025Updated 4 months ago
inclusionAI / AReaL
View on GitHub
Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.
☆3,586Updated this week
QwenLM / Qwen3
View on GitHub
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
☆26,713Jan 9, 2026Updated last month
fla-org / native-sparse-attention
View on GitHub
🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"
☆969Feb 5, 2026Updated last month
nil0x9 / flash-muon
View on GitHub
Flash-Muon: An Efficient Implementation of Muon Optimizer
☆237Jun 15, 2025Updated 8 months ago
huggingface / trl
View on GitHub
Train transformer language models with reinforcement learning.
☆17,460Updated this week
hiyouga / EasyR1
View on GitHub
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆4,649Updated this week
deepseek-ai / 3FS
View on GitHub
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
☆9,730Feb 25, 2026Updated last week
pytorch / torchtitan
View on GitHub
A PyTorch native platform for training generative AI models
☆5,098Updated this week