deepseek-ai / EngramLinks
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
☆3,545Updated 3 weeks ago
Alternatives and similar repositories for Engram
Users that are interested in Engram are comparing it to the libraries listed below
Sorting:
- ☆1,282Updated 2 weeks ago
- ☆1,465Updated 2 months ago
- A framework for efficient model inference with omni-modality models☆2,491Updated this week
- slime is an LLM post-training framework for RL Scaling.☆3,571Updated last week
- MoBA: Mixture of Block Attention for Long-Context LLMs☆2,038Updated 10 months ago
- Muon is Scalable for LLM Training☆1,421Updated 6 months ago
- ☆1,278Updated 2 months ago
- ☆1,540Updated 2 months ago
- ☆814Updated 7 months ago
- VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo☆1,601Updated this week
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible.☆3,470Updated this week
- GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning☆2,145Updated last week
- ☆1,759Updated 4 months ago
- A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.☆3,297Updated 2 weeks ago
- WeDLM: The fastest diffusion language model with standard causal attention and native KV cache compatibility, delivering real speedups ov…☆597Updated 3 weeks ago
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…☆3,366Updated 3 weeks ago
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models☆2,747Updated this week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆875Updated 6 months ago
- ☆1,222Updated 3 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching☆1,236Updated 5 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease☆1,113Updated this week
- Visual Causal Flow☆1,306Updated last week
- ☆1,385Updated 4 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆738Updated 7 months ago
- MiMo-V2-Flash: Efficient Reasoning, Coding, and Agentic Foundation Model☆1,041Updated 3 weeks ago
- 🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"☆964Updated 10 months ago
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities☆1,151Updated 6 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR☆1,715Updated 8 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆902Updated this week
- Official Repo for Open-Reasoner-Zero☆2,086Updated 8 months ago