deepseek-ai / Engram
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
☆3,545 Updated 3 weeks ago
Alternatives and similar repositories for Engram
Users who are interested in Engram are comparing it to the libraries listed below.
- ☆1,300 Updated last week
- ☆1,475 Updated 2 months ago
- A framework for efficient model inference with omni-modality models ☆2,659 Updated this week
- slime is an LLM post-training framework for RL Scaling. ☆3,668 Updated this week
- Muon is Scalable for LLM Training ☆1,426 Updated 6 months ago
- Lightning-Fast RL for LLM Reasoning and Agents. Made Simple & Flexible. ☆3,509 Updated this week
- ☆1,283 Updated 2 months ago
- ☆1,543 Updated 2 months ago
- ☆814 Updated 8 months ago
- ☆1,388 Updated 4 months ago
- MoBA: Mixture of Block Attention for Long-Context LLMs ☆2,044 Updated 10 months ago
- An Open-source RL System from ByteDance Seed and Tsinghua AIR ☆1,727 Updated 8 months ago
- GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning ☆2,162 Updated 2 weeks ago
- VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo ☆1,620 Updated this week
- Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im… ☆3,399 Updated last month
- A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems. ☆3,297 Updated 3 weeks ago
- An Efficient and User-Friendly Scaling Library for Reinforcement Learning with Large Language Models ☆2,781 Updated this week
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow. ☆881 Updated 6 months ago
- Visual Causal Flow ☆2,011 Updated last week
- Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities ☆1,151 Updated 6 months ago
- Build, evaluate and train General Multi-Agent Assistance with ease ☆1,119 Updated this week
- Official Repo for Open-Reasoner-Zero ☆2,087 Updated 8 months ago
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed. ☆738 Updated 8 months ago
- ☆1,773 Updated 4 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines ☆902 Updated last week
- ☆1,231 Updated 3 months ago
- Official PyTorch implementation for "Large Language Diffusion Models" ☆3,554 Updated 2 months ago
- Scalable toolkit for efficient model reinforcement ☆1,293 Updated this week
- MMaDA - Open-Sourced Multimodal Large Diffusion Language Models ☆1,569 Updated 2 months ago
- Democratizing Reinforcement Learning for LLMs ☆5,081 Updated this week