deepseek-ai / 3FSLinks
A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
☆9,633Updated last week
Alternatives and similar repositories for 3FS
Users that are interested in 3FS are comparing it to the libraries listed below
Sorting:
- A lightweight data processing framework built on DuckDB and 3FS.☆4,901Updated 10 months ago
- DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling☆6,098Updated last week
- A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.☆2,905Updated last week
- DeepEP: an efficient expert-parallel communication library☆8,898Updated 3 weeks ago
- Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation☆7,957Updated 8 months ago
- FlashMLA: Efficient Multi-head Latent Attention Kernels☆11,979Updated last week
- Expert Parallelism Load Balancer☆1,334Updated 10 months ago
- Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.☆4,600Updated this week
- Analyze computation-communication overlap in V3/R1.☆1,136Updated 10 months ago
- A Datacenter Scale Distributed Inference Serving Framework☆5,793Updated last week
- FlashInfer: Kernel Library for LLM Serving☆4,707Updated this week
- SGLang is a high-performance serving framework for large language models and multimodal models.☆22,556Updated this week
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆4,739Updated last week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,305Updated 6 months ago
- Supercharge Your LLM with the Fastest KV Cache Layer☆6,750Updated this week
- Nano vLLM☆10,892Updated 2 months ago
- Fully open reproduction of DeepSeek-R1☆25,825Updated 2 months ago
- Minimal reproduction of DeepSeek R1-Zero☆12,598Updated 9 months ago
- Cost-efficient and pluggable Infrastructure components for GenAI inference☆4,532Updated last week
- Tile primitives for speedy kernels☆3,096Updated last week
- A compact implementation of SGLang, designed to demystify the complexities of modern LLM serving systems.☆3,067Updated this week
- s1: Simple test-time scaling☆6,631Updated 7 months ago
- DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding☆5,191Updated 10 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆26,197Updated 2 weeks ago
- A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.☆3,721Updated last month
- Renderer for the harmony response format to be used with gpt-oss☆4,146Updated last month
- DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model☆4,989Updated last year
- Mirage Persistent Kernel: Compiling LLMs into a MegaKernel☆2,084Updated 2 weeks ago
- verl: Volcano Engine Reinforcement Learning for LLMs☆18,535Updated this week
- Democratizing Reinforcement Learning for LLMs☆4,995Updated last week