hegongshan / Storage-for-AI-PaperView external linksLinks
Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)
☆57Dec 17, 2025Updated last month
Alternatives and similar repositories for Storage-for-AI-Paper
Users that are interested in Storage-for-AI-Paper are comparing it to the libraries listed below
Sorting:
- ☆16Jan 21, 2023Updated 3 years ago
- GeminiFS: A Companion File System for GPUs☆72Feb 18, 2025Updated 11 months ago
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- ☆24Jun 21, 2023Updated 2 years ago
- ☆15Apr 11, 2024Updated last year
- Write a simple file system from zero.☆12Apr 14, 2024Updated last year
- Must-read Papers for File System (FS)☆317Dec 17, 2025Updated last month
- ☆16Apr 13, 2024Updated last year
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆174Jul 10, 2024Updated last year
- ☆38Jan 15, 2021Updated 5 years ago
- ☆16Jul 24, 2023Updated 2 years ago
- Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22)☆19Dec 13, 2022Updated 3 years ago
- Profiling and Improving the PyTorch Dataloader for high-latency Storage☆20Apr 18, 2023Updated 2 years ago
- Gengar, a distributed shared hybrid memory pool with RDMA support. Gengar allows applications to access remote DRAM/NVM in a large and gl…☆24May 24, 2022Updated 3 years ago
- A persistent key-value store that is embeddable and optimized for fast storage.☆36Oct 24, 2024Updated last year
- NVMe based File System in User-space☆110Feb 16, 2020Updated 5 years ago
- ☆29May 28, 2024Updated last year
- ☆36Oct 27, 2020Updated 5 years ago
- Lustre Repository with MS patches☆13Feb 6, 2026Updated last week
- A tracing tool to analyze the I/O behavior of a program.☆12Sep 25, 2019Updated 6 years ago
- ☆43Jun 7, 2024Updated last year
- LiteIO is a cloud-native block device service that uses multiple storage engines, including SPDK and LVM, to achieve high performance. It…☆319Feb 6, 2024Updated 2 years ago
- SHADE: Enable Fundamental Cacheability for Distributed Deep Learning Training☆36Mar 1, 2023Updated 2 years ago
- Multi-Candidate Speculative Decoding☆39Apr 22, 2024Updated last year
- ☆42Jun 13, 2025Updated 8 months ago
- a native c/c++ hdfs client (downstream fork from apache-hawq)☆40Aug 14, 2024Updated last year
- Wait for async tasks☆13Dec 22, 2022Updated 3 years ago
- NUST-API集合☆10Oct 29, 2018Updated 7 years ago
- Repo for transient training paper at ICAC 2019.☆11Oct 5, 2022Updated 3 years ago
- Ginex: SSD-enabled Billion-scale Graph Neural Network Training on a Single Machine via Provably Optimal In-memory Caching☆41Jul 10, 2024Updated last year
- Decoding Attention is specially optimized for MHA, MQA, GQA and MLA using CUDA core for the decoding stage of LLM inference.☆46Jun 11, 2025Updated 8 months ago
- 🎓Automatically Update Distributed Learning Papers Daily using Github Actions (Update Every 12th hours)☆47Updated this week
- Codinfox theme for Zola static website generator☆11May 10, 2023Updated 2 years ago
- Texture Block Compression (BCn) written in Rust☆11Apr 12, 2021Updated 4 years ago
- ☆11Apr 10, 2025Updated 10 months ago
- "Building Distributed Systems with Stateright"☆15Jul 28, 2025Updated 6 months ago
- Further developed as SyntaxDot: https://github.com/tensordot/syntaxdot☆13Dec 18, 2020Updated 5 years ago
- 微信Ipad协议golang版本,基于grpc的实现策略。这套代码需要通过gprc服务端组包解包才可以正常使用☆12Jul 8, 2019Updated 6 years ago
- ☆12Nov 8, 2024Updated last year