Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)
☆63Apr 14, 2026Updated last month
Alternatives and similar repositories for Storage-for-AI-Paper
Users that are interested in Storage-for-AI-Paper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆15Jan 21, 2023Updated 3 years ago
- GeminiFS: A Companion File System for GPUs☆77Feb 18, 2025Updated last year
- The code based on vLLM for the paper “ Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- ☆15Apr 11, 2024Updated 2 years ago
- Must-read Papers for File System (FS)☆324Dec 17, 2025Updated 5 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆187Jul 10, 2024Updated last year
- DL Dataloader Benchmarks☆20Jan 27, 2025Updated last year
- LLM checkpointing for DeepSpeed/Megatron☆25Nov 30, 2025Updated 6 months ago
- Lustre Repository with MS patches☆17May 19, 2026Updated last week
- ☆38Jan 15, 2021Updated 5 years ago
- A high-performance, thread-safe HashMap and LRU cache for Rust with fine-grained per-key locking.☆15May 13, 2026Updated 2 weeks ago
- ☆10Feb 22, 2023Updated 3 years ago
- A tracing tool to analyze the I/O behavior of a program.☆12Sep 25, 2019Updated 6 years ago
- Gengar, a distributed shared hybrid memory pool with RDMA support. Gengar allows applications to access remote DRAM/NVM in a large and gl…☆24May 24, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Experimental repository for GSoC 2024.☆15Aug 29, 2024Updated last year
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆19Mar 3, 2025Updated last year
- Write a simple file system from zero.☆12Apr 14, 2024Updated 2 years ago
- A pure userspace filesystem developed on SPDK's block device layer, optimized for high-speed storage devices (NVMe/PMEM) in high-concurre…☆26Mar 11, 2026Updated 2 months ago
- ☆25Mar 31, 2022Updated 4 years ago
- A Fast Graph Update Library for FPGA-based Dynamic Graph Processing☆10Dec 20, 2021Updated 4 years ago
- Cray Lustre is HPE's curated Lustre distro for HPE ClusterStor, Cray EX, and other HPE/Cray clients☆18May 21, 2026Updated last week
- Linux Cross-Memory Attach☆23Apr 21, 2026Updated last month
- Profiling and Improving the PyTorch Dataloader for high-latency Storage☆21Apr 18, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆26Dec 12, 2017Updated 8 years ago
- It's an experiment based on 09 KDD paper, Beyond Blacklists: Learning to Detect Malicious Web Sites from Suspicious URLs☆10Jan 8, 2019Updated 7 years ago
- MLPerf® Storage Benchmark Suite☆181Updated this week
- ☆44Jul 10, 2017Updated 8 years ago
- Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22)☆19Dec 13, 2022Updated 3 years ago
- DLL注入工具☆12Nov 9, 2020Updated 5 years ago
- Google DeepMind: Mixture of Depths Unofficial Implementation.☆12May 29, 2024Updated 2 years ago
- NVMe based File System in User-space☆111Feb 16, 2020Updated 6 years ago
- ☆32May 28, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- libhadoop is a pure c/c++ liberary for hadoop hdfs like libhdfs☆19Aug 6, 2012Updated 13 years ago
- VaniDL is an tool for analyzing I/O patterns and behavior with Deep Learning Applications.☆10Jul 8, 2022Updated 3 years ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆48Jul 12, 2024Updated last year
- 基于folly、wangle和proxygen的c++11基础库☆11Apr 29, 2018Updated 8 years ago
- An I/O benchmark for deep Learning applications☆105Mar 18, 2026Updated 2 months ago
- ☆133Nov 11, 2024Updated last year
- ☆12Mar 26, 2024Updated 2 years ago