Accelerating AI Training and Inference from Storage Perspective (Must-read Papers on Storage for AI)
☆58Mar 26, 2026Updated this week
Alternatives and similar repositories for Storage-for-AI-Paper
Users that are interested in Storage-for-AI-Paper are comparing it to the libraries listed below.
- Code based on vLLM for the paper “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”.☆11Sep 19, 2024Updated last year
- ☆23Jun 21, 2023Updated 2 years ago
- Must-read Papers for File System (FS)☆318Dec 17, 2025Updated 3 months ago
- InfiniGen: Efficient Generative Inference of Large Language Models with Dynamic KV Cache Management (OSDI'24)☆182Jul 10, 2024Updated last year
- DL Dataloader Benchmarks☆20Jan 27, 2025Updated last year
- LLM checkpointing for DeepSpeed/Megatron☆25Nov 30, 2025Updated 4 months ago
- Lustre Repository with MS patches☆15Mar 22, 2026Updated last week
- ☆38Jan 15, 2021Updated 5 years ago
- A high-performance, thread-safe HashMap and LRU cache for Rust with fine-grained per-key locking.☆15Mar 10, 2026Updated 2 weeks ago
- Everything you always wanted to know about ANNS but were afraid to ask 🥰☆11Dec 21, 2024Updated last year
- A tracing tool to analyze the I/O behavior of a program.☆12Sep 25, 2019Updated 6 years ago
- Gengar, a distributed shared hybrid memory pool with RDMA support. Gengar allows applications to access remote DRAM/NVM in a large and gl…☆24May 24, 2022Updated 3 years ago
- CAM: Asynchronous GPU-Initiated, CPU-Managed SSD Management for Batching Storage Access [ICDE'25]☆18Mar 3, 2025Updated last year
- Write a simple file system from zero.☆12Apr 14, 2024Updated last year
- A pure userspace filesystem developed on SPDK's block device layer, optimized for high-speed storage devices (NVMe/PMEM) in high-concurre…☆26Mar 11, 2026Updated 2 weeks ago
- ☆26Mar 31, 2022Updated 3 years ago
- A machine learning framework with readable source code☆14Apr 30, 2025Updated 10 months ago
- ☆216Nov 23, 2025Updated 4 months ago
- Profiling and Improving the PyTorch Dataloader for high-latency Storage☆20Apr 18, 2023Updated 2 years ago
- ☆26Dec 12, 2017Updated 8 years ago
- An experiment based on the KDD '09 paper "Beyond Blacklists: Learning to Detect Malicious Web Sites from Suspicious URLs"☆10Jan 8, 2019Updated 7 years ago
- MLPerf® Storage Benchmark Suite☆176Updated this week
- ☆44Jul 10, 2017Updated 8 years ago
- Accelerating Deep Learning Training Through Transparent Storage Tiering (CCGrid'22)☆19Dec 13, 2022Updated 3 years ago
- ☆42Jun 13, 2025Updated 9 months ago
- DLL injection tool☆12Nov 9, 2020Updated 5 years ago
- Ceph is a distributed object, block, and file storage platform☆10Jun 12, 2025Updated 9 months ago
- The modern, type-safe process injection framework for Red Teams and Offensive Security in Rust.☆33Dec 15, 2025Updated 3 months ago
- NVMe based File System in User-space☆109Feb 16, 2020Updated 6 years ago
- libhadoop is a pure C/C++ library for Hadoop HDFS, like libhdfs☆19Aug 6, 2012Updated 13 years ago
- VaniDL is a tool for analyzing the I/O patterns and behavior of Deep Learning applications.☆10Jul 8, 2022Updated 3 years ago
- Official implementation of ICML 2024 paper "ExCP: Extreme LLM Checkpoint Compression via Weight-Momentum Joint Shrinking".☆47Jul 12, 2024Updated last year
- A C++11 base library built on folly, wangle, and proxygen☆11Apr 29, 2018Updated 7 years ago
- ☆15Jan 28, 2024Updated 2 years ago
- ☆12Mar 26, 2024Updated 2 years ago
- Multi-Candidate Speculative Decoding☆40Apr 22, 2024Updated last year
- Llama causal LM fully recreated in LibTorch. Designed to be used in Unreal Engine 5☆16Sep 19, 2024Updated last year
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆17Aug 21, 2023Updated 2 years ago
- Unreal Engine 5 3D Platformer game prototype☆17May 27, 2024Updated last year