stonet-research/cheops25-IO-characterization-of-LLM-model-kv-cache-offloading-nvme

stonet-research / cheops25-IO-characterization-of-LLM-model-kv-cache-offloading-nvme

☆19

Alternatives and similar repositories for cheops25-IO-characterization-of-LLM-model-kv-cache-offloading-nvme

Users who are interested in cheops25-IO-characterization-of-LLM-model-kv-cache-offloading-nvme are comparing it to the repositories listed below.

DKU-StarLab / ConfZNS
Configuration ZNS SSD emulator
☆28 · Nov 19, 2024 · Updated last year
ChaseLab-PKU / InstAttention
InstAttention: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference
☆18 · Mar 30, 2025 · Updated last year
snu-csl / nvmevirt
NVMeVirt: A Versatile Software-defined Virtual NVMe Device
☆317 · May 21, 2026 · Updated 2 months ago
arc-research-lab / AGILE
AGILE: Lightweight and Efficient Asynchronous GPU-SSD Integration (SC25)
☆24 · Apr 14, 2026 · Updated 3 months ago
SNU-HPCS / 3D-FPIM
3D-FPIM: An Extreme Energy-Efficient DNN Acceleration System Using 3D NAND Flash-Based In-Situ PIM Unit (MICRO 2022)
☆27 · May 19, 2023 · Updated 3 years ago
stonet-research / zns-tools
A set of tools for understanding F2FS usage of ZNS devices, which allow for identifying the on-device locations of files and inodes, mapp…
☆20 · Jan 19, 2025 · Updated last year
hongsunjang / HILOS
[ASPLOS'26] HILOS: A Cost-Effective Near-Storage Processing Solution for Offline Inference of Long-Context LLMs
☆20 · Jan 18, 2026 · Updated 6 months ago
sg20180546 / ZNS-awesome-paper
Paper related to Zone NameSpace (SSD,HDD)
☆90 · May 13, 2026 · Updated 2 months ago
cares-davinci / MQSim-E
☆21 · Apr 18, 2024 · Updated 2 years ago
VIA-Research / SwarmIO
SwarmIO is an SSD emulation framework for next-generation GPU-centric storage systems research
☆54 · May 24, 2026 · Updated 2 months ago
dengls24 / LLM-para
Analyze LLM inference: FLOPs, memory, Roofline model. Supports GQA, MoE, MLA, RoPE, SwiGLU. 19 models × 20+ hardware platforms.
☆21 · Apr 16, 2026 · Updated 3 months ago
jaehongm / eZNS
☆14 · Aug 2, 2023 · Updated 2 years ago
dywsjtu / apparate
Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
☆24 · Nov 21, 2024 · Updated last year
mj-nvram / NFS
A Cycle-accurate Microarchitecture-level NAND Flash Memory System Simulation Framework
☆29 · Mar 20, 2013 · Updated 13 years ago
multifacet / Bypassd
Bypassd is a novel I/O architecture that provides low latency access to shared SSDs.
☆23 · May 14, 2025 · Updated last year
nicexlab / GeminiFS
GeminiFS: A Companion File System for GPUs
☆85 · Jul 8, 2026 · Updated 3 weeks ago
sincronia-coflow / implementation
Sincronia Implementation
☆11 · Sep 11, 2018 · Updated 7 years ago
frostschutz / FairNAT
Fair NAT for Linux Routers - shaper script which allows fair bandwidth sharing among clients in the local network
☆26 · Mar 22, 2010 · Updated 16 years ago
ece-fast-lab / cxl_type3_tests
This is the repository that holds the artifacts of MICRO'23 -- Demystifying CXL Memory with True CXL-Ready Systems and CXL Memory Device…
☆53 · Mar 17, 2024 · Updated 2 years ago
Dantali0n / OpenCSD
OpenCSD: eBPF Computational Storage Device (CSD) for Zoned Namespace (ZNS) SSDs in QEMU
☆69 · Nov 1, 2023 · Updated 2 years ago
Adaxry / Unified_Layer_Skipping
☆15 · Apr 11, 2024 · Updated 2 years ago
SamsungDS / TorFS
TorFS is a plugin that enables RocksDB to access FDP SSDs
☆15 · Jul 16, 2025 · Updated last year
LMCache / LMBenchmark
Systematic and comprehensive benchmarks for LLM systems.
☆62 · Jan 28, 2026 · Updated 6 months ago
SamsungDS / xZTL
Zone Translation Layer User-space Library
☆22 · Sep 15, 2023 · Updated 2 years ago
SouhailHammou / Custom-VM
Virtual machine with a custom instruction set in C
☆16 · Jul 17, 2018 · Updated 8 years ago
nicktehrany / msF2FS
multi-streamed F2FS: An NVMe ZNS SSD optimized F2FS File System with concurrently writable hot/warm/cold data streams and application-gui…
☆25 · Mar 16, 2023 · Updated 3 years ago
mutonix / pyramidinfer
☆47 · Nov 25, 2024 · Updated last year
YaoJiayi / CacheBlend
☆201 · Jul 15, 2025 · Updated last year
gafert / Apate
A graphical and educational processor simulator based on the RISC-V instruction set architecture
☆11 · Apr 28, 2024 · Updated 2 years ago
caoshiyi / artifacts
☆40 · Nov 28, 2024 · Updated last year
ByteDance-Seed / ShadowKV
[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference
☆311 · May 1, 2025 · Updated last year
pmem / pmem.github.io
The pmem.io Website
☆17 · Jan 20, 2026 · Updated 6 months ago
Faraz9877 / H100_GEMM
High-performance GEMM implementation optimized for NVIDIA H100 GPUs, leveraging Hopper architecture's TMA, WGMMA, and Thread Block Cluste…
☆11 · Dec 4, 2024 · Updated last year
dimitrs / cpp-opencl
C++ to OpenCL C Source-to-source Translation
☆13 · Feb 15, 2014 · Updated 12 years ago
nicovank / Energy-Languages
☆14 · Nov 12, 2025 · Updated 8 months ago
westerndigitalcorporation / libzbd
Zoned block device manipulation library and tools
☆77 · May 30, 2024 · Updated 2 years ago
arc-research-lab / SCARIF
SCARIF is a tool to estimate the embodied carbon emissions of data center servers with accelerator hardware (GPUs, FPGAs, etc.)
☆15 · Updated this week
zyqCSL / DiffKV
☆45 · Oct 11, 2025 · Updated 9 months ago
liblaf / ilatex
📚 LaTeX templates and tools for creating beautiful, structured documents 📝
☆14 · Oct 24, 2025 · Updated 9 months ago