novitalabs/pegaflow

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/novitalabs/pegaflow)

novitalabs / pegaflow

High-performance KV cache storage for LLM inference — GPU offloading, SSD caching, and cross-node sharing via RDMA. Works with vLLM and SGLang.

☆74

Alternatives and similar repositories for pegaflow

Users that are interested in pegaflow are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

belowthetree / Make-a-system
View on GitHub
learn to make OS
☆11Nov 21, 2020Updated 5 years ago
waynexia / cargo-gc
View on GitHub
Where is my space?
☆41Mar 12, 2026Updated 2 months ago
cmu-db / 15721-s24-cache1
View on GitHub
15-721 Spring 2024 - Cache #1
☆12May 2, 2024Updated 2 years ago
knoway-dev / knoway
View on GitHub
An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises
☆26Apr 24, 2025Updated last year
sbu-fsl / Metis
View on GitHub
Metis: File System Model Checking via Versatile Input and State Exploration (FAST '24)
☆15Mar 18, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
bytedance / InfiniStore
View on GitHub
KV cache store for distributed LLM inference
☆419Nov 13, 2025Updated 6 months ago
HydraQYH / hp_rms_norm
View on GitHub
High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)
☆30Jan 22, 2026Updated 3 months ago
PortSwigger / site-map-extractor
View on GitHub
☆10Jul 28, 2021Updated 4 years ago
xfoxfu / rust-xos
View on GitHub
OS with Rust and UEFI
☆17Jan 8, 2023Updated 3 years ago
cacheMon / cache_dataset
View on GitHub
A comprehensive open-source cache trace dataset
☆25Aug 23, 2025Updated 8 months ago
fsspec / alluxiofs
View on GitHub
Speed up fsspec data access with Alluxio distributed caching.
☆18Mar 22, 2026Updated last month
w9w / Open_Redirects_list
View on GitHub
-
☆11Nov 21, 2020Updated 5 years ago
wonglkd / BCacheSim
View on GitHub
Cache Simulator specialized for flash caching for bulk storage systems)
☆13Jan 16, 2024Updated 2 years ago
xzhseh / stlc-in-a-week
View on GitHub
Write yourself a simply-typed lambda calculus using Rust in a week!
☆13May 13, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
tensara / problems
View on GitHub
Tensara's GPU programming problems
☆20Apr 23, 2026Updated 3 weeks ago
moonquest-ai / SRDA
View on GitHub
☆30Jun 7, 2025Updated 11 months ago
UMass-LIDS / Jedi
View on GitHub
JEDI: model-driven trace generation for cache simulations
☆17Oct 2, 2025Updated 7 months ago
easy-phi / main
View on GitHub
easy-phi
☆25Feb 4, 2015Updated 11 years ago
vipuldivyanshu92 / ANEgpt
View on GitHub
☆83Mar 3, 2026Updated 2 months ago
ziyueqiu / FrozenHot
View on GitHub
☆28Mar 17, 2024Updated 2 years ago
poleval / 2021-punctuation-restoration
View on GitHub
PolEval 2021 Task 1
☆15Jun 28, 2022Updated 3 years ago
ai-dynamo / aiperf
View on GitHub
AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solu…
☆320Updated this week
ucare-uchicago / Queenie
View on GitHub
A user-level tool for extracting SSD internal properties
☆19Apr 8, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
flaneur2020 / rimio
View on GitHub
☆35Feb 14, 2026Updated 3 months ago
python-lapidary / lapidary
View on GitHub
Write Web API clients using annotations in python
☆16May 1, 2026Updated 2 weeks ago
asu-idi / CaaS-LSM
View on GitHub
[SIGMOD '24] CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure
☆72Jul 2, 2024Updated last year
pdm-project / pdm-shear
View on GitHub
Detect and remove unused dependencies for Python projects
☆18Apr 5, 2025Updated last year
RobertCsordas / llm_effective_depth
View on GitHub
Official codebase for our paper "Do Language Models Use Their Depth Efficiently?"
☆29Jun 25, 2025Updated 10 months ago
alibaba / ServeGen
View on GitHub
A framework for generating realistic LLM serving workloads
☆127May 11, 2026Updated last week
AndrewJNg / NPU-on-rk3588
View on GitHub
☆13Jan 31, 2024Updated 2 years ago
kiyoon / tmux-send.nvim
View on GitHub
Copy and paste buffer content or file path in Nvim-Tree, Neo-Tree, Oil to another tmux pane in Neovim.
☆20Jan 24, 2026Updated 3 months ago
llm-d / llm-d-inference-sim
View on GitHub
A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h…
☆127May 14, 2026Updated last week
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
Anemll / mlx-rdma
View on GitHub
experiments with MLX
☆68Dec 15, 2025Updated 5 months ago
jonhoo / rust-zipf
View on GitHub
Rust implementation of a fast, bounded, Zipf-distributed random number generator
☆33Feb 8, 2025Updated last year
ubaldot / vim-markdown-extras
View on GitHub
Taking notes and editing markdown files: make it easy!
☆13Mar 5, 2026Updated 2 months ago
penberg / bwtree-rs
View on GitHub
Bw-Tree for Rust
☆29May 21, 2023Updated 3 years ago
alumik / dblp-api
View on GitHub
A helper package to get information of scholarly articles from DBLP using its public API
☆16May 13, 2025Updated last year
dmemsys / Aceso
View on GitHub
This is the implementation repository of our SOSP'24 paper: Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value …
☆24Oct 20, 2024Updated last year
bloomberg / rwl-bench
View on GitHub
A set of benchmark tools for reader/writer locks.
☆28Jul 3, 2018Updated 7 years ago