High-performance KV cache storage for LLM inference — GPU offloading, SSD caching, and cross-node sharing via RDMA. Works with vLLM and SGLang.
☆46 · Apr 26, 2026 · Updated this week
Alternatives and similar repositories for pegaflow
Users that are interested in pegaflow are comparing it to the libraries listed below.
- learn to make OS ☆11 · Nov 21, 2020 · Updated 5 years ago
- Where is my space? ☆41 · Mar 12, 2026 · Updated last month
- A multiboot OS that prints ☆15 · Nov 5, 2025 · Updated 5 months ago
- 15-721 Spring 2024 - Cache #1 ☆12 · May 2, 2024 · Updated last year
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises ☆26 · Apr 24, 2025 · Updated last year
- Metis: File System Model Checking via Versatile Input and State Exploration (FAST '24) ☆14 · Mar 18, 2025 · Updated last year
- KV cache store for distributed LLM inference ☆410 · Nov 13, 2025 · Updated 5 months ago
- High-performance RMSNorm implementation using SM core storage (registers and shared memory) ☆30 · Jan 22, 2026 · Updated 3 months ago
- ☆10 · Jul 28, 2021 · Updated 4 years ago
- OS with Rust and UEFI ☆17 · Jan 8, 2023 · Updated 3 years ago
- A comprehensive open-source cache trace dataset ☆24 · Aug 23, 2025 · Updated 8 months ago
- SMILES Toolkit ☆25 · Jul 9, 2025 · Updated 9 months ago
- - ☆11 · Nov 21, 2020 · Updated 5 years ago
- A lightweight, pluggable spec renderer built by Kong. Designed to power fast, customizable API documentation experiences. ☆33 · Updated this week
- Cache simulator specialized for flash caching for bulk storage systems ☆12 · Jan 16, 2024 · Updated 2 years ago
- waverless: A serverless framework written in Rust with WASM, CRIU, FunctionGraph, Integrated Storage ☆13 · May 22, 2025 · Updated 11 months ago
- Write yourself a simply-typed lambda calculus using Rust in a week! ☆13 · May 13, 2024 · Updated last year
- Tensara's GPU programming problems ☆18 · Apr 23, 2026 · Updated last week
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon ☆45 · Mar 31, 2026 · Updated last month
- ☆30 · Jun 7, 2025 · Updated 10 months ago
- JEDI: model-driven trace generation for cache simulations ☆17 · Oct 2, 2025 · Updated 6 months ago
- AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solu… ☆253 · Updated this week
- easy-phi ☆25 · Feb 4, 2015 · Updated 11 years ago
- ☆28 · Mar 17, 2024 · Updated 2 years ago
- Let's build a Deep Learning Framework! ☆26 · Mar 12, 2026 · Updated last month
- PolEval 2021 Task 1 ☆15 · Jun 28, 2022 · Updated 3 years ago
- ☆20 · May 9, 2023 · Updated 2 years ago
- GPU-Accelerated Cosine Similarity for Tandem Mass Spectrometry ☆17 · Nov 4, 2025 · Updated 5 months ago
- Face Animation from Text ☆18 · Aug 1, 2024 · Updated last year
- [SIGMOD '24] CaaS-LSM: Compaction-as-a-Service for LSM-based Key-Value Stores in Storage Disaggregated Infrastructure ☆70 · Jul 2, 2024 · Updated last year
- Detect and remove unused dependencies for Python projects ☆18 · Apr 5, 2025 · Updated last year
- gorobots - robots.txt recon & path discovery in Go. Structured parsing, 29-category sensitivity classification, concurrent path probing, … ☆16 · Apr 10, 2026 · Updated 3 weeks ago
- Official codebase for our paper "Do Language Models Use Their Depth Efficiently?" ☆29 · Jun 25, 2025 · Updated 10 months ago
- [AFK] Hardware router in Chisel (THU Network Joint Lab 2020) ☆14 · Oct 8, 2020 · Updated 5 years ago
- Copy and paste buffer content or file path in Nvim-Tree, Neo-Tree, Oil to another tmux pane in Neovim. ☆20 · Jan 24, 2026 · Updated 3 months ago
- A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h… ☆119 · Updated this week
- ☆16 · Jul 9, 2024 · Updated last year
- My CTF Challenges. No one plays. ☆13 · Dec 4, 2022 · Updated 3 years ago
- Burp Suite plugin created for using Collaborator tool during manual testing ☆19 · Feb 4, 2022 · Updated 4 years ago