High-performance KV cache storage for LLM inference — GPU offloading, SSD caching, and cross-node sharing via RDMA. Works with vLLM and SGLang.
☆27Mar 20, 2026Updated this week
Alternatives and similar repositories for pegaflow
Users that are interested in pegaflow are comparing it to the libraries listed below
Sorting:
- Магрепорт - свободно-рапространяемая система корпоративной отчётности и BI☆11Apr 9, 2025Updated 11 months ago
- ☆22Updated this week
- Parser combinators in Kotlin for Kotlin Multiplatform☆19May 13, 2022Updated 3 years ago
- Where is my space?☆41Mar 12, 2026Updated last week
- Tool for simplifying data labeling☆14Apr 5, 2023Updated 2 years ago
- Tool for simplifying data labeling☆19Apr 19, 2023Updated 2 years ago
- 15-721 Spring 2024 - Cache #1☆12May 2, 2024Updated last year
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 10 months ago
- Metis: File System Model Checking via Versatile Input and State Exploration (FAST '24)☆13Mar 18, 2025Updated last year
- A handy plugin for copying requests/responses directly from Burp, some extra magic included.☆13Oct 15, 2021Updated 4 years ago
- High performance RMSNorm Implement by using SM Core Storage(Registers and Shared Memory)☆29Jan 22, 2026Updated 2 months ago
- ☆10Jul 28, 2021Updated 4 years ago
- MCP server for Nile Database - Manage and query databases, tenants, users, auth using LLMs☆16Mar 10, 2025Updated last year
- Rust simple async runtime constructions book.☆17Apr 14, 2024Updated last year
- The official repository for our EMNLP 2024 paper, Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretab…☆20Feb 23, 2025Updated last year
- A comprehensive open-source cache trace dataset☆24Aug 23, 2025Updated 6 months ago
- Speed up fsspec data access with Alluxio distributed caching.☆18Jan 5, 2026Updated 2 months ago
- -☆11Nov 21, 2020Updated 5 years ago
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- A set of AI-powered slash commands for Claude Code and OpenSkills (support Cursor, Windsurf and Gemini CLI) that help you understand any …☆34Mar 16, 2026Updated last week
- AIPerf is a comprehensive benchmarking tool that measures the performance of generative AI models served by your preferred inference solu…☆182Updated this week
- Font Synthesis with Pixel-Space Diffusion Transformer☆122Mar 8, 2026Updated 2 weeks ago
- game engine☆34May 27, 2020Updated 5 years ago
- Cache Simulator specialized for flash caching for bulk storage systems)☆12Jan 16, 2024Updated 2 years ago
- waverless: A serverless framework written by rust with WASM, CRIU, FunctionGraph, Integrated Storage☆14May 22, 2025Updated 10 months ago
- Write yourself a simply-typed lambda calculus using Rust in a week!☆13May 13, 2024Updated last year
- OpenCode GUI extension for VSCode☆24Mar 11, 2026Updated last week
- IntelliJ IDEA plugin for Frege language☆38May 21, 2022Updated 3 years ago
- WordPress REST API SDK for Laravel☆12May 26, 2024Updated last year
- A streaming server written in C++, with Python bindings and a web application with a live shell to demonstrate how it works. Uses the liv…☆19Oct 1, 2021Updated 4 years ago
- Triton‑style kernel toolkit for MLX plus a small upstream incubator: prototype, benchmark, and upstream fusions for Apple Silicon☆40Mar 3, 2026Updated 2 weeks ago
- ☆30Jun 7, 2025Updated 9 months ago
- JEDI: model-driven trace generation for cache simulations☆17Oct 2, 2025Updated 5 months ago
- A Human-in-the-Loop Workflow for Scientific Schema Mining with Large Language Models☆32Updated this week
- easy-phi☆25Feb 4, 2015Updated 11 years ago
- Game Engine From Scratch -- Rust China Conference 2020 topic by LemonHX and his team.☆14Dec 16, 2020Updated 5 years ago
- ☆28Mar 17, 2024Updated 2 years ago
- An example platform daemon in Rust; written for Mastering Embedded Linux☆12May 8, 2020Updated 5 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago