Cross-GPU KV Cache Marketplace
☆22Nov 12, 2025Updated 4 months ago
Alternatives and similar repositories for kv-marketplace
Users who are interested in kv-marketplace are comparing it to the libraries listed below.
- ☆17Dec 19, 2024Updated last year
- Cython-based, high-performance alternative to Python's re module for basic pattern matching on large datasets.☆11Dec 15, 2022Updated 3 years ago
- quick demo scripts for podman desktop☆12Mar 2, 2026Updated 2 weeks ago
- Learn Apache Airflow with examples☆20Oct 10, 2023Updated 2 years ago
- Friendly Terminal Assistant for Developers☆17Mar 23, 2024Updated last year
- The Alternative Self-Hosted Service for Notion Calendar☆11Jan 30, 2024Updated 2 years ago
- A fully open-source, self-hostable data lakehouse for local development and testing of modern data workflows☆36Jan 26, 2026Updated last month
- Official implementation of EMNLP'23 paper "Revisiting Block-based Quantisation: What is Important for Sub-8-bit LLM Inference?"☆24Oct 25, 2023Updated 2 years ago
- QuickClash Revit Add-in for Clash Detection☆11Jun 17, 2022Updated 3 years ago
- Token-efficient Structured Object Notation for LLMs☆44Jan 19, 2026Updated 2 months ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- Wireshark-like forensic analysis for Model Context Protocol communications. Capture, inspect, and investigate all HTTP requests and respo…☆158Feb 1, 2026Updated last month
- Serverless RAG application with LlamaIndex and code interpreter on Azure Container Apps☆12Jan 30, 2026Updated last month
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- llama2 inference engine in Rust☆13Apr 12, 2024Updated last year
- Docker image for a self-hosted Tailscale DERP server☆25Mar 9, 2025Updated last year
- 100% Private & Simple. OSS 🐍 Code Interpreter for LLMs 🦙☆34Aug 29, 2023Updated 2 years ago
- A batteries included starter for building an API using bun and Elysia☆16Sep 16, 2025Updated 6 months ago
- ☆12Jan 19, 2024Updated 2 years ago
- Current Alpha version of the ONTO-TRON-5000☆40Dec 1, 2025Updated 3 months ago
- E-prescription app developed with Flutter and Firebase🔥.☆11Mar 31, 2022Updated 3 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated last year
- ☆13Jan 30, 2023Updated 3 years ago
- Memory and Context Orchestration for Coding Agents☆36Updated this week
- ☆14Sep 29, 2025Updated 5 months ago
- Waymo Open Dataset on browser visualization tool☆39Mar 2, 2026Updated 2 weeks ago
- ☆20Mar 13, 2026Updated last week
- Reading and Monitoring Related Talks, Blog Posts, and Videos☆10Jun 25, 2017Updated 8 years ago
- ☆45Apr 28, 2024Updated last year
- Mindwrite is a simple Flutter project with clean architecture and Bloc☆13Oct 24, 2024Updated last year
- Automated LLM novelist☆46Apr 11, 2024Updated last year
- ☆15Aug 10, 2017Updated 8 years ago
- ☆36Jan 25, 2026Updated last month
- CLI-first server inventory management with YAML as the single source of truth☆54Jan 25, 2026Updated last month
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆22Nov 26, 2025Updated 3 months ago
- ☆11Jan 28, 2024Updated 2 years ago
- ☆14Aug 25, 2024Updated last year
- run ollama & gguf easily with a single command☆52May 15, 2024Updated last year
- An OpenAI API compatible images server to generate or manipulate images.☆17Feb 2, 2025Updated last year