Cross-GPU KV Cache Marketplace
☆22Nov 12, 2025Updated 4 months ago
Alternatives and similar repositories for kv-marketplace
Users who are interested in kv-marketplace are comparing it to the libraries listed below.
- Cython-based, high-performance alternative to Python's re module for basic pattern matching on large datasets.☆11Dec 15, 2022Updated 3 years ago
- QuickClash Revit Add-in for Clash Detection☆11Jun 17, 2022Updated 3 years ago
- Serverless RAG application with LlamaIndex and code interpreter on Azure Container Apps☆12Jan 30, 2026Updated 2 months ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- llama2 inference engine in Rust☆13Apr 12, 2024Updated last year
- 100% Private & Simple. OSS 🐍 Code Interpreter for LLMs 🦙☆34Aug 29, 2023Updated 2 years ago
- Docker image for a self-hosted Tailscale DERP server☆26Mar 9, 2025Updated last year
- E-prescription app developed with Flutter and Firebase🔥.☆11Mar 31, 2022Updated 4 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- ☆13Jan 30, 2023Updated 3 years ago
- ☆21Apr 2, 2026Updated last week
- CLI-first server inventory management with YAML as the single source of truth☆54Jan 25, 2026Updated 2 months ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆23Nov 26, 2025Updated 4 months ago
- ☆11Jan 28, 2024Updated 2 years ago
- ☆14Aug 25, 2024Updated last year
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- A fully open-source, self-hostable data lakehouse for local development and testing of modern data workflows☆79Mar 18, 2026Updated 3 weeks ago
- ☆18Jan 4, 2024Updated 2 years ago
- Vector search using only Parquet and DataFusion☆55Feb 11, 2026Updated last month
- ☆17Dec 18, 2023Updated 2 years ago
- Replacement for the AdamW and Lion optimizers for LLMs☆13May 28, 2023Updated 2 years ago
- MCP Server for Ghidra. Exposes tools to be used by AI-powered reverse engineers.☆16Mar 29, 2025Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- reimagine the implementation of C-3PO droid voice synthesizer and multilingual translation and communication capabilities with the latest…☆12Mar 6, 2024Updated 2 years ago
- Reproducing GPT on the TinyStories dataset☆19Jan 18, 2024Updated 2 years ago
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 2 years ago
- This is a training method to produce a split brain model☆14Mar 7, 2025Updated last year
- A locally run dungeon AI that uses your LLM in the terminal and supports 2 to 5 players☆16Jan 25, 2026Updated 2 months ago
- A mini NoSQL database for learning, similar to LevelDB☆10Dec 22, 2019Updated 6 years ago
- Implementation of stop sequencer for Huggingface Transformers☆16Jun 6, 2023Updated 2 years ago
- We enable LLMs with personalization capability☆11Nov 16, 2023Updated 2 years ago
- ☆19Jun 5, 2023Updated 2 years ago
- Graph Convolutional Neural Networks for Alzheimer’s Classification with transfer learning and HPC methods☆12Sep 20, 2021Updated 4 years ago
- Let's have some retro gaming fun with AI! Join the discord: https://discord.gg/5xXzkMu8Zk☆75Nov 19, 2025Updated 4 months ago
- ☆12Apr 4, 2024Updated 2 years ago
- Jupyter notebook templates for processing and analyzing neuroscience data.☆17Mar 12, 2026Updated 3 weeks ago
- A guide to testing different runpod (and other linux VMs) configurations. Specifically the speed of LLM outputs☆17Jan 12, 2024Updated 2 years ago
- Toolchain which generates Modelica building models from BIM models☆21May 2, 2020Updated 5 years ago
- Heterogeneous graph attention network for SME bankruptcy prediction☆13Feb 26, 2021Updated 5 years ago