Cross-GPU KV Cache Marketplace
☆22 · Nov 12, 2025 · Updated 6 months ago
Alternatives and similar repositories for kv-marketplace
Users that are interested in kv-marketplace are comparing it to the libraries listed below.
- Friendly Terminal Assistant for Developers ☆17 · Mar 23, 2024 · Updated 2 years ago
- The Alternative Self-Hosted Service for Notion Calendar ☆11 · Jan 30, 2024 · Updated 2 years ago
- QuickClash Revit Add-in for Clash Detection ☆11 · Jun 17, 2022 · Updated 3 years ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Serverless RAG application with LlamaIndex and code interpreter on Azure Container Apps ☆13 · Jan 30, 2026 · Updated 4 months ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices. ☆11 · Dec 3, 2024 · Updated last year
- 100% Private & Simple. OSS 🐍 Code Interpreter for LLMs 🦙 ☆34 · Aug 29, 2023 · Updated 2 years ago
- Docker image for a self-hosted Tailscale DERP server ☆27 · Mar 9, 2025 · Updated last year
- Current Alpha version of the ONTO-TRON-5000 ☆41 · Dec 1, 2025 · Updated 6 months ago
- E-prescription app developed with Flutter and Firebase🔥. ☆10 · Mar 31, 2022 · Updated 4 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ☆14 · Mar 30, 2024 · Updated 2 years ago
- Reading and Monitoring Related Talks, Blog Posts, and Videos ☆10 · Jun 25, 2017 · Updated 8 years ago
- Browser-based 3D perception explorer for Waymo, nuScenes, and Argoverse 2 ☆71 · May 30, 2026 · Updated last week
- CLI-first server inventory management with YAML as the single source of truth ☆55 · Jan 25, 2026 · Updated 4 months ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies. ☆24 · Nov 26, 2025 · Updated 6 months ago
- ☆11 · Jan 28, 2024 · Updated 2 years ago
- run ollama & gguf easily with a single command ☆52 · May 15, 2024 · Updated 2 years ago
- An OpenAI API compatible images server to generate or manipulate images. ☆18 · Feb 2, 2025 · Updated last year
- Resources on personal finance and investing! ☆13 · Aug 29, 2021 · Updated 4 years ago
- L2E llama2.c on Commodore C-64 ☆18 · Feb 22, 2025 · Updated last year
- jQuery, React and Streamlit applications written by LLMs ☆16 · Dec 24, 2023 · Updated 2 years ago
- ☆17 · Dec 18, 2023 · Updated 2 years ago
- ☆12 · Feb 4, 2025 · Updated last year
- ☆14 · Sep 4, 2024 · Updated last year
- LLM backed Fantasy Tribe Game ☆19 · Nov 21, 2024 · Updated last year
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School ☆12 · Oct 12, 2024 · Updated last year
- ☆14 · Dec 21, 2025 · Updated 5 months ago
- reimagine the implementation of C-3PO droid voice synthesizer and multilingual translation and communication capabilities with the latest… ☆12 · Mar 6, 2024 · Updated 2 years ago
- Reproducing GPT on the TinyStories dataset ☆19 · Jan 18, 2024 · Updated 2 years ago
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed ☆11 · Apr 14, 2023 · Updated 3 years ago
- Pipeline parallelism for the minimalist ☆39 · Aug 6, 2025 · Updated 10 months ago
- A locally run dungeon AI that uses your LLM in the terminal, for 2 to 5 players ☆16 · Jan 25, 2026 · Updated 4 months ago
- Implementation of stop sequencer for Huggingface Transformers ☆16 · Jun 6, 2023 · Updated 3 years ago
- A mini NoSQL for learning similar to leveldb ☆10 · Dec 22, 2019 · Updated 6 years ago
- We enable LLM with personalization capability ☆11 · Nov 16, 2023 · Updated 2 years ago
- Graph Convolutional Neural Networks for Alzheimer’s Classification with transfer learning and HPC methods ☆12 · Sep 20, 2021 · Updated 4 years ago
- Data Structures and Algorithms Practice ☆12 · May 26, 2023 · Updated 3 years ago
- A small framework for benchmarking machine learning models. ☆22 · Jun 6, 2025 · Updated last year
- Jupyter notebook templates for processing and analyzing neuroscience data. ☆17 · Jun 2, 2026 · Updated last week