Cross-GPU KV Cache Marketplace
☆22Nov 12, 2025Updated 6 months ago
Alternatives and similar repositories for kv-marketplace
Users that are interested in kv-marketplace are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Friendly Terminal Assistant for Developers☆17Mar 23, 2024Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- 100% Private & Simple. OSS 🐍 Code Interpreter for LLMs 🦙☆34Aug 29, 2023Updated 2 years ago
- Current Alpha version of the ONTO-TRON-5000☆41Dec 1, 2025Updated 5 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Automated LLM novelist☆47Apr 11, 2024Updated 2 years ago
- Browser-based 3D perception explorer for Waymo, nuScenes, and Argoverse 2☆69Mar 21, 2026Updated last month
- ☆37Jan 25, 2026Updated 3 months ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆24Nov 26, 2025Updated 5 months ago
- ☆11Jan 28, 2024Updated 2 years ago
- run ollama & gguf easily with a single command☆52May 15, 2024Updated 2 years ago
- An OpenAI API compatible images server to generate or manipulate images.☆18Feb 2, 2025Updated last year
- Oobabooga "Hello World" API example for node.js with Express☆13Jul 2, 2023Updated 2 years ago
- L2E llama2.c on Commodore C-64☆18Feb 22, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆16Dec 16, 2024Updated last year
- ☆17Dec 18, 2023Updated 2 years ago
- replacement of AdamW and Lion optimizer for LLMs☆13May 28, 2023Updated 2 years ago
- ☆12Feb 4, 2025Updated last year
- ☆14Sep 4, 2024Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- ☆14Dec 21, 2025Updated 4 months ago
- reimagine the implementation of C-3PO droid voice synthesizer and multilingual translation and communication capabilities with the latest…☆12Mar 6, 2024Updated 2 years ago
- Reproducing GPT on the TinyStories dataset☆20Jan 18, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A Toolkit for Fine-Tuning Large Language Models with LoRA and DeepSpeed☆11Apr 14, 2023Updated 3 years ago
- Pipeline parallelism for the minimalist☆39Aug 6, 2025Updated 9 months ago
- this is a dungeon ai run locally that use your llm in the terminal with multiple players from 2 to 5☆16Jan 25, 2026Updated 3 months ago
- Implementation of stop sequencer for Huggingface Transformers☆16Jun 6, 2023Updated 2 years ago
- ☆19Jun 5, 2023Updated 2 years ago
- We enable LLM with personalization capability☆11Nov 16, 2023Updated 2 years ago
- Graph Convolutional Neural Networks for Alzheimer’s Classification with transfer learning and HPC methods☆12Sep 20, 2021Updated 4 years ago
- ☆12Apr 4, 2024Updated 2 years ago
- A small framework for benchmarking machine learning models.☆22Jun 6, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A guide to testing different runpod (and other linux VMs) configurations. Specifically the speed of LLM outputs☆17Jan 12, 2024Updated 2 years ago
- Let's have some retro gaming fun with AI! Join the discord: https://discord.gg/5xXzkMu8Zk☆80Nov 19, 2025Updated 6 months ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API☆24Aug 1, 2024Updated last year
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- CI scripts designed to build a Pascal-compatible version of vLLM.☆12Aug 10, 2024Updated last year
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year