Kubernetes operator for local LLM inference with llama.cpp, vLLM, and TGI - multi-GPU, autoscaling, air-gapped, production-ready
☆48Apr 11, 2026Updated this week
Alternatives and similar repositories for LLMKube
Users that are interested in LLMKube are comparing it to the libraries listed below.
- S3 Object Storage using Hetzner's Storagebox, SeaweedFS and a cheap VPS☆18Jul 15, 2024Updated last year
- c9watch (short for claude code watch, like k8s for Kubernetes) is a macOS desktop app that gives you a real-time dashboard of every Claud…☆63Updated this week
- A Model Context Protocol (MCP) server for OpenSCAD 3D modeling and rendering☆64Feb 15, 2026Updated last month
- Listen to Radio Paradise in lossless FLAC quality on your Squeezebox (Lyrion Music Server - fka. Logitech Media Server)☆15Mar 22, 2026Updated 3 weeks ago
- Small project to capture BLE data from a xiaomi hygrometer-thermometer.☆19Aug 29, 2020Updated 5 years ago
- Code and other materials for the S2I2 Software Summer School☆12Mar 11, 2017Updated 9 years ago
- The link to the website is at☆14Aug 12, 2015Updated 10 years ago
- rtl_433 to MQTT docker☆29Mar 27, 2021Updated 5 years ago
- ☆21Jun 8, 2025Updated 10 months ago
- Free IPTV list of Balkan Ex-Yu channels☆17Aug 3, 2025Updated 8 months ago
- CLI for creating github gists☆14Apr 20, 2017Updated 8 years ago
- ☆19Mar 17, 2026Updated 3 weeks ago
- ☆17Jan 1, 2025Updated last year
- Create and manage your Notebooks on Kubernetes with ease.☆21Mar 10, 2023Updated 3 years ago
- This is obsolete. Use this:☆21Aug 6, 2016Updated 9 years ago
- Project to manage Flux tasks needed to standardize kubernetes HPC scheduling interfaces☆28Jan 9, 2026Updated 3 months ago
- a concurrent hash array mapped trie implementation in go☆59Jun 19, 2025Updated 9 months ago
- Why you should learn about Global Storage Databases☆24May 14, 2021Updated 4 years ago
- A curated list of my GitHub stars!☆15Updated this week
- Synchronises Ableton Live to a live input such as microphone in the room or DJ playing record☆11Aug 14, 2014Updated 11 years ago
- Terraform-Based Bedrock RAG Deployment☆10Sep 17, 2024Updated last year
- Agentic Context Engineering Paper Implementation☆65Oct 11, 2025Updated 6 months ago
- Random Serum Patches☆17Apr 21, 2018Updated 7 years ago
- A CRD for arbitrary properties about a cluster☆38Feb 12, 2026Updated 2 months ago
- Classifies percussion audio samples with a CNN-LSTM, written in python and pytorch. Also exports to Drumkv1 (lv2 plugin)☆14Aug 20, 2020Updated 5 years ago
- Apache YuniKorn Scheduler Interface☆34Updated this week
- Open Direct (OOH) Schema and examples☆16Jun 7, 2023Updated 2 years ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆33May 23, 2024Updated last year
- ☆15Apr 3, 2025Updated last year
- Simplified model deployment on llm-d☆28Jul 2, 2025Updated 9 months ago
- C++ library for the implementation of tensor product calculations through a clean, concise user interface.☆26Aug 22, 2023Updated 2 years ago
- ☆31Jan 16, 2025Updated last year
- Easy `inlets` client execution.☆12Jun 6, 2020Updated 5 years ago
- VST mini-host for debugging purposes☆15Jan 12, 2021Updated 5 years ago
- OS repo for Knowledge Retrieval starter kit☆66Jan 13, 2026Updated 2 months ago
- Proof of concept: Exploiting temporal coherence in LLM inference-- delta encoding for KV cache compression and weight-skip prediction. …☆43Apr 1, 2026Updated last week
- Fast-track AI innovation with a centralized, trusted, curated registry☆252Updated this week
- Vue app for https://github.com/bearpelican/musicautobot☆17Dec 10, 2022Updated 3 years ago
- KJob: Tool for CLI-loving ML researchers☆42Mar 31, 2026Updated last week