Kubernetes operator for self-hosted LLM inference across a heterogeneous GPU fleet: NVIDIA CUDA, AMD Vulkan, and Apple Silicon Metal. Runtimes: llama.cpp, vLLM, TGI, mlx-server. Multi-GPU sharding, model caching, OpenAI-compatible endpoints. Apache-2.0, run across homelab and on-prem fleets, actively developed.
☆148Jun 28, 2026Updated this week
Alternatives and similar repositories for LLMKube
Users that are interested in LLMKube are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Kubernetes cluster with Flux and Renovate☆30Jun 26, 2026Updated last week
- ☆10Jun 10, 2026Updated 3 weeks ago
- This playbook spawns a ready-to-use AWX system on K3S, on a Debian 11 or Ubuntu 22.04 host. AWX is a tool that can be used to manage mult…☆12Jan 22, 2023Updated 3 years ago
- ☆10Sep 30, 2018Updated 7 years ago
- Automation to configure and maintain PiKVM IP-KVMs☆13Jan 3, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- BPI-EAI80 AIoT board use Edgeless EAI80 chip design☆16Dec 26, 2022Updated 3 years ago
- Documentation source for microshift.io☆12Nov 21, 2025Updated 7 months ago
- CLI for creating github gists☆14Apr 20, 2017Updated 9 years ago
- This is a repo for our docker image of the kubelet binary.☆14Jun 15, 2026Updated 2 weeks ago
- A Python Flask app for demonstrating Kubernetes concepts☆10Apr 2, 2025Updated last year
- Prometheus exporter for Bosch Sensortec temperature, atmospheric pressure and humidity sensors☆12May 29, 2024Updated 2 years ago
- ☆10Mar 7, 2025Updated last year
- Forklift project top-level repository☆19May 7, 2022Updated 4 years ago
- Ansible role to manage Linux Logical Volume Manager resources☆15Jun 10, 2026Updated 3 weeks ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Forklift documentation☆23Jun 23, 2026Updated last week
- Ansible playbooks for multiple Openshift deploy/management tasks☆13May 28, 2021Updated 5 years ago
- Parses query args in Sanic using type annotations☆18Jul 8, 2025Updated 11 months ago
- Setup a k3s cluster on Raspberry Pi 4 SBCs.☆20Aug 6, 2020Updated 5 years ago
- Modify the initrd of Aerohive AP122, AP230 & AP245x to allow root access.☆17Mar 20, 2025Updated last year
- Ansible Role - Jellyfin☆11May 26, 2026Updated last month
- JSON Logging for Sanic☆10Sep 1, 2021Updated 4 years ago
- Notes on Diffy Qs, a textbook for differential equations☆11Jun 8, 2023Updated 3 years ago
- IaC, GitOps and all the fun stuff☆22Updated this week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An Ansible role to manage FreeNAS☆14Mar 1, 2019Updated 7 years ago
- Arduino and AWS Lambda Code for pushing images to AWS S3☆15Dec 19, 2021Updated 4 years ago
- 📡 Deploy AI models and apps to Kubernetes without developing a hernia☆33May 23, 2024Updated 2 years ago
- Selfhosting personal static websites using private web analytics and comments platforms☆14Sep 25, 2022Updated 3 years ago
- Renovate configuration presets☆19Jun 23, 2026Updated last week
- Easy `inlets` client execution.☆12Jun 6, 2020Updated 6 years ago
- ☆16Apr 21, 2020Updated 6 years ago
- The classic star trek game, mirrored for convenience☆13Apr 20, 2025Updated last year
- Learning Simulator: A simulation software for animal and human learning☆12Mar 8, 2026Updated 3 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Install and configure haproxy on your system.☆23Jun 25, 2026Updated last week
- An exporter of Prometheus for resque's status.☆12Jan 28, 2024Updated 2 years ago
- A CloudFront WAF as a Terraform module covering OWASP top 10☆10Jun 30, 2023Updated 3 years ago
- A starter repo for FastAPI + React, using Fern☆19Jun 22, 2026Updated last week
- Kubernetes Fluentd Logging☆14Jun 18, 2018Updated 8 years ago
- A Kafka aggregator based on the Faust Python Stream Processing library☆10Apr 10, 2023Updated 3 years ago
- Code of leader election sidecar container.☆20Apr 15, 2026Updated 2 months ago