Kubernetes-native AI serving platform for scalable model serving.
☆267Mar 19, 2026Updated last week
Alternatives and similar repositories for kthena
Users that are interested in kthena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆114Mar 5, 2026Updated 3 weeks ago
- Unified resource orchestration, unified scheduling, unified traffic management and unified telemetry for distributed cloud☆258Sep 22, 2025Updated 6 months ago
- High Performance ServiceMesh Data Plane Based on eBPF and Programmable Kernel☆707Mar 19, 2026Updated last week
- Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…☆404Updated this week
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 3 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Variant optimization autoscaler for distributed inference workloads☆34Mar 19, 2026Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆1,191Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆682Updated this week
- NVIDIA DRA Driver for GPUs☆593Updated this week
- A workload for deploying LLM inference services on Kubernetes☆192Updated this week
- Katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is t…☆543Mar 20, 2026Updated last week
- Heterogeneous GPU Sharing on Kubernetes☆3,110Mar 19, 2026Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.☆76Jul 18, 2025Updated 8 months ago
- Vector Database & AI Gateway written in Go. Supports HNSW, Hybrid Search (BM25), GraphRAG context, a built-in RAG Pipeline, and can be em…☆65Mar 18, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- fuse-overlayfs plugin for rootless containerd on old Linux (not needed on modern Linux)☆50Jan 28, 2026Updated last month
- WG Serving☆34Mar 5, 2026Updated 3 weeks ago
- interact with your Kubernetes pods, retrieve information about pods, and receive expert insights and recommendations from gpt☆18Mar 26, 2024Updated 2 years ago
- Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration☆5,336Mar 20, 2026Updated last week
- SDK for Envoy Lua extensions☆45Jan 4, 2026Updated 2 months ago
- A Cloud Native Batch System (Project under CNCF)☆5,395Mar 19, 2026Updated last week
- A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, …☆1,665Mar 16, 2026Updated last week
- Gateway API Inference Extension☆616Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…☆26Dec 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- NVSentinel is a cross-platform fault remediation service designed to rapidly remediate runtime node-level issues in GPU-accelerated compu…☆227Mar 19, 2026Updated last week
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.☆1,480Updated this week
- Go Abstraction for Allocating NVIDIA GPUs with Custom Policies☆122Dec 8, 2025Updated 3 months ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- A High Performance Metadata System for Kubernetes☆885May 13, 2024Updated last year
- MCP Server for kubernetes management and diagnose your cluster and applications☆28May 16, 2025Updated 10 months ago
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆2,657Updated this week
- Orchestration and memory for multi-agent systems☆14Feb 6, 2026Updated last month
- KServe V2 Protocol Rest API Implementation Proxy☆13Mar 19, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- MirageDebug: Local remote debugging for Kubernetes apps, enabling fully authentic environment debugging.☆56May 27, 2024Updated last year
- tony k8s device-plugin,一个简单的 k8s device-plugin 实现以及原理分析教程。☆29Mar 29, 2025Updated 11 months ago
- Kubernetes-native Job Queueing☆2,399Updated this week
- A multi-sandbox container runtime that provides cloud-native, all-scenario multiple sandbox container solutions.☆1,404Updated this week
- Pseudo-tty handler for docker Go client https://github.com/fsouza/go-dockerclient☆29Jun 12, 2018Updated 7 years ago
- Health checks for Azure N- and H-series VMs.☆57Feb 5, 2026Updated last month
- A federation scheduler for multi-cluster☆64Mar 6, 2026Updated 3 weeks ago