volcano-sh/kthena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/volcano-sh/kthena)

volcano-sh / kthena

Kubernetes-native AI serving platform for scalable model serving.

☆397

Alternatives and similar repositories for kthena

Users that are interested in kthena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

volcano-sh / agentcube
View on GitHub
☆159Updated this week
volcano-sh / volcano-global
View on GitHub
A federation scheduler for multi-cluster
☆76Mar 6, 2026Updated 4 months ago
sgl-project / rbg
View on GitHub
A workload for deploying LLM inference services on Kubernetes
☆267Updated this week
kubernetes-sigs / lws
View on GitHub
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆773Updated this week
ome-projects / ome
View on GitHub
Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, T…
☆482Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Project-HAMi / volcano-vgpu-device-plugin
View on GitHub
Device-plugin for volcano vgpu which support hard resource isolation
☆162Jun 9, 2026Updated last month
kmesh-net / kmesh
View on GitHub
High Performance ServiceMesh Data Plane Based on eBPF and Programmable Kernel
☆739Jun 29, 2026Updated last month
kai-scheduler / KAI-Scheduler
View on GitHub
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆1,423Updated this week
kubernetes-sigs / gateway-api-inference-extension
View on GitHub
Gateway API Inference Extension
☆725Updated this week
Project-HAMi / HAMi
View on GitHub
Heterogeneous GPU Sharing on Kubernetes
☆4,096Updated this week
kurator-dev / kurator
View on GitHub
Unified resource orchestration, unified scheduling, unified traffic management and unified telemetry for distributed cloud
☆256Sep 22, 2025Updated 10 months ago
volcano-sh / volcano
View on GitHub
A Cloud Native Batch System (Project under CNCF)
☆5,813Updated this week
llm-d / llm-d
View on GitHub
Achieve state of the art inference performance with modern accelerators on Kubernetes
☆3,898Updated this week
openkruise / agents
View on GitHub
Rapid and cost-effective operator and best practice for agent sandbox lifecycle management.
☆249Updated this week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
kubernetes-sigs / kueue
View on GitHub
Kubernetes-native Job Queueing
☆2,756Updated this week
BaizeAI / kcover
View on GitHub
🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.
☆35Updated this week
kmesh-net / orion
View on GitHub
Orion - Cloud-native high-performance proxy implemented in Rust attempting to be a drop in replacement for Envoy Proxy
☆38Jun 29, 2026Updated last month
kubernetes-sigs / dra-driver-nvidia-gpu
View on GitHub
DRA Driver for NVIDIA GPUs
☆678Updated this week
kubewharf / godel-scheduler
View on GitHub
a unified scheduler for online and offline tasks
☆675Mar 2, 2026Updated 4 months ago
kubewharf / katalyst-core
View on GitHub
Katalyst aims to provide a universal solution to help improve resource utilization and optimize the overall costs in the cloud. This is t…
☆560Updated this week
llm-d / llm-d-router
View on GitHub
llm-d Router: The intelligent entry point for inference requests
☆272Updated this week
karmada-io / karmada
View on GitHub
Open, Multi-Cloud, Multi-Cluster Kubernetes Orchestration
☆5,545Updated this week
copilot-io / runtime-copilot
View on GitHub
The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…
☆13May 16, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
knoway-dev / knoway
View on GitHub
An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises
☆27Apr 24, 2025Updated last year
kappital / kappital
View on GitHub
A Cloud-Native Service Catalog and Full Lifecycle Management Platform accross Multi-cloud and Edge
☆32Sep 28, 2023Updated 2 years ago
volcano-sh / apis
View on GitHub
The API (CRD) of Volcano
☆50Jul 10, 2026Updated 2 weeks ago
InftyAI / llmaz
View on GitHub
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆309Jan 26, 2026Updated 6 months ago
koordinator-sh / koordinator
View on GitHub
A QoS-based scheduling system brings optimal layout and status to workloads such as microservices, web services, big data jobs, AI jobs, …
☆1,729Updated this week
ai-dynamo / grove
View on GitHub
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
☆245Updated this week
kubernetes-sigs / agent-sandbox
View on GitHub
agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.
☆3,319Updated this week
NVIDIA / knavigator
View on GitHub
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆79Jul 6, 2026Updated 3 weeks ago
Project-HAMi / HAMi-core
View on GitHub
HAMi-core compiles libvgpu.so, which ensures hard limit on GPU in container
☆319Updated this week
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
llm-d / llm-d-kv-cache
View on GitHub
Distributed KV cache scheduling & offloading libraries
☆165Updated this week
kuasar-io / kuasar
View on GitHub
A multi-sandbox container runtime that provides cloud-native, all-scenario multiple sandbox container solutions.
☆1,438Jun 5, 2026Updated last month
llm-d / llm-d-workload-variant-autoscaler
View on GitHub
Variant optimization autoscaler for distributed inference workloads
☆52Updated this week
karmada-io / multicluster-cloud-provider
View on GitHub
Defines the shared interfaces which Karmada cloud providers implement. These interfaces allow various controllers to integrate with any c…
☆16Mar 19, 2026Updated 4 months ago
envoyproxy / ai-gateway
View on GitHub
Manages Unified Access to Generative AI Services built on Envoy Gateway
☆1,873Updated this week
vllm-project / aibrix
View on GitHub
Cost-efficient and pluggable Infrastructure components for GenAI inference
☆4,982Updated this week
ai-dynamo / dynamo
View on GitHub
A Datacenter Scale Distributed Inference Serving Framework
☆7,608Updated this week