substratusai / kubeaiLinks

AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.

☆1,029

Alternatives and similar repositories for kubeai

Users that are interested in kubeai are comparing it to the libraries listed below

Sorting:

NVIDIA / KAI-Scheduler
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆723Updated this week
otwld / ollama-helm
Helm chart for Ollama on Kubernetes
☆472Updated this week
llm-d / llm-d
llm-d is a Kubernetes-native high-performance distributed LLM inference framework
☆1,443Updated this week
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆415Updated this week
kagent-dev / kagent
Cloud Native Agentic AI | Discord: https://bit.ly/kagentdiscord
☆1,267Updated this week
kubernetes-sigs / lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆526Updated this week
langfuse / langfuse-k8s
Community-maintained Kubernetes config and Helm chart for Langfuse
☆138Updated 2 weeks ago
open-webui / helm-charts
☆208Updated this week
openlit / openlit
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…
☆1,757Updated this week
deployKF / deployKF
deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform…
☆441Updated last year
NVIDIA / k8s-dra-driver-gpu
NVIDIA DRA Driver for GPUs
☆402Updated this week
containers / kubernetes-mcp-server
Model Context Protocol (MCP) server for Kubernetes and OpenShift
☆416Updated last week
k8sgpt-ai / k8sgpt-operator
Automatic SRE Superpowers within your Kubernetes cluster
☆385Updated last week
kubernetes-sigs / kueue
Kubernetes-native Job Queueing
☆1,908Updated this week
kaito-project / kaito
Kubernetes AI Toolchain Operator
☆691Updated this week
Flux159 / mcp-server-kubernetes
MCP Server for kubernetes management commands
☆935Updated this week
InftyAI / Awesome-LLMOps
🎉 An awesome & curated list of best LLMOps tools.
☆141Updated this week
kubeflow / model-registry
Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…
☆138Updated this week
nekomeowww / ollama-operator
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
☆198Updated this week
robusta-dev / holmesgpt
Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More
☆1,137Updated this week
InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆228Updated last week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆246Updated this week
agentgateway / agentgateway
Next Generation Agentic Proxy for AI Agents and MCP servers
☆297Updated this week
kitops-ml / kitops
An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact.
☆1,092Updated this week
NVIDIA / nim-deploy
A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen…
☆186Updated last week
strowk / mcp-k8s-go
MCP server connecting to Kubernetes
☆331Updated this week
kserve / modelmesh-serving
Controller for ModelMesh
☆239Updated last month
Arize-ai / openinference
OpenTelemetry Instrumentation for AI Observability
☆521Updated this week
alexei-led / k8s-mcp-server
K8s-mcp-server is a Model Context Protocol (MCP) server that enables AI assistants like Claude to securely execute Kubernetes commands. I…
☆159Updated 3 months ago
nebuly-ai / nos
Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elas…
☆667Updated last year