substratusai / kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
☆893 · Updated this week
Alternatives and similar repositories for kubeai:
Users who are interested in kubeai are comparing it to the repositories listed below.
- Helm chart for Ollama on Kubernetes ☆422 · Updated 2 weeks ago
- Cloud Native Agentic AI | Discord: https://bit.ly/kagentdiscord ☆563 · Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale ☆504 · Updated this week
- deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform… ☆425 · Updated 8 months ago
- ☆161 · Updated last week
- Community-maintained Kubernetes config and Helm chart for Langfuse ☆97 · Updated this week
- Gateway API Inference Extension ☆243 · Updated this week
- vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization ☆1,078 · Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆402 · Updated last week
- 🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫 ☆184 · Updated last week
- Automatic SRE Superpowers within your Kubernetes cluster ☆361 · Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme… ☆1,424 · Updated last week
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se… ☆220 · Updated this week
- Kubernetes-native Job Queueing ☆1,733 · Updated this week
- MCP Server for kubernetes management commands ☆434 · Updated this week
- Finetune LLMs on K8s by using Runbooks ☆170 · Updated 7 months ago
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆346 · Updated this week
- MCP server connecting to Kubernetes ☆248 · Updated last week
- 🧬 Helix is a private GenAI stack for building AI applications with declarative pipelines, knowledge (RAG), API bindings, and first-class… ☆490 · Updated this week
- Kubernetes AI Toolchain Operator ☆572 · Updated this week
- 🏗️ Fine-tune, build, and deploy open-source LLMs easily! ☆446 · Updated this week
- kro | Kube Resource Orchestrator ☆1,747 · Updated this week
- A toolkit to run Ray applications on Kubernetes ☆1,686 · Updated this week
- A Model Context Protocol (MCP) server for Kubernetes that enables AI assistants like Claude, Cursor, and others to interact with Kubernet… ☆372 · Updated this week
- Add-on for KEDA to scale HTTP workloads ☆416 · Updated this week
- Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More ☆841 · Updated this week
- Stateless cluster local OCI registry mirror. ☆1,785 · Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆266 · Updated this week
- Kamaji is the Hosted Control Plane Manager for Kubernetes. ☆1,375 · Updated this week
- OpenTelemetry Instrumentation for AI Observability ☆380 · Updated this week