substratusai / kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
☆843Updated this week
Alternatives and similar repositories for kubeai:
Users that are interested in kubeai are comparing it to the libraries listed below
- Helm chart for Ollama on Kubernetes☆392Updated this week
- Cloud Native Agentic AI☆310Updated this week
- ☆145Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆1,347Updated last week
- Kubernetes AI Toolchain Operator☆555Updated this week
- deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform…☆411Updated 7 months ago
- 🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫☆175Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆339Updated this week
- Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More☆744Updated this week
- Community-maintained Kubernetes config and Helm chart for Langfuse☆89Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆330Updated this week
- Gateway API Inference Extension☆183Updated this week
- vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization☆877Updated this week
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se…☆181Updated this week
- Automatic SRE Superpowers within your Kubernetes cluster☆348Updated this week
- Finetune LLMs on K8s by using Runbooks☆170Updated 6 months ago
- An AI-Powered assistant for Kubernetes developers☆177Updated last year
- Kubernetes-native Job Queueing☆1,671Updated this week
- MCP server for Grafana☆176Updated this week
- Kamaji is the Hosted Control Plane Manager for Kubernetes.☆1,315Updated this week
- OpenTelemetry Instrumentation for AI Observability☆339Updated this week
- User documentation for KServe.☆104Updated last week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆194Updated this week
- kro | Kube Resource Orchestrator☆1,499Updated this week
- LLMPerf is a library for validating and benchmarking LLMs☆826Updated 3 months ago
- Controller for ModelMesh☆225Updated 3 weeks ago
- Karpenter is a Kubernetes Node Autoscaler built for flexibility, performance, and simplicity.☆815Updated this week
- MCP Server for kubernetes management commands☆171Updated this week
- Convert Ingress resources to Gateway API resources☆434Updated last month
- Your friendly and safe CLI Copilot☆259Updated 6 months ago