substratusai / kubeaiLinks
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
☆970Updated last week
Alternatives and similar repositories for kubeai
Users that are interested in kubeai are comparing it to the libraries listed below
Sorting:
- Helm chart for Ollama on Kubernetes☆444Updated last week
- Cloud Native Agentic AI | Discord: https://bit.ly/kagentdiscord☆836Updated this week
- Gateway API Inference Extension☆304Updated last week
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se…☆275Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆457Updated this week
- Automatic SRE Superpowers within your Kubernetes cluster☆369Updated this week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆590Updated this week
- llm-d is a Kubernetes-native high-performance distributed LLM inference framework☆987Updated this week
- Community-maintained Kubernetes config and Helm chart for Langfuse☆109Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆1,585Updated this week
- deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform…☆434Updated 10 months ago
- Kubernetes-native Job Queueing☆1,798Updated this week
- OpenTelemetry Instrumentation for AI Observability☆442Updated this week
- ☆179Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes☆363Updated this week
- kro | Kube Resource Orchestrator☆2,067Updated last week
- 🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫☆189Updated this week
- An AI-Powered assistant for Kubernetes developers☆185Updated last year
- Model Context Protocol (MCP) server for Kubernetes and OpenShift☆232Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆229Updated this week
- Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More☆914Updated this week
- vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization☆1,290Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆317Updated this week
- Watch k8s events and trigger Handlers☆712Updated last week
- MCP Server for kubernetes management commands☆695Updated last week
- Kamaji is the Hosted Control Plane Manager for Kubernetes.☆1,500Updated this week
- ✨ Kubectl plugin to create manifests with LLMs☆1,174Updated 4 months ago
- Finetune LLMs on K8s by using Runbooks☆170Updated 9 months ago
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.☆79Updated this week
- Kubernetes AI Toolchain Operator☆621Updated this week