kubeai-project / kubeaiLinks
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
☆1,081Updated last week
Alternatives and similar repositories for kubeai
Users that are interested in kubeai are comparing it to the libraries listed below
Sorting:
- Helm chart for Ollama on Kubernetes☆508Updated last week
- KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale☆857Updated last week
- Gateway API Inference Extension☆501Updated this week
- Community-maintained Kubernetes config and Helm chart for Langfuse☆171Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆601Updated this week
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆1,907Updated this week
- Kubernetes-native Job Queueing☆2,027Updated last week
- Cloud Native Agentic AI | Discord: https://bit.ly/kagentdiscord☆1,665Updated this week
- deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform…☆455Updated last year
- NVIDIA DRA Driver for GPUs☆458Updated last week
- ☆227Updated last week
- Kubernetes AI Toolchain Operator☆771Updated this week
- Model Context Protocol (MCP) server for Kubernetes and OpenShift☆714Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloads☆268Updated last week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme…☆1,977Updated this week
- A toolkit to run Ray applications on Kubernetes☆2,092Updated this week
- Automatic SRE Superpowers within your Kubernetes cluster☆395Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆260Updated last week
- MCP Server for kubernetes management commands☆1,135Updated last month
- Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More☆1,423Updated this week
- 🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫☆219Updated this week
- An open source DevOps tool from the CNCF for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI Artifac…☆1,210Updated last week
- Model Registry provides a single pane of glass for ML model developers to index and manage models, versions, and ML artifacts metadata. I…☆150Updated last week
- Next Generation Agentic Proxy for AI Agents and MCP servers☆1,130Updated this week
- CLI tool and Kubernetes Controller for building, testing and deploying MCP servers☆350Updated this week
- OpenTelemetry Instrumentation for AI Observability☆675Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs☆655Updated this week
- Controller for ModelMesh☆237Updated 4 months ago
- Module to Automatically maximize the utilization of GPU resources in a Kubernetes cluster through real-time dynamic partitioning and elas…☆672Updated last year
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆130Updated this week