substratusai / kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
☆706 Updated this week
Alternatives and similar repositories for kubeai:
Users who are interested in kubeai are comparing it to the repositories listed below.
- Helm chart for Ollama on Kubernetes ☆362 Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme… ☆1,231 Updated this week
- Finetune LLMs on K8s by using Runbooks ☆170 Updated 5 months ago
- ☆117 Updated this week
- Community-maintained Kubernetes config and Helm chart for Langfuse ☆74 Updated 3 weeks ago
- Automatic SRE Superpowers within your Kubernetes cluster ☆340 Updated this week
- An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact. ☆693 Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆312 Updated this week
- 🧬 Helix is a private GenAI stack for building AI applications with declarative pipelines, knowledge (RAG), API bindings, and first-class… ☆414 Updated this week
- OpenTelemetry Instrumentation for AI Observability ☆293 Updated this week
- deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform… ☆396 Updated 6 months ago
- Gateway API Inference Extension ☆146 Updated this week
- Your friendly and safe CLI Copilot ☆255 Updated 5 months ago
- Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫 ☆138 Updated 2 weeks ago
- On-Call Assistant for Prometheus Alerts - Get a head start on fixing alerts with AI investigation ☆636 Updated this week
- Kubernetes AI Toolchain Operator ☆529 Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆231 Updated this week
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev… ☆720 Updated this week
- Open Weight, tool-calling LLMs ☆151 Updated 3 months ago
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se… ☆125 Updated this week
- This is a landscape of the infrastructure that powers the generative AI ecosystem ☆134 Updated 4 months ago
- ☆112 Updated this week
- Kubernetes-native Job Queueing ☆1,616 Updated this week
- Chart for deploying ChromaDB in Kubernetes ☆42 Updated 3 weeks ago
- A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents. ☆474 Updated this week
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆704 Updated 3 weeks ago
- ☆157 Updated last week
- A collection of YAML files, Helm Charts, Operator code, and guides to act as an example reference implementation for NVIDIA NIM deploymen… ☆156 Updated this week
- This is a fork/refactoring of the ajmyyra/ambassador-auth-oidc project ☆88 Updated 10 months ago