substratusai / kubeai
AI Inference Operator for Kubernetes. The easiest way to serve ML models in production. Supports VLMs, LLMs, embeddings, and speech-to-text.
☆826 Updated this week
Alternatives and similar repositories for kubeai:
Users who are interested in kubeai are comparing it to the libraries listed below
- Helm chart for Ollama on Kubernetes ☆382 Updated this week
- Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Manageme… ☆1,317 Updated this week
- ☆136 Updated this week
- Community-maintained Kubernetes config and Helm chart for Langfuse ☆85 Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication ☆324 Updated this week
- Dynamic Resource Allocation (DRA) for NVIDIA GPUs in Kubernetes ☆327 Updated this week
- Automatic SRE Superpowers within your Kubernetes cluster ☆346 Updated this week
- Finetune LLMs on K8s by using Runbooks ☆170 Updated 6 months ago
- Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, ev… ☆833 Updated last week
- OpenTelemetry Instrumentation for AI Observability ☆327 Updated last week
- deployKF builds machine learning platforms on Kubernetes. We combine the best of Kubeflow, Airflow†, and MLflow† into a complete platform… ☆410 Updated 7 months ago
- Kubernetes AI Toolchain Operator ☆543 Updated this week
- A holistic framework to enable the design, development, and evaluation of autonomous AIOps agents. ☆516 Updated this week
- Gateway API Inference Extension ☆176 Updated this week
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se… ☆164 Updated this week
- An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact. ☆727 Updated this week
- 🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫 ☆166 Updated this week
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆717 Updated this week
- LLMPerf is a library for validating and benchmarking LLMs ☆809 Updated 3 months ago
- 🧬 Helix is a private GenAI stack for building AI applications with declarative pipelines, knowledge (RAG), API bindings, and first-class… ☆453 Updated this week
- Generic rag framework to apply the power of LLMs on any given dataset ☆535 Updated last week
- A toolkit to create optimal Production-ready Retrieval Augmented Generation (RAG) setup for your data ☆1,394 Updated last month
- JobSet: a k8s native API for distributed ML training and HPC workloads ☆194 Updated this week
- A toolkit to run Ray applications on Kubernetes ☆1,554 Updated this week
- Kubernetes-native Job Queueing ☆1,653 Updated this week
- Evaluate and Enhance Your LLM Deployments for Real-World Inference Needs ☆207 Updated this week
- Fast, Accurate, Lightweight Python library to make State of the Art Embedding ☆1,844 Updated this week
- Your 24/7 On-Call AI Agent - Solve Alerts Faster with Automatic Correlations, Investigations, and More ☆712 Updated this week
- kro | Kube Resource Orchestrator ☆1,392 Updated this week