nekomeowww / ollama-operator
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
☆177Updated this week
Alternatives and similar repositories for ollama-operator:
Users who are interested in ollama-operator are comparing it to the repositories listed below.
- 🎉 An awesome & curated list of best LLMOps tools.☆78Updated this week
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆109Updated this week
- MCP server connecting to Kubernetes☆140Updated last week
- Your AI Kubernetes Expert☆178Updated last year
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆16Updated 3 months ago
- Dragonfly Helm Charts☆35Updated last week
- caniuse.com, but for kubernetes☆24Updated 3 months ago
- Model Context Protocol (MCP) server for Kubernetes and OpenShift☆48Updated this week
- Envoy AI Gateway is an open source project for using Envoy Gateway to handle request traffic from application clients to Generative AI se…☆197Updated this week
- 💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSI…☆20Updated 3 months ago
- 🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.☆29Updated 3 months ago
- [Moved to https://github.com/kubernetes-sigs/kwok] fake-k8s is a tool for running fake Kubernetes clusters. It can be used as an alternat…☆18Updated 2 years ago
- MCP Server for kubernetes management commands☆218Updated this week
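- A high-performance proxy component for the Kubernetes APIServer. It proxies the APIServer's List requests, while other request types are reverse-proxied directly to the native APIServer. CKube additionally supports pagination, search, and indexing. CKube is also 100% compatible with native kubectl and ku…☆19Updated 2 years ago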
- Kubernetes API Translator☆74Updated this week
- ☆98Updated last week
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12Updated last year
- The official Kubernetes operator for etcd.☆53Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own.☆65Updated last week
- Helm chart for Ollama on Kubernetes☆398Updated last week
- ☆12Updated 3 weeks ago
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.☆30Updated 3 weeks ago
- Gateway API Inference Extension☆186Updated this week
- Spin Operator is a Kubernetes operator that empowers platform engineers to deploy Spin applications as custom resources to their Kubernet…☆244Updated last week
- Smart Kubernetes Scheduling☆77Updated last week
- Manage Kubernetes node-level kernel tuning (using sysctl).☆27Updated 3 weeks ago
- JobSet: a k8s native API for distributed ML training and HPC workloads☆211Updated this week
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters.☆68Updated 2 weeks ago
- Lightweight KubeVela that runs as Daemon in single node with high availability.☆70Updated last month
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication☆354Updated this week