InftyAI / Awesome-LLMOps
An awesome & curated list of best LLMOps tools.
87 stars · Updated last week
Alternatives and similar repositories for Awesome-LLMOps:
Users who are interested in Awesome-LLMOps are comparing it to the repositories listed below:
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. Star to support our work! · 127 stars · Updated this week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs. · 67 stars · Updated last week
- A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it a unified cache system with POSI… · 21 stars · Updated 4 months ago
- Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads. · 29 stars · Updated 4 months ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i… · 12 stars · Updated last year
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes. · 65 stars · Updated last week
- Knowledge for GPTScript · 29 stars · Updated 5 months ago
- Manage Kubernetes node-level kernel tuning (using sysctl). · 28 stars · Updated last month
- WG Serving · 24 stars · Updated last week
- Smart Kubernetes Scheduling · 78 stars · Updated this week
- Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! · 185 stars · Updated this week
- Example DRA driver that developers can fork and modify to get them started writing their own. · 69 stars · Updated last month
- ✨ Kubewizard is an AI agent for automated Kubernetes troubleshooting and management, based on LangChain and k8s-related tools. · 18 stars · Updated 3 months ago
- GenAI inference performance benchmarking tool · 39 stars · Updated this week
- K8s device plugin for GPU sharing · 100 stars · Updated last year
- A diverse, simple, and secure all-in-one LLMOps platform · 102 stars · Updated 7 months ago
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment. · 93 stars · Updated this week
- Model Context Protocol (MCP) server for Kubernetes and OpenShift · 122 stars · Updated this week
- Deploy AI models and apps to Kubernetes without developing a hernia · 32 stars · Updated 11 months ago
- OpenCIDN (Open Container Image Deliver Network) · 13 stars · Updated last week
- 16 stars · Updated last month
- 12 stars · Updated 2 weeks ago
- CAPK is a provider for Cluster API (CAPI) that allows users to deploy fake, Kubemark-backed machines to their clusters. · 72 stars · Updated 3 weeks ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community. · 12 stars · Updated this week
- 35 stars · Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replication · 411 stars · Updated this week
- A toolkit for discovering cluster network topology. · 46 stars · Updated this week
- Gateway API Inference Extension · 256 stars · Updated this week
- Kubernetes LangChain Agent - Interact with Kubernetes Clusters using LLMs · 26 stars · Updated 2 years ago
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec · 20 stars · Updated this week