π An awesome & curated list of best LLMOps tools.
β204Feb 4, 2026Updated 3 weeks ago
Alternatives and similar repositories for Awesome-LLMOps
Users that are interested in Awesome-LLMOps are comparing it to the libraries listed below
Sorting:
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β289Jan 26, 2026Updated last month
- Following the same workflows as Kubernetes. Widely used in InftyAI community.β13Dec 5, 2025Updated 2 months ago
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β25Dec 6, 2024Updated last year
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, iβ¦β12May 16, 2023Updated 2 years ago
- Simplified Data Management and Sharing for Kubernetesβ17Updated this week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprisesβ26Apr 24, 2025Updated 10 months ago
- δΈε½εΌεθ ζ΄»ε¨ζ₯η¨οΌε ³ζ³¨ηΉοΌεΌζΊγεΌεθ γδΊεηοΌβ23Jan 30, 2026Updated last month
- Provides deploy scripts and CSI for Lustre.β14Oct 27, 2025Updated 4 months ago
- π§ Extensive LLM endpoints, expended capabilities through your favorite protocols, πΈοΈ GraphQL, βοΈ gRPC, βΎοΈ WebSocket. Extended SOTA suppβ¦β18Updated this week
- CPU DRA Driverβ32Feb 9, 2026Updated 2 weeks ago
- d.run websiteβ15Feb 9, 2026Updated 2 weeks ago
- WG Servingβ34Dec 15, 2025Updated 2 months ago
- GenAI inference performance benchmarking toolβ151Updated this week
- [Moved to https://github.com/kubernetes-sigs/kwok] This is a fake kubelet. that can simulate any number of nodes and maintain pods on thoβ¦β66Jul 20, 2022Updated 3 years ago
- β22Dec 28, 2024Updated last year
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ62Updated this week
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replicationβ670Feb 18, 2026Updated last week
- Kubernetes LTS(long term support)β216Dec 24, 2024Updated last year
- β47Dec 8, 2025Updated 2 months ago
- Layer4 egress gateway for Kubernetesβ284Feb 21, 2026Updated last week
- Prototypes and experiments for WG Device Management.β15Feb 11, 2026Updated 2 weeks ago
- 𧬠The adaptive model routing system for exploration and exploitation.β22Jan 4, 2026Updated last month
- Open Model Engine (OME) β Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, Tβ¦β380Updated this week
- Fast is a Kubernetes CNI based on eBPF implementationβ36Feb 4, 2024Updated 2 years ago
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/β12Feb 22, 2026Updated last week
- data plane testing utility of cloud nativeβ221Updated this week
- caniuse.com, but for kubernetesβ27Dec 25, 2024Updated last year
- MirageDebug: Local remote debugging for Kubernetes apps, enabling fully authentic environment debugging.β56May 27, 2024Updated last year
- Ferry is a Kubernetes multi-cluster communication component that eliminates communication differences between clusters as if they were inβ¦β104Jun 7, 2023Updated 2 years ago
- Gateway API Inference Extensionβ594Updated this week
- A workload for deploying LLM inference services on Kubernetesβ171Feb 18, 2026Updated last week
- Product ready cluster lifecycle management toolchains based on kubespray and other cluster LCM engine.β524Updated this week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β74Jul 18, 2025Updated 7 months ago
- Operator for the mutating admission webhook for ClusterResourceOverrideβ18Feb 13, 2026Updated 2 weeks ago
- An AI-powered CLI tool to enhance your Markdown workflow, with auto-image downloading, translation, and more features coming soon!β23Nov 4, 2025Updated 3 months ago
- β20Feb 19, 2026Updated last week
- [Moved to https://github.com/kubernetes-sigs/kwok] fake-k8s is a tool for running Fake Kubernetes clusters, It can be used as an alternatβ¦β19Jan 6, 2023Updated 3 years ago
- Federated middleware based on Karmadaβ48Nov 20, 2023Updated 2 years ago
- This project is designed to simulate GPU information, making it easier to test scenarios where a GPU is not available.β65Jan 9, 2026Updated last month