InftyAI / Awesome-LLMOpsLinks
π An awesome & curated list of best LLMOps tools.
β189Updated last week
Alternatives and similar repositories for Awesome-LLMOps
Users that are interested in Awesome-LLMOps are comparing it to the libraries listed below
Sorting:
- βΈοΈ Easy, advanced inference platform for large language models on Kubernetes. π Star to support our work!β287Updated last week
- agent-sandbox enables easy management of isolated, stateful, singleton workloads, ideal for use cases like AI agent runtimes.β825Updated last week
- Kubernetes-native AI serving platform for scalable model serving.β198Updated this week
- A diverse, simple, and secure all-in-one LLMOps platformβ109Updated last year
- LeaderWorkerSet: An API for deploying a group of pods as a unit of replicationβ656Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.β146Updated this week
- WG Servingβ34Updated last month
- llm-d helm charts and deployment examplesβ48Updated last month
- π« A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSIβ¦β25Updated last year
- Gateway API Inference Extensionβ573Updated last week
- GenAI inference performance benchmarking toolβ141Updated last week
- A workload for deploying LLM inference services on Kubernetesβ167Updated this week
- A federation scheduler for multi-clusterβ61Updated last week
- Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.β93Updated 3 months ago
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscalingβ159Updated last week
- π’ Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! π«β229Updated this week
- K8s-mcp-server is a Model Context Protocol (MCP) server that enables AI assistants like Claude to securely execute Kubernetes commands. Iβ¦β181Updated 9 months ago
- MCP Server for kubernetes management and diagnose your cluster and applicationsβ27Updated 8 months ago
- A Site Reliability Engineer AI agent that can monitor application and infrastructure logs, diagnose issues, and report on diagnostics.β141Updated this week
- Inference scheduler for llm-dβ124Updated this week
- JobSet: a k8s native API for distributed ML training and HPC workloadsβ304Updated this week
- Large language model fine-tuning capabilities based on cloud native and distributed computing.β92Updated last year
- Distributed KV cache scheduling & offloading librariesβ101Updated this week
- Fast-track AI innovation with a centralized, trusted, curated registryβ166Updated this week
- Helm charts for llm-dβ52Updated 6 months ago
- MCP server connecting to Kubernetesβ369Updated last month
- Command-line tools for managing OCI model artifacts, which are bundled based on Model Specβ60Updated last week
- knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.β74Updated 6 months ago
- Device-plugin for volcano vgpu which support hard resource isolationβ143Updated last month
- Cloud Native Artifacial Intelligence Model Format Specificationβ175Updated last week