InftyAI / Awesome-LLMOpsLinks

🎉 An awesome & curated list of best LLMOps tools.

☆141

Alternatives and similar repositories for Awesome-LLMOps

Users that are interested in Awesome-LLMOps are comparing it to the libraries listed below

Sorting:

InftyAI / llmaz
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆228Updated this week
NVIDIA / k8s-nim-operator
An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.
☆119Updated this week
llmariner / llmariner
Extensible generative AI platform on Kubernetes with OpenAI-compatible APIs.
☆87Updated last week
kubeagi / arcadia
A diverse, simple, and secure all-in-one LLMOps platform
☆107Updated 10 months ago
kubernetes-sigs / inference-perf
GenAI inference performance benchmarking tool
☆71Updated this week
kubernetes-sigs / lws
LeaderWorkerSet: An API for deploying a group of pods as a unit of replication
☆526Updated this week
copilot-io / runtime-copilot
The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…
☆12Updated 2 years ago
NVIDIA / knavigator
knavigator is a development, testing, and optimization toolkit for AI/ML scheduling systems at scale on Kubernetes.
☆69Updated 2 weeks ago
InftyAI / Manta
💫 A lightweight p2p-based cache system for model distributions on Kubernetes. Reframing now to make it an unified cache system with POSI…
☆24Updated 7 months ago
nekomeowww / ollama-operator
🚢 Yet another operator for running large language models on Kubernetes with ease. Powered by Ollama! 🐫
☆199Updated this week
qingwave / kubewizard
✨Kubewizard is An AI-Agent for automated Kubernetes troubleshooting, and management, based on LangChain and k8s related tools.
☆24Updated 6 months ago
wenhuwang / mcp-k8s-eye
MCP Server for kubernetes management and diagnose your cluster and applications
☆22Updated 2 months ago
opea-project / GenAIInfra
Containerization and cloud native suite for OPEA
☆69Updated 2 weeks ago
kubernetes-sigs / gateway-api-inference-extension
Gateway API Inference Extension
☆415Updated this week
BaizeAI / kcover
🧯 Kubernetes coverage for fault awareness and recovery, works for any LLMOps, MLOps, AI workloads.
☆31Updated last week
DataTunerX / datatunerx
Large language model fine-tuning capabilities based on cloud native and distributed computing.
☆91Updated last year
tensorchord / ai-infra-landscape
This is a landscape of the infrastructure that powers the generative AI ecosystem
☆148Updated 9 months ago
NVIDIA / nvkind
☆162Updated last week
NVIDIA / KAI-Scheduler
KAI Scheduler is an open source Kubernetes Native scheduler for AI workloads at large scale
☆723Updated this week
kubernetes-sigs / jobset
JobSet: a k8s native API for distributed ML training and HPC workloads
☆246Updated this week
d-run / drun-docs
d.run website
☆16Updated this week
sgl-project / ome
OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
☆202Updated this week
kubernetes-sigs / wg-serving
WG Serving
☆28Updated last week
feiskyer / kube-copilot
Kubernetes Copilot powered by AI (OpenAI/Claude/Gemini/etc)
☆179Updated this week
knight42 / kopilot
Your AI Kubernetes Expert
☆183Updated 2 years ago
NVIDIA / topograph
A toolkit for discovering cluster network topology.
☆59Updated last week
alexei-led / k8s-mcp-server
K8s-mcp-server is a Model Context Protocol (MCP) server that enables AI assistants like Claude to securely execute Kubernetes commands. I…
☆159Updated 3 months ago
kubernetes-sigs / dra-example-driver
Example DRA driver that developers can fork and modify to get them started writing their own.
☆85Updated this week
llm-d / llm-d-kv-cache-manager
Distributed KV cache coordinator
☆43Updated this week
kerthcet / github-workflow-as-kube
Following the same workflows as Kubernetes. Widely used in InftyAI community.
☆13Updated 3 weeks ago