llm-d/llm-d-model-service

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/llm-d/llm-d-model-service)

llm-d / llm-d-model-service

Simplified model deployment on llm-d

☆29

Alternatives and similar repositories for llm-d-model-service

Users that are interested in llm-d-model-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

llm-d / llm-d-benchmark
View on GitHub
llm-d benchmark scripts and tooling
☆62Updated this week
llm-d-incubation / llm-d-modelservice
View on GitHub
helm charts for deploying models with llm-d
☆31Updated this week
llm-d / llm-d-routing-sidecar
View on GitHub
Incubating P/D sidecar for llm-d
☆17Nov 13, 2025Updated 8 months ago
llm-d / llm-d-deployer
View on GitHub
Helm charts for llm-d
☆52Jul 22, 2025Updated last year
llm-d / llm-d-workload-variant-autoscaler
View on GitHub
Variant optimization autoscaler for distributed inference workloads
☆52Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
modelpack / model-csi-driver
View on GitHub
Kubernetes CSI Driver for serving OCI model artifacts
☆28May 25, 2026Updated 2 months ago
clubanderson / labeler
View on GitHub
label ALL kubectl, kustomize, and helm objects, inline, without extra steps.(including namespaces and CRDs)
☆15Apr 22, 2024Updated 2 years ago
llm-d / llm-d-router
View on GitHub
llm-d Router: The intelligent entry point for inference requests
☆272Updated this week
llm-d / llm-d-kv-cache
View on GitHub
Distributed KV cache scheduling & offloading libraries
☆165Updated this week
aztecher / bdc
View on GitHub
BDC is the eBPF powered DNS caching mechanism in kernel inspired by BMC
☆10May 13, 2022Updated 4 years ago
llm-d / llm-d-inference-sim
View on GitHub
A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h…
☆172Updated this week
IBM / controller-zero-scaler
View on GitHub
Automatically scales Kubernetes controllers to zero
☆16May 30, 2019Updated 7 years ago
IBM / kone
View on GitHub
Build and deploy Node.js application on Kubernetes
☆16Sep 17, 2025Updated 10 months ago
iter8-tools / iter8
View on GitHub
Kubernetes release optimizer
☆268Aug 1, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
weimeilin79 / camel-k-example-event-streaming
View on GitHub
☆10Apr 30, 2020Updated 6 years ago
np-guard / netpol-analyzer
View on GitHub
A Golang library for analyzing k8s connectivity-configuration resources (a.k.a. network policies)
☆19Feb 1, 2026Updated 5 months ago
berkmancenter / adf
View on GitHub
Augmented Dickey-Fuller implementation in Go
☆12Mar 15, 2019Updated 7 years ago
gardener-attic / vpa-exporter
View on GitHub
[DEPRECATED] Prometheus exporter for VPA recommendations
☆12Aug 22, 2023Updated 2 years ago
aicoe-aiops / project-template
View on GitHub
this is a template to use for new data science projects in the aiops group
☆10May 13, 2026Updated 2 months ago
IBM / super
View on GitHub
CLI for the Serverless Supercomputer
☆25Sep 17, 2025Updated 10 months ago
openshift / sriov-network-device-plugin
View on GitHub
An SRIOV device plugin plugin
☆15Jul 16, 2026Updated last week
nabla-containers / nabla-containers.github.io
View on GitHub
Nabla Containers blog
☆12May 26, 2021Updated 5 years ago
modelpack / modctl
View on GitHub
Command-line tools for managing OCI model artifacts, which are bundled based on Model Spec
☆78Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ai-dynamo / grove
View on GitHub
Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling
☆245Updated this week
sourcegraph / deploy-sourcegraph-k8s
View on GitHub
Deploy Sourcegraph to a Kubernetes (k8s) cluster with Kustomize for large-scale code search and intelligence
☆15Jul 9, 2026Updated 2 weeks ago
HaleTom / dotfiles
View on GitHub
My dotfiles (managed by stow, XDG cognizant)
☆12Jul 22, 2026Updated last week
kubernetes-sigs / inference-perf
View on GitHub
GenAI inference performance benchmarking tool
☆214Updated this week
IBM / solsa
View on GitHub
Solution Service Architecture
☆26Jun 5, 2024Updated 2 years ago
peterbourgon / lpg
View on GitHub
Local Prometheus and Grafana: work with metrics during development
☆15Aug 5, 2025Updated 11 months ago
llm-d-incubation / llm-d-infra
View on GitHub
llm-d helm charts and deployment examples
☆59May 1, 2026Updated 2 months ago
manusa / actions-setup-minikube
View on GitHub
Set up your GitHub Actions workflow with a specific version of Minikube and Kuberentes
☆62May 21, 2026Updated 2 months ago
russellb / canhazgpu
View on GitHub
A simple GPU reservation tool for single host shared development systems
☆29Jul 6, 2026Updated 3 weeks ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
kerthcet / github-workflow-as-kube
View on GitHub
Following the same workflows as Kubernetes. Widely used in InftyAI community.
☆13May 31, 2026Updated last month
jpedro1992 / scheduler-plugins
View on GitHub
Repository for out-of-tree scheduler plugins based on scheduler framework.
☆12Jul 4, 2026Updated 3 weeks ago
llm-d-incubation / llm-d-planner
View on GitHub
☆25Updated this week
gpustack / gguf-packer-go
View on GitHub
Deliver LLMs of GGUF format via Dockerfile.
☆15Oct 24, 2024Updated last year
IBM / ado
View on GitHub
A framework for designing, executing and analysing experiment campaigns
☆56Updated this week
grafana-cold-storage / grafana-build
View on GitHub
GitHub actions and packages for building Grafana
☆12Jun 5, 2026Updated last month
InftyAI / llmaz
View on GitHub
☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!
☆309Jan 26, 2026Updated 6 months ago