Simplified model deployment on llm-d
☆28Jul 2, 2025Updated 8 months ago
Alternatives and similar repositories for llm-d-model-service
Users that are interested in llm-d-model-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Incubating P/D sidecar for llm-d☆16Nov 13, 2025Updated 4 months ago
- helm charts for deploying models with llm-d☆29Mar 17, 2026Updated last week
- Inference scheduler for llm-d☆142Mar 19, 2026Updated last week
- Helm charts for llm-d☆52Jul 22, 2025Updated 8 months ago
- A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h…☆103Mar 19, 2026Updated last week
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Kubernetes CSI Driver for serving OCI model artifacts☆24Updated this week
- Distributed KV cache scheduling & offloading libraries☆117Mar 20, 2026Updated last week
- Automatically scales Kubernetes controllers to zero☆16May 30, 2019Updated 6 years ago
- Build and deploy Node.js application on Kubernetes☆16Sep 17, 2025Updated 6 months ago
- Kubernetes release optimizer☆266Aug 1, 2024Updated last year
- A Golang library for analyzing k8s connectivity-configuration resources (a.k.a. network policies)☆19Feb 1, 2026Updated last month
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- Scripts and bits to help with ManageIQ daily dev work☆10Oct 29, 2019Updated 6 years ago
- [DEPRECATED] Prometheus exporter for VPA recommendations☆12Aug 22, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- this is a template to use for new data science projects in the aiops group☆10Apr 19, 2023Updated 2 years ago
- Go library for generating "git-compatible" patches☆14Feb 23, 2024Updated 2 years ago
- Nabla Containers blog☆12May 26, 2021Updated 4 years ago
- KubeStellar - a flexible solution for multi-cluster configuration management for edge, multi-cloud, and hybrid cloud☆643Mar 20, 2026Updated last week
- ☆10Apr 7, 2020Updated 5 years ago
- Deploy Sourcegraph to a Kubernetes (k8s) cluster with Kustomize for large-scale code search and intelligence☆14Mar 19, 2026Updated last week
- Local Kubernetes Benchmark☆13Nov 4, 2022Updated 3 years ago
- Explore external scalers built by the community.☆12Feb 19, 2026Updated last month
- Auto-tuning for vllm. Getting the best performance out of your LLM deployment (vllm+guidellm+optuna)☆50Mar 17, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A CLI tool for building, testing and composing repositories in the OpenShift ecosystem.☆21Jan 30, 2022Updated 4 years ago
- Set up your GitHub Actions workflow with a specific version of Minikube and Kuberentes☆62Mar 12, 2026Updated 2 weeks ago
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 3 months ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆12Mar 6, 2026Updated 3 weeks ago
- CLI for creating github gists☆14Apr 20, 2017Updated 8 years ago
- Rust library and CLI tool for reading Alpine Linux’s apk package format and APKBUILD☆10Jan 19, 2026Updated 2 months ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- Fauxpenshift is a Kubernetes Cluster + The OpenShift Router. That's it!☆23Nov 19, 2022Updated 3 years ago
- GitHub actions and packages for building Grafana☆12Mar 9, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆35Aug 7, 2025Updated 7 months ago
- llm-d helm charts and deployment examples☆50Updated this week
- Arks is a cloud-native inference framework running on Kubernetes☆46Jan 14, 2026Updated 2 months ago
- ☆18Jun 7, 2023Updated 2 years ago
- Autoscaling components for Kubernetes☆21Mar 17, 2026Updated last week
- The Intelligent Inference Scheduler for Large-scale Inference Services.☆65Feb 12, 2026Updated last month
- ☆22Sep 29, 2025Updated 5 months ago