Simplified model deployment on llm-d
☆28Jul 2, 2025Updated 9 months ago
Alternatives and similar repositories for llm-d-model-service
Users that are interested in llm-d-model-service are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Incubating P/D sidecar for llm-d☆16Nov 13, 2025Updated 5 months ago
- helm charts for deploying models with llm-d☆30Updated this week
- Inference scheduler for llm-d☆168Updated this week
- Helm charts for llm-d☆52Jul 22, 2025Updated 8 months ago
- A lightweight, configurable, and real-time simulator designed to mimic the behavior of vLLM without the need for GPUs or running actual h…☆113Apr 9, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Kubernetes CSI Driver for serving OCI model artifacts☆25Mar 23, 2026Updated 3 weeks ago
- Distributed KV cache scheduling & offloading libraries☆126Apr 11, 2026Updated last week
- Like `kubectl get all`, but get really all resources☆30Apr 8, 2026Updated last week
- Kubernetes release optimizer☆268Aug 1, 2024Updated last year
- ☆10Apr 30, 2020Updated 5 years ago
- A Golang library for analyzing k8s connectivity-configuration resources (a.k.a. network policies)☆19Feb 1, 2026Updated 2 months ago
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- [DEPRECATED] Prometheus exporter for VPA recommendations☆12Aug 22, 2023Updated 2 years ago
- ☆12Dec 24, 2025Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- this is a template to use for new data science projects in the aiops group☆10Apr 19, 2023Updated 2 years ago
- RPG is tool, that guides people through the creation of a RPM package☆22Mar 16, 2016Updated 10 years ago
- CLI for the Serverless Supercomputer☆25Sep 17, 2025Updated 7 months ago
- KubeStellar - a flexible solution for multi-cluster configuration management for edge, multi-cloud, and hybrid cloud☆652Apr 9, 2026Updated last week
- A workload for deploying LLM inference services on Kubernetes☆203Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.☆12Apr 3, 2026Updated 2 weeks ago
- Deploy Sourcegraph to a Kubernetes (k8s) cluster with Kustomize for large-scale code search and intelligence☆14Updated this week
- Local Kubernetes Benchmark☆13Nov 4, 2022Updated 3 years ago
- My dotfiles (managed by stow, XDG cognizant)☆12Apr 5, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Explore external scalers built by the community.☆12Mar 23, 2026Updated 3 weeks ago
- Set up your GitHub Actions workflow with a specific version of Minikube and Kuberentes☆62Mar 12, 2026Updated last month
- Following the same workflows as Kubernetes. Widely used in InftyAI community.☆13Dec 5, 2025Updated 4 months ago
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆12Mar 6, 2026Updated last month
- Easy Scheduler是一个分布式工作流任务调度系统,主要解决数据研发ETL错综复杂的依赖关系,而不能直观监控任务健康状态等问题。Easy Scheduler以DAG流式的方式将Task组装起来,可实时监控任务的运行状态,同时支持重试、从指定节点恢复失败、暂停及Kil…☆10Apr 9, 2019Updated 7 years ago
- Rust library and CLI tool for reading Alpine Linux’s apk package format and APKBUILD☆10Jan 19, 2026Updated 3 months ago
- Fauxpenshift is a Kubernetes Cluster + The OpenShift Router. That's it!☆23Nov 19, 2022Updated 3 years ago
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 4 months ago
- GitHub actions and packages for building Grafana☆12Mar 9, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A tool for coordinated checkpoint/restore of distributed applications with CRIU☆32Mar 2, 2026Updated last month
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆42Aug 7, 2025Updated 8 months ago
- ☸️ Easy, advanced inference platform for large language models on Kubernetes. 🌟 Star to support our work!☆296Jan 26, 2026Updated 2 months ago
- OpenShift, Kubernetes and Docker Performance Research☆25Mar 17, 2016Updated 10 years ago
- Autoscaling components for Kubernetes☆21Mar 31, 2026Updated 2 weeks ago
- llm-d helm charts and deployment examples☆54Apr 2, 2026Updated 2 weeks ago
- The Intelligent Inference Scheduler for Large-scale Inference Services.☆66Feb 12, 2026Updated 2 months ago