Variant optimization autoscaler for distributed inference workloads
☆33Mar 6, 2026Updated this week
Alternatives and similar repositories for llm-d-workload-variant-autoscaler
Users that are interested in llm-d-workload-variant-autoscaler are comparing it to the libraries listed below
Sorting:
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Feb 27, 2026Updated last week
- Helm charts for llm-d☆52Jul 22, 2025Updated 7 months ago
- Kubernetes-native AI serving platform for scalable model serving.☆249Updated this week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆50Updated this week
- ☆15Aug 7, 2025Updated 7 months ago
- A Golang Prometheus back-filling library☆10May 30, 2023Updated 2 years ago
- ☆18Jun 18, 2025Updated 8 months ago
- ☆10Aug 29, 2024Updated last year
- Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and i…☆33Updated this week
- Example code for using Alloy with the Internet Computer.☆10Oct 13, 2025Updated 4 months ago
- ☆14Updated this week
- Unofficial Go SDK for Open AI. A lightweight, powerful framework for multi-agent workflows.☆33Mar 2, 2026Updated last week
- LF AI & Data Foundation related logos and artwork☆11Jan 30, 2026Updated last month
- Hack for start other istance of wpa_supplicant daemon☆13Nov 16, 2017Updated 8 years ago
- Provides deploy scripts and CSI for Lustre.☆14Oct 27, 2025Updated 4 months ago
- A Git remote helper for the Internet Computer Protocol.☆12Feb 23, 2023Updated 3 years ago
- A script to reorganize 'Want to go' Saved places in Google Maps into separate lists by category.☆11May 14, 2024Updated last year
- A light weight vLLM simulator, for mocking out replicas.☆96Updated this week
- An operator for managing workload placement in Openshift clusters with compute nodes of varying architectures☆14Feb 2, 2026Updated last month
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Self-hosted huggingface mirror service. 自建huggingface镜像服务。☆217Jul 16, 2025Updated 7 months ago
- OpenShift Cluster Autoscaler Must Gather Investigator☆14Jul 12, 2024Updated last year
- Cluster doctor skills☆14Feb 20, 2026Updated 2 weeks ago
- fzf-based test selection with pytest☆14Jan 26, 2026Updated last month
- CPU DRA Driver☆33Updated this week
- underlay plugins for macvlan and sriov-cni☆11Nov 17, 2023Updated 2 years ago
- d.run website☆16Feb 26, 2026Updated last week
- ☆16Feb 6, 2026Updated last month
- A Kubernetes controller designed to oversee Persistent Volume Claims (PVCs) associated with local storage on worker nodes. Its purpose is…☆14Nov 10, 2025Updated 3 months ago
- An example of how to use the kubernetai plugin to do multicluster DNS-based service discovery☆10Nov 21, 2019Updated 6 years ago
- Donovan and Kernighan The Go Programming Language Examples☆14Oct 14, 2020Updated 5 years ago
- Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM mod…☆36Updated this week
- A kubectl plugin to debug Pods from an IDE rather than the CLI☆10Dec 19, 2024Updated last year
- The Gitops Release Manager☆24Feb 15, 2022Updated 4 years ago
- Issue tracker for OCP on ARM64 dev preview releases☆13Mar 14, 2022Updated 3 years ago
- Let CI Robot automatically execute commands for your PR/issue in your Github repository, hosting on Github Action does not require your s…☆13Feb 9, 2026Updated last month
- Rancher Prime GC Catalog☆13Updated this week
- ☆16Apr 1, 2025Updated 11 months ago
- helm repo add daocloud https://daocloud.github.io/dce-charts-repackage/☆12Updated this week