llm-d / llm-d-workload-variant-autoscalerView external linksLinks
Variant optimization autoscaler for distributed inference workloads
☆29Updated this week
Alternatives and similar repositories for llm-d-workload-variant-autoscaler
Users that are interested in llm-d-workload-variant-autoscaler are comparing it to the libraries listed below
Sorting:
- SRO supports out-of-tree and third-party kernel drivers and the support software for the node OS via containers.☆16Updated this week
- Test Orchestrator for Performance and Scalability of AI pLatforms☆16Jan 26, 2026Updated 3 weeks ago
- Helm charts for llm-d☆52Jul 22, 2025Updated 6 months ago
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 6 months ago
- Kubernetes-native AI serving platform for scalable model serving.☆217Updated this week
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆50Updated this week
- Linux kernel source tree☆10Oct 11, 2017Updated 8 years ago
- Framework for writing tests for RISC-V CPU/SOC validation.☆11Jan 19, 2026Updated 3 weeks ago
- Model Express is a Rust-based component meant to be placed next to existing model inference systems to speed up their startup times and i…☆28Updated this week
- ☆15Aug 7, 2025Updated 6 months ago
- A Golang Prometheus back-filling library☆10May 30, 2023Updated 2 years ago
- A light weight vLLM simulator, for mocking out replicas.☆85Updated this week
- A script to reorganize 'Want to go' Saved places in Google Maps into separate lists by category.☆12May 14, 2024Updated last year
- LF AI & Data Foundation related logos and artwork☆11Jan 30, 2026Updated 2 weeks ago
- Console for Kamaji, the Kubernetes Control Plane Manager☆15Oct 30, 2025Updated 3 months ago
- Provides deploy scripts and CSI for Lustre.☆14Oct 27, 2025Updated 3 months ago
- ☆18Jun 18, 2025Updated 7 months ago
- Hack for start other istance of wpa_supplicant daemon☆13Nov 16, 2017Updated 8 years ago
- ☆10Aug 29, 2024Updated last year
- ☆12Oct 29, 2012Updated 13 years ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- An example of how to use the kubernetai plugin to do multicluster DNS-based service discovery☆10Nov 21, 2019Updated 6 years ago
- Donovan and Kernighan The Go Programming Language Examples☆14Oct 14, 2020Updated 5 years ago
- This repo hold information on the open-standard OVP APIs☆17Dec 11, 2025Updated 2 months ago
- d.run website☆15Feb 9, 2026Updated last week
- ☆11Mar 15, 2023Updated 2 years ago
- A Kubernetes controller designed to oversee Persistent Volume Claims (PVCs) associated with local storage on worker nodes. Its purpose is…☆14Nov 10, 2025Updated 3 months ago
- OpenShift Cluster Autoscaler Must Gather Investigator☆14Jul 12, 2024Updated last year
- ☆16Updated this week
- underlay plugins for macvlan and sriov-cni☆11Nov 17, 2023Updated 2 years ago
- Intel® AI for Enterprise Inference optimizes AI inference services on Intel hardware using Kubernetes Orchestration. It automates LLM mod…☆33Updated this week
- ☆12Oct 31, 2016Updated 9 years ago
- Prototypes and experiments for WG Device Management.☆14Updated this week
- A kubectl plugin to debug Pods from an IDE rather than the CLI☆10Dec 19, 2024Updated last year
- Collection of bet practices, reference architectures, examples, and utilities for foundation model development and deployment on AWS.☆14Jul 22, 2025Updated 6 months ago
- ☆16Feb 6, 2026Updated last week
- Collection of my Reinforcement Learning (RL) practices including DQN, D3QN, and Adaptive Gamma, applied to the Lunar Lander and CartPole …☆16Oct 21, 2024Updated last year
- CPU DRA Driver☆31Feb 9, 2026Updated last week
- fzf-based test selection with pytest☆14Jan 26, 2026Updated 3 weeks ago