Simplified model deployment on llm-d
☆28Jul 2, 2025Updated 8 months ago
Alternatives and similar repositories for llm-d-model-service
Users that are interested in llm-d-model-service are comparing it to the libraries listed below
Sorting:
- llm-d benchmark scripts and tooling☆48Updated this week
- Incubating P/D sidecar for llm-d☆16Nov 13, 2025Updated 3 months ago
- Kubernetes CSI Driver for serving OCI model artifacts☆24Updated this week
- Distributed KV cache scheduling & offloading libraries☆108Updated this week
- helm charts for deploying models with llm-d☆28Updated this week
- Helm charts for llm-d☆52Jul 22, 2025Updated 7 months ago
- A light weight vLLM simulator, for mocking out replicas.☆87Updated this week
- A tool for coordinated checkpoint/restore of distributed applications with CRIU☆31Updated this week
- Vue's plugin to easily integrate pagination.☆10Oct 30, 2018Updated 7 years ago
- Artifacts for the Distributed Workloads stack as part of ODH☆33Feb 26, 2026Updated last week
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆34Aug 7, 2025Updated 7 months ago
- llm-d helm charts and deployment examples☆50Feb 26, 2026Updated last week
- Repository for go shared libraries (for now).☆11Dec 1, 2025Updated 3 months ago
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆2,543Feb 28, 2026Updated last week
- A workload for deploying LLM inference services on Kubernetes☆179Updated this week
- A minimal provisioning agent designed for Azure Linux VMs.☆15Feb 18, 2026Updated 2 weeks ago
- Arks is a cloud-native inference framework running on Kubernetes☆46Jan 14, 2026Updated last month
- 🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.☆13Jan 30, 2026Updated last month
- A Rust library for reading a user's Docker credentials from config.☆10Sep 30, 2025Updated 5 months ago
- Scripts and bits to help with ManageIQ daily dev work☆10Oct 29, 2019Updated 6 years ago
- Console for Kamaji, the Kubernetes Control Plane Manager☆15Oct 30, 2025Updated 4 months ago
- Key-value store in Go with segmented logs, compaction, bloom filters & HTTP API☆22Jan 28, 2026Updated last month
- A CSV formatted file using data from the Yelp Academic Dataset.☆12May 7, 2016Updated 9 years ago
- Hyper Parameter Optimization☆13Feb 7, 2025Updated last year
- These are the files used to create the evezor mass production coaster demo http://evezor.com/coasters☆11May 16, 2017Updated 8 years ago
- handle push notifications & other stuff☆10May 22, 2023Updated 2 years ago
- [DEPRECATED] Prometheus exporter for VPA recommendations☆12Aug 22, 2023Updated 2 years ago
- A performance testing and analysis automation framework☆14Feb 27, 2026Updated last week
- ☆10Apr 30, 2020Updated 5 years ago
- A limit order match engine and backend service with simple account management using RESTful API in Rust-lang.☆17Jan 3, 2023Updated 3 years ago
- Dll hijack -- just one macro☆13Jul 3, 2023Updated 2 years ago
- ☆11Dec 16, 2025Updated 2 months ago
- ☆10Apr 7, 2020Updated 5 years ago
- A CLI tool for running AI agents inside microVM sandboxes☆30Updated this week
- GitHub actions and packages for building Grafana☆12Updated this week
- CPU DRA Driver☆32Feb 27, 2026Updated last week
- my docker docker-compose kubernetes shell start/stop/delete/cleanup ... scripts☆16Jan 7, 2021Updated 5 years ago
- Bits of Terraform that you can use to do bad things in CI/CD pipelines that run Terraform☆10Nov 10, 2020Updated 5 years ago
- Satoru keeper service 🦀.☆14Aug 9, 2024Updated last year