Helm charts for llm-d
☆52Jul 22, 2025Updated 7 months ago
Alternatives and similar repositories for llm-d-deployer
Users that are interested in llm-d-deployer are comparing it to the libraries listed below
Sorting:
- Incubating P/D sidecar for llm-d☆16Nov 13, 2025Updated 3 months ago
- A light weight vLLM simulator, for mocking out replicas.☆87Updated this week
- llm-d helm charts and deployment examples☆50Feb 26, 2026Updated last week
- Inference scheduler for llm-d☆135Updated this week
- Variant optimization autoscaler for distributed inference workloads☆33Updated this week
- Distributed KV cache scheduling & offloading libraries☆108Updated this week
- Simplified model deployment on llm-d☆28Jul 2, 2025Updated 8 months ago
- ☆18Jun 18, 2025Updated 8 months ago
- llm-d benchmark scripts and tooling☆48Updated this week
- ☆15Feb 26, 2026Updated last week
- Introduction to Red Hat OpenShift AI (RHOAI)☆14Apr 11, 2024Updated last year
- Achieve state of the art inference performance with modern accelerators on Kubernetes☆2,543Updated this week
- Automatically scales Kubernetes controllers to zero☆16May 30, 2019Updated 6 years ago
- ☆20Feb 28, 2026Updated last week
- Collection of demos for building Llama Stack based apps on OpenShift☆61Feb 26, 2026Updated last week
- An Envoy inspired, ultimate LLM-first gateway for LLM serving and downstream application developers and enterprises☆26Apr 24, 2025Updated 10 months ago
- Gateway API Inference Extension☆597Updated this week
- This project makes running the InstructLab large language model (LLM) fine-tuning process easy and flexible on OpenShift☆27Aug 27, 2025Updated 6 months ago
- A collection of useful Go libraries to ease the development of NVIDIA Operators for GPU/NIC management.☆29Updated this week
- caniuse.com, but for kubernetes☆27Dec 25, 2024Updated last year
- label ALL kubectl, kustomize, and helm objects, inline, without extra steps.(including namespaces and CRDs)☆15Apr 22, 2024Updated last year
- SaaS for Tekton Pipelines☆23Aug 15, 2024Updated last year
- ☆31Apr 19, 2025Updated 10 months ago
- Code for "Re-evaluating Word Mover’s Distance" (ICML 2022)☆40Jun 15, 2022Updated 3 years ago
- Cloud Native Benchmarking of Foundation Models☆45Jul 31, 2025Updated 7 months ago
- A benchmarking tool for comparing different LLM API providers' DeepSeek model deployments.☆30Mar 28, 2025Updated 11 months ago
- GenAI inference performance benchmarking tool☆151Feb 27, 2026Updated last week
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- Digital SuperTwin: digital twin of supercomputers☆13Nov 24, 2024Updated last year
- ☆15Sep 19, 2021Updated 4 years ago
- This is the Hobbyist Guide to Installing and Configuring RHOAI for customers. Bring your towel.☆14Apr 15, 2025Updated 10 months ago
- [ICLR2026] The code for "Interp3D: Correspondence-Aware Interpolation for Generative Textured 3D Morphing."☆24Jan 21, 2026Updated last month
- ☆10Aug 15, 2022Updated 3 years ago
- All-in-one environment to use Dria, the collective knowledge for AI.☆14Mar 15, 2024Updated last year
- ☆15Aug 7, 2025Updated 7 months ago
- Workshop on Text Classification at 1729 Conference☆13Sep 4, 2022Updated 3 years ago
- Terraform modules and Ansible playbook for Apache SkyWalking☆12Mar 11, 2024Updated last year
- Kubernetes service bindings utility☆15Feb 14, 2026Updated 2 weeks ago
- this is a template to use for new data science projects in the aiops group☆10Apr 19, 2023Updated 2 years ago