llm-d / llm-d-benchmarkLinks
llm-d benchmark scripts and tooling
☆18Updated this week
Alternatives and similar repositories for llm-d-benchmark
Users that are interested in llm-d-benchmark are comparing it to the libraries listed below
Sorting:
- A tool to detect infrastructure issues on cloud native AI systems☆42Updated last month
- Systematic and comprehensive benchmarks for LLM systems.☆19Updated 2 weeks ago
- Cloud Native Benchmarking of Foundation Models☆38Updated last month
- Community maintained hardware plugin for vLLM on Spyre☆30Updated this week
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆43Updated 9 months ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆41Updated this week
- Create and deploy virtual-experiments - co-processing computational workflows☆10Updated this week
- ☆45Updated 4 months ago
- Intent Driven Orchestration enables management of applications through their Service Level Objectives, while minimizing developer and adm…☆38Updated 3 months ago
- NVIDIA Network Operator☆263Updated this week
- A collection of community maintained NRI plugins☆85Updated 2 weeks ago
- Enabling Kubernetes to make pod placement decisions with platform intelligence.☆175Updated 5 months ago
- Bridge operator repo☆21Updated 2 months ago
- The operator manages the ovn-kube components running on the DPU card for enabling OVS hardware offloading.☆28Updated 2 months ago
- Trusted Service Identity is closing the gap of preventing access to secrets by an untrusted operator during the process of obtaining auth…☆27Updated 2 months ago
- Kubernetes Container Runtime Interface proxy service with hardware resource aware workload placement policies☆179Updated 3 months ago
- Holistic job manager on Kubernetes☆117Updated last year
- ☆253Updated 3 weeks ago
- Inference scheduler for llm-d☆67Updated this week
- Health checks for Azure N- and H-series VMs.☆46Updated 2 weeks ago
- Simplified model deployment on llm-d☆25Updated 2 weeks ago
- ☆43Updated last year
- MIG Partition Editor for NVIDIA GPUs☆204Updated last week
- An Operator for deployment and maintenance of NVIDIA NIMs and NeMo microservices in a Kubernetes environment.☆117Updated this week
- DOCA Platform manages provisioning and service orchestration for Bluefield DPUs☆44Updated last month
- A light weight vLLM simulator, for mocking out replicas.☆30Updated this week
- Test Orchestrator for Performance and Scalability of AI pLatforms☆15Updated this week
- OpenShift Migration Controller☆22Updated 2 weeks ago
- open-cluster-management governance material.☆64Updated last week
- The Network Plumbing Working Group Community information☆24Updated last month