llm-d / llm-d-benchmarkLinks
llm-d benchmark scripts and tooling
☆17Updated last week
Alternatives and similar repositories for llm-d-benchmark
Users that are interested in llm-d-benchmark are comparing it to the libraries listed below
Sorting:
- Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others☆42Updated 8 months ago
- InstaSlice Operator facilitates slicing of accelerators using stable APIs☆38Updated this week
- Community maintained hardware plugin for vLLM on Spyre☆26Updated this week
- Inference scheduler for llm-d☆57Updated this week
- Simplified model deployment on llm-d☆24Updated 3 weeks ago
- A tool to detect infrastructure issues on cloud native AI systems☆39Updated last month
- Test Orchestrator for Performance and Scalability of AI pLatforms☆15Updated this week
- Helm charts for llm-d☆43Updated this week
- Trusted Service Identity is closing the gap of preventing access to secrets by an untrusted operator during the process of obtaining auth…☆27Updated last month
- Cloud Native Benchmarking of Foundation Models☆38Updated 2 weeks ago
- Systematic and comprehensive benchmarks for LLM systems.☆17Updated last week
- OpenShift Migration Controller☆22Updated this week
- InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing☆28Updated 6 months ago
- Seamlessly use VM based TEEs with Kubernetes for data-in use protection☆36Updated 3 years ago
- Model Server for Kepler☆27Updated this week
- ODH integration with AI at the Edge usecases☆12Updated 7 months ago
- A light weight vLLM simulator, for mocking out replicas.☆26Updated this week
- The project delivers a comprehensive full-stack solution for the Intel® Enterprise AI Foundation on the OpenShift platform to provision I…☆20Updated 3 weeks ago
- The kernel module management operator builds, signs and loads kernel modules on OpenShift.☆28Updated this week
- Operator for RHOCS