vllm-project / vllm-spyreLinks
Community maintained hardware plugin for vLLM on Spyre
☆30Updated this week
Alternatives and similar repositories for vllm-spyre
Users that are interested in vllm-spyre are comparing it to the libraries listed below
Sorting:
- llm-d benchmark scripts and tooling☆18Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆42Updated last month
- ☆22Updated 2 months ago
- Cloud Native Benchmarking of Foundation Models☆38Updated last month
- ☆12Updated 2 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆77Updated this week
- ☆45Updated 4 months ago
- ☆16Updated 3 months ago
- A novel temporal fusion framework for propelling autoregressive model inference☆11Updated this week
- Magnum IO community repo☆95Updated 2 months ago
- ☆48Updated this week
- A hierarchical collective communications library with portable optimizations☆35Updated 7 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆91Updated this week
- Systematic and comprehensive benchmarks for LLM systems.☆19Updated 2 weeks ago
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 4 months ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆25Updated last month
- Using C++ magic to launch/capture CUDA kernels and tune them with Kernel Tuner☆20Updated last year
- Bandwidth test for ROCm☆60Updated this week
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated 2 weeks ago
- ☆18Updated last month
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Updated 3 years ago
- Ongoing research training transformer models at scale☆25Updated last week
- Large Language Model Text Generation Inference on Habana Gaudi☆34Updated 3 months ago
- ☆86Updated last week
- RDC☆30Updated this week
- A validation and profiling tool for AI infrastructure☆321Updated last week
- This repo contains documents of the OPEA project☆42Updated last week
- CloudAI Benchmark Framework☆68Updated this week
- NVIDIA NCCL Tests for Distributed Training☆97Updated 3 weeks ago
- Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…☆62Updated 3 weeks ago