vllm-project / vllm-spyreLinks
Community maintained hardware plugin for vLLM on Spyre
☆30Updated last week
Alternatives and similar repositories for vllm-spyre
Users that are interested in vllm-spyre are comparing it to the libraries listed below
Sorting:
- llm-d benchmark scripts and tooling☆21Updated this week
- ☆16Updated 4 months ago
- ☆22Updated 2 weeks ago
- ☆12Updated this week
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Updated 3 years ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆12Updated 4 months ago
- A CUTLASS implementation using SYCL☆32Updated this week
- Cloud Native Benchmarking of Foundation Models☆39Updated last week
- Reference implementations of MLPerf™ HPC training benchmarks☆48Updated 5 months ago
- ☆18Updated 2 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆78Updated this week
- A tool to detect infrastructure issues on cloud native AI systems☆44Updated 2 weeks ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆61Updated last month
- RDC☆29Updated this week
- CPU and GPU tutorial examples☆13Updated 4 months ago
- TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning☆25Updated last month
- ☆50Updated this week
- A hierarchical collective communications library with portable optimizations☆36Updated 8 months ago
- COCCL: Compression and precision co-aware collective communication library☆24Updated 4 months ago
- ☆47Updated last week
- Create and deploy virtual-experiments - co-processing computational workflows☆10Updated 3 weeks ago
- ☆8Updated 3 weeks ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆98Updated this week
- ☆12Updated 8 months ago
- A Micro-benchmarking Tool for HPC Networks☆32Updated 2 weeks ago
- NVIDIA NCCL Tests for Distributed Training☆102Updated 2 weeks ago
- Ongoing research training transformer models at scale☆24Updated this week
- MAD (Model Automation and Dashboarding)☆23Updated last week
- High-Performance Linpack Benchmark adopted version for GPU backend☆11Updated 2 years ago
- ☆20Updated last week