vllm-project / vllm-spyreLinks
Community maintained hardware plugin for vLLM on Spyre
☆37Updated this week
Alternatives and similar repositories for vllm-spyre
Users that are interested in vllm-spyre are comparing it to the libraries listed below
Sorting:
- A high-throughput and memory-efficient inference and serving engine for LLMs☆85Updated this week
- A hierarchical collective communications library with portable optimizations☆37Updated last year
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆13Updated 8 months ago
- MAD (Model Automation and Dashboarding)☆30Updated last week
- Systematic and comprehensive benchmarks for LLM systems.☆44Updated 3 weeks ago
- llm-d benchmark scripts and tooling☆39Updated this week
- NVIDIA NCCL Tests for Distributed Training☆129Updated this week
- This is the public repo for the MLPerf DeepCAM climate data segmentation proposal.☆16Updated 2 months ago
- A tool to detect infrastructure issues on cloud native AI systems☆52Updated 3 months ago
- Cloud Native Benchmarking of Foundation Models☆44Updated 4 months ago
- ☆16Updated last month
- COCCL: Compression and precision co-aware collective communication library☆29Updated 9 months ago
- rocSHMEM intra-kernel networking runtime for AMD dGPUs on the ROCm platform.☆133Updated this week
- ☆24Updated 2 months ago
- Reference implementations of MLPerf™ HPC training benchmarks☆49Updated 9 months ago
- Offline optimization of your disaggregated Dynamo graph☆128Updated this week
- CloudAI Benchmark Framework☆77Updated this week
- A recommendation model kernel optimizing system☆12Updated 6 months ago
- An I/O benchmark for deep Learning applications☆94Updated last week
- This is a plugin which lets EC2 developers use libfabric as network provider while running NCCL applications.☆201Updated last week
- CUDA GPU Benchmark☆35Updated 10 months ago
- Intel® Extension for DeepSpeed* is an extension to DeepSpeed that brings feature support with SYCL kernels on Intel GPU(XPU) device. Note…☆63Updated 5 months ago
- Parallel Code Evaluation Benchmark☆39Updated last month
- Magnum IO community repo☆105Updated 2 weeks ago
- A Micro-benchmarking Tool for HPC Networks☆33Updated 3 months ago
- RCCL Performance Benchmark Tests☆82Updated last week
- [DEPRECATED] Moved to ROCm/rocm-systems repo☆29Updated last week
- Multi-GPU communication profiler and visualizer☆37Updated last year
- Ongoing research training transformer models at scale☆34Updated last week
- ☆17Updated 3 months ago