vllm-project / vllm-spyreLinks
Community maintained hardware plugin for vLLM on Spyre
☆26Updated this week
Alternatives and similar repositories for vllm-spyre
Users that are interested in vllm-spyre are comparing it to the libraries listed below
Sorting:
- llm-d benchmark scripts and tooling☆17Updated this week
- Cloud Native Benchmarking of Foundation Models☆38Updated 2 weeks ago
- ☆12Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆76Updated this week
- A hierarchical collective communications library with portable optimizations☆35Updated 6 months ago
- Anatomy of High-Performance GEMM with Online Fault Tolerance on GPUs☆12Updated 2 months ago
- A tool to detect infrastructure issues on cloud native AI systems☆41Updated last month
- OpenAI Triton backend for Intel® GPUs☆191Updated this week
- Ongoing research training transformer models at scale☆23Updated 2 weeks ago
- Caikit is an AI toolkit that enables users to manage models through a set of developer friendly APIs.☆106Updated this week
- NVIDIA NCCL Tests for Distributed Training☆97Updated last week
- oneAPI Technical Advisory Board (TAB) Meeting Notes☆72Updated last year
- Bridge operator repo☆21Updated last month
- Create and deploy virtual-experiments - co-processing computational workflows☆10Updated 2 months ago
- High Performance Linpack for Next-Generation AMD HPC Accelerators☆55Updated last week
- Large Language Model Text Generation Inference on Habana Gaudi☆33Updated 3 months ago
- Reference models for Intel(R) Gaudi(R) AI Accelerator☆164Updated last month
- Provides the examples to write and build Habana custom kernels using the HabanaTools☆21Updated 2 months ago
- A multi-platform experimentation framework written in python.☆56Updated this week
- Magnum IO community repo☆95Updated last month
- RDC☆29Updated this week
- IBM Z Deep Neural Network Library (zDNN) provides an interface for applications making use of Neural Network Processing Assist Facility (…☆16Updated 2 months ago
- ROCm Communication Collectives Library (RCCL)☆342Updated this week
- ☆20Updated 3 months ago
- RDMA and SHARP plugins for nccl library☆197Updated last week
- Systematic and comprehensive benchmarks for LLM systems.☆17Updated last week
- ROC profiler library. Profiling with perf-counters and derived metrics.☆148Updated last week
- A validation and profiling tool for AI infrastructure☆317Updated this week
- ☆19Updated this week
- QuickReduce is a performant all-reduce library designed for AMD ROCm that supports inline compression.☆28Updated 3 months ago