vllm-project/vllm-spyre

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vllm-project/vllm-spyre)

vllm-project / vllm-spyre

Community maintained hardware plugin for vLLM on Spyre

☆49

Alternatives and similar repositories for vllm-spyre

Users that are interested in vllm-spyre are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

foundation-model-stack / fms-model-optimizer
View on GitHub
FMS Model Optimizer is a framework for developing reduced precision neural network models.
☆21Updated this week
fmperf-project / fmperf
View on GitHub
Cloud Native Benchmarking of Foundation Models
☆45Jul 31, 2025Updated 7 months ago
foundation-model-stack / fms-acceleration
View on GitHub
🚀 Collection of libraries used with fms-hf-tuning to accelerate fine-tuning and training of large models.
☆14Jan 30, 2026Updated last month
qiskit-community / spank-plugins
View on GitHub
Slurm Spank plugins for Quantum resources and jobs support
☆49Mar 7, 2026Updated 3 weeks ago
RBLN-SW / vllm-rbln
View on GitHub
vLLM plugin for RBLN NPU
☆44Updated this week
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
llm-d-incubation / llm-d-modelservice
View on GitHub
helm charts for deploying models with llm-d
☆30Mar 17, 2026Updated last week
HicrestLaboratory / SPARTA
View on GitHub
SParse AcceleRation on Tensor Architecture
☆18Apr 7, 2025Updated 11 months ago
conda-forge / openmpi-feedstock
View on GitHub
A conda-smithy repository for openmpi.
☆13Mar 16, 2026Updated last week
containers / ramalama-stack
View on GitHub
An external provider for Llama Stack allowing for the use of RamaLama for inference.
☆21Dec 22, 2025Updated 3 months ago
jjasghar / cloud-native-python-example-app
View on GitHub
A simple app to help play and demo Cloud Native things
☆28Nov 8, 2023Updated 2 years ago
probabilistic-inference-scaling / probabilistic-inference-scaling
View on GitHub
☆52Mar 17, 2025Updated last year
Kuadrant / mcp-gateway
View on GitHub
An envoy-based MCP Gateway
☆56Updated this week
IBM / repo-template
View on GitHub
template repo with recommended content for projects under the IBM org
☆39Dec 22, 2025Updated 3 months ago
llm-d / llm-d-inference-scheduler
View on GitHub
Inference scheduler for llm-d
☆156Updated this week
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
huggingface / lora-fast
View on GitHub
Minimal repository to demonstrate fast LoRA inference with Flux family of models.
☆31Jul 23, 2025Updated 8 months ago
astra-sim / tacos
View on GitHub
TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning
☆32Jun 13, 2025Updated 9 months ago
yejinc00 / PREMIR
View on GitHub
[EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"
☆15Aug 26, 2025Updated 7 months ago
bheemhh1 / 3D_NoC
View on GitHub
Extending BookSim2.0 and HotSpot6.0 for Power, Performance and Thermal evaluation of 3D NoC Architectures
☆13Aug 9, 2019Updated 6 years ago
IBM / ibm-cloud-functions-message-hub-trigger
View on GitHub
IBM Cloud Functions building block - Message Hub Trigger - This project provides a starting point for handling events from Message Hub wi…
☆11Apr 23, 2019Updated 6 years ago
IBM / Simplify-Mainframe-application-deployments-using-Ansible
View on GitHub
Simplify mainframe application deployments using Ansible
☆11Jun 18, 2021Updated 4 years ago
flutter-tizen / embedder
View on GitHub
Flutter embedder for Tizen
☆13Updated this week
project-codeflare / instaslice
View on GitHub
InstaSlice facilitates the use of Dynamic Resource Allocation (DRA) on Kubernetes clusters for GPU sharing
☆30Nov 27, 2024Updated last year
IBM / super
View on GitHub
CLI for the Serverless Supercomputer
☆25Sep 17, 2025Updated 6 months ago
NordVPN Special Discount Offer • Ad
Save on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
foundation-model-stack / fms-dgt
View on GitHub
Synthetic Data Generation for Foundation Models
☆21Nov 10, 2025Updated 4 months ago
hlaueriksson / playwright-dotnet-contrib
View on GitHub
Contributions to Playwright for .NET 🎭🧪
☆12Nov 20, 2023Updated 2 years ago
zorse-project / COBOLEval
View on GitHub
Evaluate LLM-generated COBOL
☆43May 9, 2024Updated last year
josch / cycles_hawick_james
View on GitHub
Finding all the circuits of a directed graph with self-arcs and multiple-arcs by K.A. Hawick and H.A. James
☆19Apr 11, 2013Updated 12 years ago
Fibertree-Project / fibertree
View on GitHub
Fibertree emulator
☆17Nov 4, 2024Updated last year
KarypisLab / PM4GNN
View on GitHub
Graph partitioning for distributed GNN training
☆13Mar 26, 2023Updated 3 years ago
ansible-collections / ibm_zos_ims
View on GitHub
IBM z/OS IMS Collection
☆15Mar 18, 2026Updated last week
IBM / zOSMF
View on GitHub
This is for management of IBM z/OS Management Facility (z/OSMF) One Stop Hub website and sample code.
☆10Nov 5, 2025Updated 4 months ago
BG2BKK / my_benchmark
View on GitHub
benchmark for linux server
☆13Nov 6, 2016Updated 9 years ago
Wordpress hosting with auto-scaling on Cloudways • Ad
Fully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
marcusthierfelder / mpi
View on GitHub
mpi-binding for golang
☆28May 11, 2021Updated 4 years ago
astra-sim / symbolic_tensor_graph
View on GitHub
☆39Aug 25, 2025Updated 7 months ago
ambitus / pyzkiln
View on GitHub
A set of Python building blocks to help with z/OS automation
☆16May 20, 2025Updated 10 months ago
radha-patel / SySTeC
View on GitHub
Performant kernels for symmetric tensors
☆16Aug 22, 2024Updated last year
IBM / developer
View on GitHub
☆10Aug 28, 2018Updated 7 years ago
AndreasBergmeister / graph-generation
View on GitHub
Reference implementation of the paper "Efficient and Scalable Graph Generation through Iterative Local Expansion"
☆16Aug 27, 2025Updated 7 months ago