MeshInfra/WaferLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MeshInfra/WaferLLM)

MeshInfra / WaferLLM

WaferLLM: Large Language Model Inference at Wafer Scale

☆112

Alternatives and similar repositories for WaferLLM

Users that are interested in WaferLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

spcl / spatial-collectives
View on GitHub
Optimized communication collectives for the Cerebras waferscale engine
☆17Jun 5, 2024Updated 2 years ago
Cerebras / sdk-examples
View on GitHub
☆47Apr 27, 2026Updated 2 months ago
IPADS-SAI / WaferAI-SIM
View on GitHub
The wafer-native AI accelerator simulation platform and inference engine.
☆56Jan 1, 2026Updated 6 months ago
cherichy / tilecute
View on GitHub
☆32Jul 2, 2025Updated last year
astra-sim / tacos
View on GitHub
TACOS: [T]opology-[A]ware [Co]llective Algorithm [S]ynthesizer for Distributed Machine Learning
☆37Jun 13, 2025Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
foundation-model-stack / vllm-triton-backend
View on GitHub
A Triton-only attention backend for vLLM
☆27Jul 14, 2026Updated last week
tile-ai / tilescale
View on GitHub
Tile-based language built for AI computation across all scales
☆173Jun 16, 2026Updated last month
microsoft / FractalTensor
View on GitHub
FractalTensor is a programming framework that introduces a novel approach to organizing data in deep neural networks (DNNs) as a list of …
☆32Dec 21, 2024Updated last year
infinigence / HamiltonAttention
View on GitHub
☆45Oct 15, 2025Updated 9 months ago
astra-sim / astra-sim
View on GitHub
ASTRA-sim2.0: Modeling Hierarchical Networks and Disaggregated Systems for Large-model Training at Scale
☆641Apr 25, 2026Updated 2 months ago
PrincetonUniversity / LLMCompass
View on GitHub
☆260Oct 24, 2025Updated 8 months ago
TiledTensor / TiledLower
View on GitHub
TiledLower is a Dataflow Analysis and Codegen Framework written in Rust.
☆13Nov 23, 2024Updated last year
Chivier / easy-gpt4o
View on GitHub
Easy-GPT4O opensource version
☆77May 15, 2024Updated 2 years ago
mikeurbach / egg-netlist-synthesizer
View on GitHub
Using e-graphs to synthesize netlists from boolean logic.
☆14Jul 26, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
peichenxie / FPRev
View on GitHub
☆26May 9, 2025Updated last year
scalesim-project / SCALE-Sim
View on GitHub
Repository to host and maintain SCALE-Sim code
☆498Jun 28, 2026Updated 3 weeks ago
egraphs-good / egraph-serialize
View on GitHub
egraph <-> json
☆17Dec 29, 2025Updated 6 months ago
leesou / H2-LLM-ISCA-2025
View on GitHub
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
☆113Apr 26, 2025Updated last year
microsoft / TileIR
View on GitHub
☆31Feb 28, 2025Updated last year
CLab-HKUST-GZ / micro58-axcore
View on GitHub
☆41Oct 21, 2025Updated 9 months ago
Yinxiao-Feng / chiplet-network-sim
View on GitHub
☆65Jun 3, 2025Updated last year
tile-ai / TileOPs
View on GitHub
High-performance LLM operator library built on TileLang.
☆161Updated this week
georgia-tech-synergy-lab / SparseAccelerator-RTL
View on GitHub
Accelerator RTL inspired by VEGETA [HPCA'23] and MicroScopiQ [ISCA'25]
☆15Nov 11, 2025Updated 8 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
SET-Scheduling-Project / GEMINI-HPCA2024
View on GitHub
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
☆116Apr 28, 2025Updated last year
cornell-zhang / allo
View on GitHub
Allo Accelerator Design and Programming Framework (PLDI'24)
☆393Updated this week
chenyu-jiang / dcp
View on GitHub
Code repository for the SOSP'25 paper DCP: Addressing Input Dynamism In Long-Context Training via Dynamic Context Parallelism.
☆21Nov 28, 2025Updated 7 months ago
SET-Scheduling-Project / SoMa-HPCA2025
View on GitHub
☆30Feb 27, 2025Updated last year
yc2367 / P3-LLM
View on GitHub
☆23Apr 3, 2026Updated 3 months ago
SJTU-ReArch-Group / M2XFP_ASPLOS26
View on GitHub
[ASPLOS 2026] M2XFP: A Metadata-Augmented Microscaling Data Format for Efficient Low-bit Quantization.
☆15Jan 29, 2026Updated 5 months ago
DeepWok / mase
View on GitHub
Machine-Learning Accelerator System Exploration Tools
☆204Jul 13, 2026Updated last week
lcy-seso / DLFrameworkTest
View on GitHub
My tests and experiments with some popular dl frameworks.
☆17Sep 11, 2025Updated 10 months ago
kvcache-ai / TrEnv-X
View on GitHub
☆95Sep 15, 2025Updated 10 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
pku-liang / Hestia
View on GitHub
☆17Mar 26, 2025Updated last year
PASAUCMerced / Betty
View on GitHub
☆13Feb 16, 2023Updated 3 years ago
huweim / dataflow_architecture
View on GitHub
Research about dataflow architecture
☆15Nov 30, 2023Updated 2 years ago
lucifer1004 / VeloQ
View on GitHub
Agent-friendly GPU profile-query CLI
☆104Jun 22, 2026Updated 3 weeks ago
KULeuven-MICAS / zigzag
View on GitHub
HW Architecture-Mapping Design Space Exploration Framework for Deep Learning Accelerators
☆197Jun 15, 2026Updated last month
horizon-research / imagen
View on GitHub
☆10Mar 8, 2025Updated last year
TiledTensor / TiledCUDA
View on GitHub
We invite you to visit and follow our new repository at https://github.com/microsoft/TileFusion. TiledCUDA is a highly efficient kernel …
☆192Jan 28, 2025Updated last year