leesou/H2-LLM-ISCA-2025

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/leesou/H2-LLM-ISCA-2025)

leesou / H2-LLM-ISCA-2025

H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference

☆100

Alternatives and similar repositories for H2-LLM-ISCA-2025

Users that are interested in H2-LLM-ISCA-2025 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

godfather991 / UniNDP
View on GitHub
Artifact material for [HPCA 2025] #2108 "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
☆54Sep 1, 2025Updated 8 months ago
SAITPublic / PIMSimulator
View on GitHub
Processing-In-Memory (PIM) Simulator
☆234Dec 12, 2024Updated last year
PSAL-POSTECH / ONNXim
View on GitHub
ONNXim is a fast cycle-level simulator that can model multi-core NPUs for DNN inference
☆199Jan 8, 2026Updated 4 months ago
SET-Scheduling-Project / SoMa-HPCA2025
View on GitHub
☆28Feb 27, 2025Updated last year
upmem / upmem_llm_framework
View on GitHub
UPMEM LLM Framework allows profiling PyTorch layers and functions and simulate those layers/functions with a given hardware profile.
☆41Apr 8, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
CMU-SAFARI / pim-ml
View on GitHub
PIM-ML is a benchmark for training machine learning algorithms on the UPMEM architecture, which is the first publicly-available real-worl…
☆25Jan 7, 2025Updated last year
CMU-SAFARI / ramulator2
View on GitHub
Ramulator 2.0 is a modern, modular, extensible, and fast cycle-accurate DRAM simulator. It provides support for agile implementation and …
☆544Updated this week
Yufeng98 / CENT
View on GitHub
Artifact for paper "PIM is All You Need: A CXL-Enabled GPU-Free System for LLM Inference", ASPLOS 2025
☆134May 3, 2025Updated last year
casys-kaist / mNPUsim
View on GitHub
mNPUsim: A Cycle-accurate Multi-core NPU Simulator (IISWC 2023)
☆73Dec 29, 2025Updated 4 months ago
PSAL-POSTECH / PyTorchSim
View on GitHub
PyTorchSim is a Comprehensive, Fast, and Accurate NPU Simulation Framework
☆117Apr 25, 2026Updated 2 weeks ago
VIA-Research / uPIMulator
View on GitHub
☆173Feb 1, 2025Updated last year
casys-kaist / NeuPIMs
View on GitHub
NeuPIMs: NPU-PIM Heterogeneous Acceleration for Batched LLM Inferencing
☆116Jun 19, 2024Updated last year
scale-snu / attacc_simulator
View on GitHub
☆149Jun 24, 2024Updated last year
milo168 / FPGA25_SAT_Accel
View on GitHub
FPGA 2025 SAT Accel: A modern SAT Solver on FPGA Repository
☆14Mar 13, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
GATECH-EIC / DNN-Chip-Predictor
View on GitHub
[ICASSP'20] DNN-Chip Predictor: An Analytical Performance Predictor for DNN Accelerators with Various Dataflows and Hardware Architecture…
☆25Oct 1, 2022Updated 3 years ago
PKUZHOU / GNNear-PACT-2022
View on GitHub
GNNear: Accelerating Full-Batch Training of Graph NeuralNetworks with Near-Memory Processing
☆17Sep 15, 2022Updated 3 years ago
kelvin0207 / SparSynergy
View on GitHub
Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib…
☆24Mar 29, 2025Updated last year
KULeuven-MICAS / zigzag-llm
View on GitHub
Model LLM inference on single-core dataflow accelerators
☆18Dec 16, 2025Updated 4 months ago
IPADS-SAI / WaferAI-SIM
View on GitHub
The wafer-native AI accelerator simulation platform and inference engine.
☆55Jan 1, 2026Updated 4 months ago
clevercool / ANT-Quantization
View on GitHub
☆120Nov 17, 2023Updated 2 years ago
BakrN / vortex
View on GitHub
☆17Mar 8, 2025Updated last year
pku-liang / TileFlow
View on GitHub
TileFlow is a performance analysis tool based on Timeloop for fusion dataflows
☆66Apr 12, 2024Updated 2 years ago
SET-Scheduling-Project / GEMINI-HPCA2024
View on GitHub
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
☆112Apr 28, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
mit-han-lab / spatten
View on GitHub
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆130Aug 27, 2024Updated last year
GATECH-EIC / ViTCoD
View on GitHub
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆131Jun 27, 2023Updated 2 years ago
SFU-HiAccel / CHIP-KNN
View on GitHub
[TRETS'23, FPT'20] CHIP-KNN: Configurable and HIgh-Performance K-Nearest Neighbors Accelerator on Cloud FPGAs
☆18Apr 9, 2024Updated 2 years ago
PolyArch / fp-diannao
View on GitHub
☆14Apr 8, 2025Updated last year
hpc-ulisboa / NDPmulator
View on GitHub
A Full-System Framework for Simulating NDP devices from Caches to DRAM
☆21Jan 12, 2024Updated 2 years ago
FPSG-UIUC / micro24-fusemax-artifact
View on GitHub
MICRO 2024 Evaluation Artifact for FuseMax
☆17Aug 26, 2024Updated last year
sjtu-zhao-lab / SALO
View on GitHub
An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences
☆32Mar 7, 2024Updated 2 years ago
SET-Scheduling-Project / SET-ISCA2023
View on GitHub
The framework for the paper "Inter-layer Scheduling Space Definition and Exploration for Tiled Accelerators" in ISCA 2023.
☆83Mar 12, 2025Updated last year
leesou / PIM-DL-ASPLOS
View on GitHub
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization
☆36Feb 21, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hatsu3 / Sanger
View on GitHub
☆47Aug 23, 2021Updated 4 years ago
AIS-SNU / PID-Comm
View on GitHub
☆28Nov 29, 2024Updated last year
pku-liang / OriGen
View on GitHub
OriGen: Enhancing RTL Code Generation with Code-to-Code Augmentation and Self-Reflection(ICCAD 2024)
☆32Oct 20, 2024Updated last year
pku-liang / TENET
View on GitHub
An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…
☆88Apr 28, 2024Updated 2 years ago
hossamfadeel / Verilog-Based-NoC-Simulator
View on GitHub
Verilog-Based-NoC-Simulator
☆10May 4, 2016Updated 10 years ago
PrincetonUniversity / LLMCompass
View on GitHub
☆243Oct 24, 2025Updated 6 months ago
lshpku / gem5-runahead
View on GitHub
Implementing the Precise Runahead (HPCA'20) in gem5
☆14Oct 5, 2023Updated 2 years ago