sjtu-zhao-lab / SALOLinks

An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences

☆29

Alternatives and similar repositories for SALO

Users that are interested in SALO are comparing it to the libraries listed below

Sorting:

hatsu3 / Sanger
☆48Updated 4 years ago
pku-liang / Sanger
A co-design architecture on sparse attention
☆53Updated 4 years ago
abdelfattah-lab / BitMoD-HPCA-25
☆51Updated 3 months ago
leesou / H2-LLM-ISCA-2025
H2-LLM: Hardware-Dataflow Co-Exploration for Heterogeneous Hybrid-Bonding-based Low-Batch LLM Inference
☆72Updated 6 months ago
GATECH-EIC / ViTALiTy
ViTALiTy (HPCA'23) Code Repository
☆23Updated 2 years ago
jha-lab / acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
☆52Updated last year
jeffreyyu0602 / quantized-training
☆32Updated this week
mit-han-lab / spatten
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆112Updated last year
scalesim-project / scale-sim-v3
☆48Updated 2 months ago
SET-Scheduling-Project / GEMINI-HPCA2024
Open-source Framework for HPCA2024 paper: Gemini: Mapping and Architecture Co-exploration for Large-scale DNN Chiplet Accelerators
☆97Updated 6 months ago
SFU-HiAccel / HiSpMV
[TRETS 2025][FPGA 2024] FPGA Accelerator for Imbalanced SpMV using HLS
☆15Updated 2 months ago
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆122Updated 2 years ago
kelvin0207 / SparSynergy
Open source RTL implementation of Tensor Core, Sparse Tensor Core, BitWave and SparSynergy in the article: "SparSynergy: Unlocking Flexib…
☆19Updated 7 months ago
Zhu-Zixuan / Bitlet-PE
A bit-level sparsity-awared multiply-accumulate process element.
☆17Updated last year
Zhaoshixin-sky / CIM-MLC
[ASPLOS 2024] CIM-MLC: A Multi-level Compilation Stack for Computing-In-Memory Accelerators
☆48Updated last year
Accelergy-Project / micro22-sparseloop-artifact
MICRO22 artifact evaluation for Sparseloop
☆44Updated 3 years ago
maeri-project / FEATHER
A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching
☆67Updated last month
actlab-genesys / GeneSys
An open-source parameterizable NPU generator with full-stack multi-target compilation stack for intelligent workloads.
☆68Updated last month
clevercool / ANT-Quantization
☆111Updated last year
fangjh21 / PALM
PALM: A Efficient Performance Simulator for Tiled Accelerators with Large-scale Model Training
☆19Updated last year
CASR-HKU / MSD-FCCM23
Open-source of MSD framework
☆16Updated 2 years ago
SET-Scheduling-Project / SET-ISCA2023
The framework for the paper "Inter-layer Scheduling Space Definition and Exploration for Tiled Accelerators" in ISCA 2023.
☆76Updated 7 months ago
isakedo / DNNsim
☆35Updated 5 years ago
pku-liang / TENET
An analytical framework that models hardware dataflow of tensor applications on spatial architectures using the relation-centric notation…
☆87Updated last year
KULeuven-MICAS / DeFiNES
A framework for fast exploration of the depth-first scheduling space for DNN accelerators
☆40Updated 2 years ago
scale-snu / attacc_simulator
☆97Updated last year
arc-research-lab / SSR
SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)
☆33Updated this week
diwu1990 / uSystolic-Sim
A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.
☆81Updated 3 years ago
KULeuven-MICAS / stream
Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.
☆61Updated 3 months ago
witmemtech / CIM-Technical-Papers-Collection
Computing in memory optimizes data handling by performing operations directly in memory, ideal for high-speed data processing needs. This…
☆27Updated 11 months ago