SSR: Spatial Sequential Hybrid Architecture for Latency-Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted at FPGA'24)
☆35 · Updated this week (as of Mar 1, 2026)
Alternatives and similar repositories for SSR
Users interested in SSR are comparing it to the repositories listed below.
- AIM: Accelerating Arbitrary-precision Integer Multiplication on Heterogeneous Reconfigurable Computing Platform Versal ACAP (Full Paper a… (☆25 · Updated May 18, 2025)
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture (☆164 · Updated this week)
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23) (☆21 · Updated Apr 17, 2024)
- C++ code for an HLS FPGA implementation of a transformer (☆20 · Updated Sep 11, 2024)
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee) (☆59 · Updated Feb 24, 2026)
- [FPGA 2024] Source code and bitstream for LevelST: Stream-based Accelerator for Sparse Triangular Solver (☆15 · Updated Jun 1, 2025)
- SCARIF: a tool to estimate the embodied carbon emissions of data-center servers with accelerator hardware (GPUs, FPGAs, etc.) (☆15 · Updated this week)
- Attentionlego (☆13 · Updated Jan 24, 2024)
- [DATE 2025] Official implementation and dataset of AIrchitect v2: Learning the Hardware Accelerator Design Space through Unified Represen… (☆19 · Updated Jan 17, 2025)
- ☆62 · Updated Mar 24, 2025
- ☆18 · Updated Aug 9, 2025
- Runs on the PYNQ-Z1; the repository contains the relevant Verilog code, Vivado configuration, and C code for SDK testing. The size o… (☆230 · Updated Mar 24, 2024)
- FPGA-based Vision Transformer accelerator (Harvard CS205) (☆150 · Updated Feb 11, 2025)
- Collection of kernel accelerators optimized for LLM execution (☆27 · Updated this week)
- An FPGA accelerator for transformer inference (☆93 · Updated Apr 29, 2022)
- XRM (Xilinx FPGA Resource Manager) documentation (☆25 · Updated Nov 13, 2023)
- TAPA compiles task-parallel HLS programs into high-performance FPGA accelerators. UCLA-maintained. (☆182 · Updated Aug 16, 2025)
- Accelerating a multi-head attention transformer model using HLS for FPGA (☆11 · Updated Dec 7, 2023)
- CNN SIMD-based accelerator using Vitis HLS (☆11 · Updated Jul 15, 2022)
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE (☆17 · Updated Aug 5, 2022)
- ☆13 · Updated Mar 22, 2024
- An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences (☆31 · Updated Mar 7, 2024)
- A series of quick-start guides for the Vitis HLS tool, in Chinese. It explains the basic concepts and the most important optimize techni… (☆26 · Updated Nov 9, 2022)
- ☆119 · Updated Jan 11, 2024
- Open-source AI acceleration on FPGA: from ONNX to RTL (☆49 · Updated Jan 5, 2026)
- ☆126 · Updated this week
- ☆18 · Updated May 1, 2024
- An open-source PyTorch library for developing energy-efficient, multiplication-less models and applications (☆14 · Updated Feb 3, 2025)
- ☆32 · Updated Mar 31, 2025
- Allo Accelerator Design and Programming Framework (PLDI'24) (☆352 · Updated Feb 8, 2026)
- Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts (☆133 · Updated May 10, 2024)
- Xilinx modifications to Halide (☆13 · Updated May 3, 2021)
- MEEP FPGA Shell project, currently supporting the Alveo U280 and U55C (☆14 · Updated Mar 14, 2024)
- FPGA-based hardware accelerator for Vision Transformer (ViT) with a hybrid-grained pipeline (☆126 · Updated Jan 20, 2025)
- ☆16 · Updated Aug 29, 2024
- ☆15 · Updated Aug 10, 2023
- FPGA implementation of an 8x8 weight-stationary systolic-array DNN accelerator (☆17 · Updated Feb 27, 2021)
- An alternative Vivado custom-design example (as opposed to a fully Vitis flow) for the User Logic Partition targeting the VCK5000 (☆13 · Updated Jul 16, 2024)
- Generates Versal system designs from ONNX models, with AI Engine kernels; sub-microsecond latency for autoencoders (☆16 · Updated Dec 29, 2024)