kaiiiz / hls-spmvLinks
High-level synthesis (HLS) implementation of Sparse Matrix Vector Multiplication
☆18Updated 3 years ago
Alternatives and similar repositories for hls-spmv
Users that are interested in hls-spmv are comparing it to the libraries listed below
Sorting:
- An FPGA Accelerator for Transformer Inference☆92Updated 3 years ago
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).☆91Updated last year
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture☆159Updated this week
- Vitis HLS Library for FINN☆210Updated last month
- RTL generator for SpGEMM☆11Updated 4 years ago
- FPGA based Vision Transformer accelerator (Harvard CS205)☆137Updated 9 months ago
- ☆121Updated 5 years ago
- High Level Synthesis of a trained Convolutional Neural Network for handwritten digit recongnition.☆41Updated last year
- An HLS based winograd systolic CNN accelerator☆54Updated 4 years ago
- TAPA compiles task-parallel HLS program into high-performance FPGA accelerators. UCLA-maintained.☆175Updated 3 months ago
- eyeriss-chisel3☆40Updated 3 years ago
- [ASAP 2020; FPGA 2020] Hardware architecture to accelerate GNNs (common IP modules for minibatch training and full batch inference)☆41Updated 4 years ago
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network☆118Updated 4 months ago
- A Unified Framework for Training, Mapping and Simulation of ReRAM-Based Convolutional Neural Network Acceleration☆35Updated 3 years ago
- High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS☆95Updated last year
- ☆33Updated last year
- Open-source of MSD framework☆16Updated 2 years ago
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.☆82Updated 4 years ago
- Collection of kernel accelerators optimised for LLM execution☆24Updated this week
- IC implementation of TPU☆135Updated 5 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆176Updated 5 years ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆107Updated 5 years ago
- ☆71Updated 6 years ago
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆64Updated 4 months ago
- verilog实现TPU中的脉动阵列计算卷积的module☆141Updated 6 months ago
- Hardware accelerator for convolutional neural networks☆60Updated 3 years ago
- Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ☆59Updated 4 years ago
- This is a verilog implementation of 4x4 systolic array multiplier☆70Updated 5 years ago
- Convolutional Neural Network Using High Level Synthesis☆89Updated 5 years ago
- SAURIA (Systolic-Array tensor Unit for aRtificial Intelligence Acceleration) is an open-source Convolutional Neural Network accelerator b…☆66Updated 2 weeks ago