xupsh / ccc2021
view at https://xupsh.github.io/ccc2021/
☆24Updated 2 years ago
Related projects: ⓘ
- eyeriss-chisel3☆35Updated 2 years ago
- Template-based Reconfigurable Architecture Modeling Framework☆13Updated 2 years ago
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board☆9Updated 4 years ago
- ☆58Updated 5 years ago
- An integrated CGRA design framework☆82Updated 9 months ago
- [ASAP 2020; FPGA 2020] Hardware architecture to accelerate GNNs (common IP modules for minibatch training and full batch inference)☆41Updated 3 years ago
- An HLS based winograd systolic CNN accelerator☆46Updated 3 years ago
- A Unified Framework for Training, Mapping and Simulation of ReRAM-Based Convolutional Neural Network Acceleration☆30Updated 2 years ago
- An Automated Framework for Generic Graph Neural Network Accelerator Generation, Simulation, and Optimization☆18Updated 9 months ago
- High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS☆77Updated last month
- Systolic array implementations for Cholesky, LU, and QR decomposition☆38Updated 5 years ago
- ☆65Updated last year
- A systolic array matrix multiplier☆22Updated 5 years ago
- This work implements a dynamic programming algorithm for performing local sequence alignment. Through parallelism, it can run 136X times …☆20Updated 5 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆121Updated 4 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆62Updated last year
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).☆62Updated last month
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.☆60Updated 2 years ago
- An Open-Source Tool for CGRA Accelerators☆50Updated last month
- This repository contains all the parameters you need to synthesize the AlexNet by using Vivado High Level Synthesis.☆20Updated 6 years ago
- ☆67Updated 4 years ago
- verilog实现TPU中的脉动阵列计算卷积的module☆66Updated 2 years ago
- HLS implemented systolic array structure☆38Updated 6 years ago
- A DSL for Systolic Arrays☆73Updated 5 years ago
- ☆38Updated this week
- 3×3脉动阵列乘法器☆33Updated 5 years ago
- A verilog implementation for Network-on-Chip☆60Updated 6 years ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆79Updated 3 years ago
- MAERI: A DNN accelerator with reconfigurable interconnects to support flexible dataflow (http://synergy.ece.gatech.edu/tools/maeri/)☆56Updated 2 years ago
- ☆18Updated 11 months ago