wubinyi / Convolutional-Neural-Network-Accelerator
Deep learning accelerator for convolutional layer (convolution operation) and fully-connected layer(matrix-multiplication).
☆20Updated 6 years ago
Alternatives and similar repositories for Convolutional-Neural-Network-Accelerator:
Users that are interested in Convolutional-Neural-Network-Accelerator are comparing it to the libraries listed below
- eyeriss-chisel3☆40Updated 2 years ago
- tpu-systolic-array-weight-stationary☆20Updated 3 years ago
- Hardware accelerator for convolutional neural networks☆36Updated 2 years ago
- verilog实现TPU中的脉动阵列计算卷积的module☆77Updated 3 years ago
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network☆44Updated 5 months ago
- ☆100Updated 4 years ago
- CNN hardware accelerator to accelerate quantized LeNet-5 model☆28Updated last year
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆139Updated 5 years ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆88Updated 4 years ago
- 32 - bit floating point Multiplier Accumulator Unit (MAC)☆27Updated 4 years ago
- An open source Verilog Based LeNet-1 Parallel CNNs Accelerator for FPGAs in Vivado 2017☆14Updated 5 years ago
- A systolic array matrix multiplier☆24Updated 5 years ago
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board☆11Updated 4 years ago
- ☆60Updated 6 years ago
- Open-source of MSD framework☆16Updated last year
- FPGA implement of 8x8 weight stationary systolic array DNN accelerator☆11Updated 3 years ago
- SystemVerilog files for lab project on a DNN hardware accelerator☆15Updated 3 years ago
- ☆29Updated 5 years ago
- ☆12Updated 8 months ago
- ☆13Updated last year
- HLS implemented systolic array structure☆41Updated 7 years ago
- A verilog implementation for Network-on-Chip☆71Updated 7 years ago
- ☆70Updated 10 years ago
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆48Updated this week
- A co-design architecture on sparse attention☆51Updated 3 years ago
- A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching☆46Updated 4 months ago
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.☆68Updated 3 years ago
- A bit-level sparsity-awared multiply-accumulate process element.☆13Updated 7 months ago
- RTL generator for SpGEMM☆9Updated 4 years ago
- An integrated CGRA design framework☆85Updated 3 months ago