yuyuranium / FPGA-Project-2022-simple-tpu
Systolic array based simple TPU for CNN on PYNQ-Z2
☆30Updated 2 years ago
Alternatives and similar repositories for FPGA-Project-2022-simple-tpu:
Users who are interested in FPGA-Project-2022-simple-tpu are comparing it to the repositories listed below.
- Hardware accelerator for convolutional neural networks☆43Updated 2 years ago
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network☆65Updated 2 months ago
- ☆108Updated 4 years ago
- SystemVerilog files for lab project on a DNN hardware accelerator☆16Updated 3 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆157Updated 5 years ago
- Verilog module implementing the systolic array of a TPU for computing convolutions☆100Updated 3 years ago
- Efficient FPGA-Based Accelerator for Convolutional Neural Networks☆11Updated 9 months ago
- tpu-systolic-array-weight-stationary☆24Updated 3 years ago
- ☆64Updated 6 years ago
- FPGA-based CNN accelerator developed in Verilog HDL.☆48Updated 5 years ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆99Updated 4 years ago
- Convolutional Neural Network Implemented in Verilog for System on Chip☆27Updated 6 years ago
- Convolutional accelerator kernel, target ASIC & FPGA☆196Updated 2 years ago
- CNN hardware accelerator to accelerate quantized LeNet-5 model☆33Updated last year
- A Verilog design of LeNet-5, a Convolutional Neural Network architecture☆30Updated 4 years ago
- This is a Verilog implementation of a 4x4 systolic array multiplier☆53Updated 4 years ago
- ☆38Updated 4 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆72Updated last year
- 3×3 systolic array multiplier☆44Updated 5 years ago
- ☆32Updated 6 years ago
- Designing CNN accelerator using a Xilinx FPGA board and comparing performance with CPU.☆22Updated 4 years ago
- This is a simple project that shows how to multiply two 3x3 matrices in Verilog.☆50Updated 7 years ago
- 32-bit floating-point Multiplier Accumulator Unit (MAC)☆30Updated 4 years ago
- IC implementation of TPU☆122Updated 5 years ago
- ☆14Updated 2 years ago
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board☆14Updated 4 years ago
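- General CNN_Accelerator design. A convolutional neural network accelerator that implements hardware-accelerated computation of convolution, pooling, fully connected, and other layers on the PYNQ-Z2 FPGA development board.☆44Updated 2 months ago
- A LeNet RTL implementation on FPGA☆46Updated 7 years ago
- High Level Synthesis of a trained Convolutional Neural Network for handwritten digit recognition.☆38Updated 9 months ago
- CNN model implemented on an FPGA☆14Updated 5 years ago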