yuyuranium / FPGA-Project-2022-simple-tpu
Systolic array based simple TPU for CNN on PYNQ-Z2
☆23Updated 2 years ago
Alternatives and similar repositories for FPGA-Project-2022-simple-tpu:
Users that are interested in FPGA-Project-2022-simple-tpu are comparing it to the libraries listed below
- Hardware accelerator for convolutional neural networks☆36Updated 2 years ago
- Convolutional Neural Network Implemented in Verilog for System on Chip☆26Updated 5 years ago
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network☆41Updated 5 months ago
- ☆13Updated last year
- ☆60Updated 6 years ago
- 32 - bit floating point Multiplier Accumulator Unit (MAC)☆27Updated 4 years ago
- tpu-systolic-array-weight-stationary☆20Updated 3 years ago
- SystemVerilog files for lab project on a DNN hardware accelerator☆15Updated 3 years ago
- CNN hardware accelerator to accelerate quantized LeNet-5 model☆28Updated last year
- CNN-Accelerator based on FPGA developed by verilog HDL.☆45Updated 4 years ago
- A Verilog design of LeNet-5, a Convolutional Neural Network architecture☆26Updated 4 years ago
- This is a simple project that shows how to multiply two 3x3 matrixes in Verilog.☆50Updated 7 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆66Updated last year
- ☆29Updated 5 years ago
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board☆11Updated 4 years ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆88Updated 4 years ago
- DMA controller for CNN accelerator☆13Updated 7 years ago
- FPGA implement of 8x8 weight stationary systolic array DNN accelerator☆11Updated 3 years ago
- A project on hardware design for convolutional neural network. This neural network is of 2 layers with 400 inputs in the first layer. Thi…☆17Updated 6 years ago
- ☆100Updated 4 years ago
- verilog实现TPU中的脉动阵列计算卷积的module☆77Updated 3 years ago
- The Verilog source code for DRUM approximate multiplier.☆29Updated last year
- CNN accelerator using NoC architecture☆15Updated 6 years ago
- I present a novel pipelined fast Fourier transform (FFT) architecture which is capable of producing the output sequence in normal order. …☆39Updated last year
- eyeriss-chisel3☆40Updated 2 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆139Updated 5 years ago
- Prototype-network-on-chip (ProNoC) is an EDA tool that facilitates prototyping of custom heterogeneous NoC-based many-core-SoC (MCSoC).☆50Updated this week
- Designing CNN accelerator using a Xilinx FPGA board and comparing performance with CPU.☆21Updated 3 years ago
- A systolic array matrix multiplier☆24Updated 5 years ago
- This TRD is implement DPU v1.4.0 on PYNQ-Z2 board☆44Updated 4 years ago