dldldlfma / super_small_toy_tpu
☆17Updated 4 years ago
Alternatives and similar repositories for super_small_toy_tpu:
Users who are interested in super_small_toy_tpu are comparing it to the repositories listed below
- tpu-systolic-array-weight-stationary☆20Updated 3 years ago
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network☆40Updated 5 months ago
- Verilog module implementing the TPU's systolic array for computing convolutions☆76Updated 3 years ago
- eyeriss-chisel3☆40Updated 2 years ago
- AIChip 2021 project, NCKU☆14Updated 3 years ago
- CNN hardware accelerator to accelerate quantized LeNet-5 model☆28Updated last year
- ☆100Updated 4 years ago
- AI Chip project☆17Updated 3 years ago
- SystemVerilog files for lab project on a DNN hardware accelerator☆14Updated 3 years ago
- FPGA implementation of an 8x8 weight-stationary systolic array DNN accelerator☆11Updated 3 years ago
- Hardware accelerator for convolutional neural networks☆35Updated 2 years ago
- An open source Verilog Based LeNet-1 Parallel CNNs Accelerator for FPGAs in Vivado 2017☆14Updated 5 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆139Updated 5 years ago
- ☆60Updated 6 years ago
- ☆14Updated last year
- ☆13Updated last year
- FPGA-based CNN accelerator developed in Verilog HDL.☆45Updated 4 years ago
- An HLS-based Winograd systolic CNN accelerator☆50Updated 3 years ago
- Systolic array based simple TPU for CNN on PYNQ-Z2☆22Updated 2 years ago
- CS533 Course Project (ongoing) - Exploring Parallel Architectures for Neural Processing Unit Implementations☆18Updated 7 years ago
- A DNN Accelerator implemented with RTL.☆63Updated last month
- Implements softmax on PYNQ with heterogeneous computing.☆15Updated 6 years ago
- A VGG accelerator by System Verilog on DE1-SoC FPGA. Row Stationary (RS) dataflow is adopted, and computations are based on fixed point 1…☆30Updated 5 years ago
- Convolutional Neural Network Implemented in Verilog for System on Chip☆26Updated 5 years ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆87Updated 4 years ago
- ☆29Updated 5 years ago
- A Reconfigurable Accelerator for Deep Convolutional Neural Networks Implemented by Chisel3.☆27Updated 3 years ago
- Hardware implementation of Spiking Neural Network on a PYNQ-Z1 board☆34Updated 5 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆66Updated last year
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board☆11Updated 4 years ago