thousrm / universal_NPU-CNN_accelerator
hardware design of universal NPU(CNN accelerator) for various convolution neural network
☆75Updated this week
Related projects ⓘ
Alternatives and complementary repositories for universal_NPU-CNN_accelerator
- A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network☆32Updated 2 months ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆83Updated 4 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆63Updated last year
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆129Updated 4 years ago
- IC implementation of Systolic Array for TPU☆152Updated last month
- tpu-systolic-array-weight-stationary☆18Updated 3 years ago
- Verilog implementation of Softmax function☆48Updated 2 years ago
- Convolutional accelerator kernel, target ASIC & FPGA☆167Updated last year
- ☆60Updated 5 years ago
- ☆93Updated 4 years ago
- verilog实现TPU中的脉动阵列计算卷积的module☆68Updated 2 years ago
- A VGG accelerator by System Verilog on DE1-SoC FPGA. Row Stationary (RS) dataflow is adopted, and computations are based on fixed point 1…☆29Updated 5 years ago
- IC implementation of TPU☆86Updated 4 years ago
- Hardware accelerator for convolutional neural networks☆26Updated 2 years ago
- An HLS based winograd systolic CNN accelerator☆48Updated 3 years ago
- CNN-Accelerator based on FPGA developed by verilog HDL.☆45Updated 4 years ago
- This is a verilog implementation of 4x4 systolic array multiplier☆39Updated 4 years ago
- SystemVerilog files for lab project on a DNN hardware accelerator☆12Updated 3 years ago
- eyeriss-chisel3☆39Updated 2 years ago
- FPGA based Vision Transformer accelerator (Harvard CS205)☆85Updated 11 months ago
- An FPGA Accelerator for Transformer Inference☆73Updated 2 years ago
- ☆26Updated 5 years ago
- Convolutional Neural Network Using High Level Synthesis☆83Updated 4 years ago
- CNN hardware accelerator to accelerate quantized LeNet-5 model☆20Updated last year
- FPGA and GPU acceleration of LeNet5☆35Updated 5 years ago
- A DNN Accelerator implemented with RTL.☆61Updated last year
- Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions☆176Updated 4 years ago
- Deep Learning Accelerator (Convolution Neural Networks)☆166Updated 6 years ago
- ☆64Updated 2 years ago
- This TRD is implement DPU v1.4.0 on PYNQ-Z2 board☆44Updated 4 years ago