cea-wind / SimpleTPU
An FPGA-based CNN accelerator, following Google's TPU V1.
☆150Updated 5 years ago
Alternatives and similar repositories for SimpleTPU:
Users who are interested in SimpleTPU are comparing it to the repositories listed below.
- Deep Learning Accelerator (Convolution Neural Networks)☆179Updated 7 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…☆157Updated 5 years ago
- IC implementation of TPU☆122Updated 5 years ago
- ☆108Updated 4 years ago
- Convolutional accelerator kernel, target ASIC & FPGA☆196Updated 2 years ago
- FPGA/AES/LeNet/VGG16☆103Updated 6 years ago
- ☆64Updated 6 years ago
- A Verilog module implementing the TPU's systolic array for computing convolutions☆100Updated 3 years ago
- Convolutional Neural Network Using High Level Synthesis☆87Updated 4 years ago
- This is a fully parameterized Verilog implementation of computation kernels for acceleration of the Inference of Convolutional Neural Netw…☆177Updated last year
- Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions☆190Updated 4 years ago
- ☆38Updated 4 years ago
- IC implementation of Systolic Array for TPU☆232Updated 6 months ago
- Chinese:☆97Updated 5 years ago
- Implements softmax on PYNQ with heterogeneous computing.☆63Updated 6 years ago
- FPGA-based ZynqNet CNN accelerator developed by Vivado_HLS☆112Updated 7 years ago
- A LeNet RTL implementation on FPGA☆46Updated 7 years ago
- FPGA and GPU acceleration of LeNet5☆35Updated 5 years ago
- NVDLA (An Opensource DL Accelerator Framework) implementation on FPGA.☆335Updated last year
- CNN accelerator implemented with Spinal HDL☆149Updated last year
- hls code zynq 7020 pynq z2 CNN☆85Updated 6 years ago
- GPGPU supporting RISCV-V, developed with verilog HDL☆95Updated 2 months ago
- Squeezenet V1.1 on Cyclone V SoC-FPGA at 450ms/image, 20x faster than ARM A9 processor alone. A project for 2017 Innovate FPGA design con…☆109Updated 6 years ago
- ☆65Updated 2 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3☆72Updated last year
- FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference☆145Updated last year
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed.☆99Updated 4 years ago
- Hardware design of a universal NPU (CNN accelerator) for various convolutional neural networks☆118Updated 2 months ago
- Small-scale Tensor Processing Unit built on an FPGA☆181Updated 5 years ago
- DPU on PYNQ☆219Updated last year