embedeep / Free-TPU
Free TPU for FPGA with compiler supporting Pytorch/Caffe/Darknet/NCNN. An AI processor for using Xilinx FPGA to solve image classification, detection, and segmentation problem.
☆243Updated last year
Alternatives and similar repositories for Free-TPU:
Users that are interested in Free-TPU are comparing it to the libraries listed below
- ☆242Updated 4 years ago
- FPGA Accelerator for CNN using Vivado HLS☆308Updated 3 years ago
- HLS based Deep Neural Network Accelerator Library for Xilinx Ultrascale+ MPSoCs☆325Updated 5 years ago
- Binarized Convolutional Neural Networks on Software-Programmable FPGAs☆304Updated 4 years ago
- A FPGA Based CNN accelerator, following Google's TPU V1.☆130Updated 5 years ago
- DPU on PYNQ☆208Updated last year
- FPGA based acceleration of Convolutional Neural Networks. The project is developed by Verilog for Altera DE5 Net platform.☆177Updated 8 years ago
- NVDLA is an Open source DL/ML accelerator, which is very suitable for individuals or college students. This is the NOTES when I learn and…☆224Updated 6 years ago
- FPGA implementation of Cellular Neural Network (CNN)☆138Updated 6 years ago
- NVDLA (An Opensource DL Accelerator Framework) implementation on FPGA.☆319Updated last year
- A convolutional neural network implemented in hardware (verilog)☆155Updated 7 years ago
- FPGA-based neural network inference project with an end-to-end approach (from training to implementation to deployment)☆263Updated 5 years ago
- A discussion group on Open Source Deep Learning Accelerator, with technical reports and potential hardware/software issues.☆138Updated 7 years ago
- FPGA accelerated TinyYOLO v2 object detection neural network☆68Updated 6 years ago
- Open Source Specialized Computing Stack for Accelerating Deep Neural Networks.☆207Updated 5 years ago
- This is a fully parameterized verilog implementation of computation kernels for accleration of the Inference of Convolutional Neural Netw…☆163Updated 10 months ago
- A hardware implementation of CNN, written by Verilog and synthesized on FPGA☆218Updated 6 years ago
- FPGA-based ZynqNet CNN accelerator developed by Vivado_HLS☆107Updated 7 years ago
- PYNQ, Neural network Language model, Overlay☆105Updated 5 years ago
- FireSim-NVDLA: NVIDIA Deep Learning Accelerator (NVDLA) Integrated with RISC-V Rocket Chip SoC Running on the Amazon FPGA Cloud☆161Updated 3 years ago
- A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration.☆405Updated 5 years ago
- SDSoC™ (Software-Defined System-On-Chip) Environment Tutorials☆147Updated 5 years ago
- Deep Learning Accelerator (Convolution Neural Networks)☆170Updated 7 years ago
- Implementation of a Tensor Processing Unit for embedded systems and the IoT.☆417Updated 6 years ago
- HLS Project of pp4fpgas - https://github.com/xupsh/pp4fpgas-cn☆235Updated 3 years ago
- hls code zynq 7020 pynq z2 CNN☆79Updated 5 years ago
- Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.☆320Updated last week
- Implementation of CNN using Verilog☆200Updated 7 years ago
- The 1st place winner's source codes for DAC 2018 System Design Contest, FPGA Track☆89Updated 6 years ago
- 在FPGA上面实现一个NPU计算单元。能够执行矩阵运算(ADD/ADDi/ADDs/MULT/MULTi/DOT等)、图像处理运算(CONV/POOL等)、非线性映射(RELU/TANH/SIGM等)。☆212Updated 6 years ago