spcl / gemm_hls
Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
☆336 Updated 2 months ago
Alternatives and similar repositories for gemm_hls:
Users who are interested in gemm_hls are comparing it to the libraries listed below.
- A collection of extensions for Vitis and Intel FPGA OpenCL to improve developer quality of life. ☆318 Updated 2 months ago
- Vitis HLS Library for FINN ☆191 Updated last week
- Examples shown as part of the tutorial "Productive parallel programming on FPGA with high-level synthesis". ☆199 Updated 3 years ago
- Convolutional accelerator kernel, target ASIC & FPGA ☆191 Updated 2 years ago
- Vitis_Accel_Examples ☆534 Updated last month
- IC implementation of Systolic Array for TPU ☆224 Updated 5 months ago
- FPGA-based neural network inference project with an end-to-end approach (from training to implementation to deployment) ☆268 Updated 5 years ago
- DPU on PYNQ ☆216 Updated last year
- ☆667 Updated 5 months ago
- Deep Learning Accelerator (Convolutional Neural Networks) ☆178 Updated 7 years ago
- Dataflow QNN inference accelerator examples on FPGAs ☆211 Updated 3 weeks ago
- A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration. ☆414 Updated 5 years ago
- Convolutional Neural Network Using High-Level Synthesis ☆86 Updated 4 years ago
- Open Source Specialized Computing Stack for Accelerating Deep Neural Networks. ☆209 Updated 5 years ago
- An FPGA-based CNN accelerator, following Google's TPU V1. ☆148 Updated 5 years ago
- FPGA Accelerator for CNN using Vivado HLS ☆317 Updated 3 years ago
- RapidStream TAPA compiles task-parallel HLS programs into high-frequency FPGA accelerators. ☆165 Updated this week
- A convolutional neural network implemented in hardware (Verilog) ☆157 Updated 7 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler… ☆154 Updated 5 years ago
- Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions ☆188 Updated 4 years ago
- SystemC/C++ library of commonly-used hardware functions and components for HLS. ☆266 Updated 5 months ago
- Rosetta: A Realistic High-level Synthesis Benchmark Suite for Software Programmable FPGAs ☆164 Updated last year
- AutoSA: Polyhedral-Based Systolic Array Compiler ☆218 Updated 2 years ago
- ☆285 Updated 2 weeks ago
- NVDLA (an open-source DL accelerator framework) implementation on FPGA. ☆332 Updated last year
- A hardware implementation of a CNN, written in Verilog and synthesized on FPGA ☆226 Updated 6 years ago
- This repository hosts the code for an FPGA-based accelerator for convolutional neural networks ☆147 Updated 9 months ago
- HLS-based Deep Neural Network Accelerator Library for Xilinx Ultrascale+ MPSoCs ☆324 Updated 5 years ago
- FPGA-based acceleration of Convolutional Neural Networks. The project is developed in Verilog for the Altera DE5 Net platform. ☆181 Updated 8 years ago
- Squeezenet V1.1 on Cyclone V SoC-FPGA at 450ms/image, 20x faster than the ARM A9 processor alone. A project for 2017 Innovate FPGA design con… ☆107 Updated 6 years ago