spcl / gemm_hls
Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
☆371 Updated 10 months ago
Alternatives and similar repositories for gemm_hls
Users who are interested in gemm_hls are comparing it to the libraries listed below.
- A collection of extensions for Vitis and Intel FPGA OpenCL to improve developer quality of life. ☆332 Updated 10 months ago
- Vitis HLS Library for FINN ☆210 Updated 2 months ago
- Examples shown as part of the tutorial "Productive parallel programming on FPGA with high-level synthesis". ☆204 Updated 4 years ago
- Vitis_Accel_Examples ☆575 Updated 2 weeks ago
- Open Source Specialized Computing Stack for Accelerating Deep Neural Networks. ☆226 Updated 6 years ago
- A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration. ☆435 Updated 6 years ago
- Deep Learning Accelerator (Convolution Neural Networks) ☆195 Updated 7 years ago
- FPGA-based neural network inference project with an end-to-end approach (from training to implementation to deployment) ☆281 Updated 6 years ago
- AutoSA: Polyhedral-Based Systolic Array Compiler ☆230 Updated 2 years ago
- DPU on PYNQ ☆234 Updated 3 months ago
- IC implementation of Systolic Array for TPU ☆309 Updated last year
- SystemC/C++ library of commonly-used hardware functions and components for HLS. ☆287 Updated last month
- HLS based Deep Neural Network Accelerator Library for Xilinx Ultrascale+ MPSoCs ☆335 Updated 6 years ago
- A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler… ☆177 Updated 5 years ago
- Dataflow QNN inference accelerator examples on FPGAs ☆239 Updated 3 months ago
- Convolutional Neural Network Using High Level Synthesis ☆90 Updated 5 years ago
- ☆743 Updated 2 weeks ago
- A FPGA Based CNN accelerator, following Google's TPU V1. ☆162 Updated 6 years ago
- AMD University Program HLS tutorial ☆120 Updated last year
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture ☆163 Updated last week
- Convolutional accelerator kernel, target ASIC & FPGA ☆235 Updated 2 years ago
- TAPA compiles task-parallel HLS program into high-performance FPGA accelerators. UCLA-maintained. ☆176 Updated 3 months ago
- Implementation of a Tensor Processing Unit for embedded systems and the IoT. ☆516 Updated 6 years ago
- Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions ☆204 Updated 5 years ago
- Rosetta: A Realistic High-level Synthesis Benchmark Suite for Software Programmable FPGAs (FPGA'18) ☆168 Updated 2 years ago
- Embedded Scalable Platforms: Heterogeneous SoC architecture and IP integration made easy ☆394 Updated last month
- NVDLA (An Opensource DL Accelerator Framework) implementation on FPGA. ☆373 Updated last year
- A convolutional neural network implemented in hardware (verilog) ☆165 Updated 8 years ago
- NVDLA is an Open source DL/ML accelerator, which is very suitable for individuals or college students. This is the NOTES when I learn and… ☆230 Updated 6 years ago
- Small-scale Tensor Processing Unit built on an FPGA ☆209 Updated 6 years ago