KastnerRG / cgra4ml
An Open Workflow to Build Custom SoCs and run Deep Models at the Edge
☆97 · Updated last week
Alternatives and similar repositories for cgra4ml
Users who are interested in cgra4ml are comparing it to the libraries listed below.
- Train and deploy LUT-based neural networks on FPGAs ☆102 · Updated last year
- SAURIA (Systolic-Array tensor Unit for aRtificial Intelligence Acceleration) is an open-source Convolutional Neural Network accelerator b… ☆68 · Updated last week
- Verilog implementation of Softmax function ☆77 · Updated 3 years ago
- 16-bit Adder Multiplier hardware on Digilent Basys 3 ☆82 · Updated 2 years ago
- INT8 & FP16 multiplier accumulator (MAC) design with UVM verification completed. ☆108 · Updated 5 years ago
- This is a Verilog implementation of a 4x4 systolic array multiplier ☆70 · Updated 5 years ago
- ☆71 · Updated 6 years ago
- BARVINN: A Barrel RISC-V Neural Network Accelerator: https://barvinn.readthedocs.io/en/latest/ ☆92 · Updated 11 months ago
- NeuraLUT-Assemble ☆43 · Updated 3 months ago
- Deep Learning Accelerator Based on Eyeriss V2 Architecture with custom RISC-V extended instructions ☆204 · Updated 5 years ago
- IC implementation of TPU ☆140 · Updated 5 years ago
- ☆57 · Updated 7 months ago
- ☆63 · Updated 5 years ago
- ☆38 · Updated 6 years ago
- ☆64 · Updated 7 months ago
- An Open-Hardware CGRA for accelerated computation on the edge. ☆38 · Updated last month
- Verilog Implementation of 32-bit Floating Point Adder ☆44 · Updated 5 years ago
- A Fast, Low-Overhead On-chip Network ☆247 · Updated this week
- FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference ☆167 · Updated 2 years ago
- AMD University Program HLS tutorial ☆120 · Updated last year
- Systolic matrix multiplication kernel implemented on Xilinx PYNQ FPGA board ☆14 · Updated 5 years ago
- NoC (Network-on-Chip) generator that generates Verilog HDL model of NoC consisting of on-chip routers ☆71 · Updated 5 years ago
- Vitis HLS Library for FINN ☆210 · Updated 2 months ago
- TAPA compiles task-parallel HLS programs into high-performance FPGA accelerators. UCLA-maintained. ☆176 · Updated 3 months ago
- PYNQ Composable Overlays ☆73 · Updated last year
- Synthesizable Floating point unit written using Verilog. Supports 32-bit (Single-Precision) Multiplication, Addition and Division and Squ… ☆66 · Updated last year
- CGRA-Flow is an integrated framework for CGRA compilation, exploration, synthesis, and development. ☆145 · Updated 2 weeks ago
- Prototype-network-on-chip (ProNoC) is an EDA tool that facilitates prototyping of custom heterogeneous NoC-based many-core-SoC (MCSoC). ☆59 · Updated this week
- Hardware accelerator for convolutional neural networks ☆60 · Updated 3 years ago
- CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture ☆163 · Updated this week