asadalam / FINN_MatrixVector_RTL
Repository for work on the RTL implementation of Xilinx's matrix vector activation unit. Documentation available at: https://asadalam.github.io/FINN_MatrixVector_RTL/
☆15 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for FINN_MatrixVector_RTL
- Performance and resource models for fpgaConvNet: a Streaming-Architecture-based CNN Accelerator. ☆27 · Updated this week
- Designs for finalist teams of the DAC System Design Contest ☆35 · Updated 4 years ago
- ☆69 · Updated last year
- Contains FPGA benchmarks for Vivado HLS and Catapult HLS ☆24 · Updated 4 years ago
- ☆20 · Updated 2 years ago
- ☆8 · Updated last year
- Provides the hardware code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerator… ☆24 · Updated 4 years ago
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022. ☆65 · Updated 3 years ago
- SAMO: Streaming Architecture Mapping Optimisation ☆32 · Updated last year
- ☆22 · Updated 5 years ago
- HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond ☆16 · Updated this week
- HLS implemented systolic array structure ☆40 · Updated 6 years ago
- A collection of tutorials for the fpgaConvNet framework. ☆30 · Updated last month
- ☆32 · Updated 5 years ago
- PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi… ☆39 · Updated 9 months ago
- ☆55 · Updated 4 years ago
- Algorithmic C Machine Learning Library ☆22 · Updated 3 months ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks ☆42 · Updated 2 years ago
- CNN Accelerator in Frequency Domain ☆10 · Updated 4 years ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators ☆44 · Updated 2 years ago
- Low level design of a chip built for optimizing/accelerating CNN classifiers over gray scale images. ☆12 · Updated 5 years ago
- Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ ☆51 · Updated 2 years ago
- The Verilog source code for DRUM approximate multiplier. ☆27 · Updated last year
- A fast, accurate trace-based simulator for High-Level Synthesis. ☆34 · Updated 6 months ago
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency ☆32 · Updated 3 years ago
- ☆27 · Updated 5 years ago
- ☆13 · Updated 4 years ago
- ☆33 · Updated 3 years ago
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23) ☆15 · Updated 6 months ago
- An HLS based winograd systolic CNN accelerator ☆48 · Updated 3 years ago