rehohoho / onnx2versal
Generate a Versal system design from an ONNX model. AI Engine kernels. Sub-microsecond speeds for autoencoders.
☆9Updated last year
Related projects
Alternatives and complementary repositories for onnx2versal
- ☆19Updated last week
- A course based on FINN with hands-on Lectures, Examples and Labs to go from 0 to a full custom Quantized Neural Network running on your v…☆14Updated last month
- Open Source Compiler Framework using ONNX as Frontend and IR☆29Updated 2 years ago
- ☆27Updated 5 years ago
- Generate an FPGA design for a TWN☆9Updated 5 years ago
- ☆70Updated last year
- ☆20Updated 2 years ago
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year
- ☆55Updated 4 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆42Updated 2 years ago
- ☆13Updated 4 years ago
- ☆83Updated 5 months ago
- Performance and resource models for fpgaConvNet: a Streaming-Architecture-based CNN Accelerator.☆27Updated 2 weeks ago
- Repository for work on Xilinx's matrix vector activation unit's RTL implementation. Documentation available at: https://asadalam.githu…☆15Updated 2 years ago
- ☆21Updated 2 years ago
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency☆32Updated 3 years ago
- PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi…☆44Updated 9 months ago
- A DSL for Systolic Arrays☆78Updated 5 years ago
- ☆33Updated 3 years ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated this week
- A Generic Distributed Auto-Tuning Infrastructure☆21Updated 3 years ago
- ☆32Updated 5 years ago
- A Spatial Accelerator Generation Framework for Tensor Algebra.☆52Updated 2 years ago
- An implementation of a BinaryConnect network for cifar10☆11Updated 5 years ago
- Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ☆52Updated 2 years ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆44Updated 2 years ago
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆15Updated 7 months ago
- ☆40Updated 4 years ago
- A floating-point matrix multiplication implemented in hardware☆29Updated 3 years ago
- ☆12Updated 2 years ago