rehohoho / onnx2versalLinks
Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.
☆14Updated 8 months ago
Alternatives and similar repositories for onnx2versal
Users that are interested in onnx2versal are comparing it to the libraries listed below
Sorting:
- NeuraLUT-Assemble☆41Updated last month
- Xilinx Modifications to Halide☆13Updated 4 years ago
- SAMO: Streaming Architecture Mapping Optimisation☆34Updated last year
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆21Updated last year
- Train and deploy LUT-based neural networks on FPGAs☆98Updated last year
- ☆72Updated 2 years ago
- A DSL for Systolic Arrays☆81Updated 6 years ago
- ☆30Updated 6 years ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆24Updated 9 months ago
- PyTorch model to RTL flow for low latency inference☆131Updated last year
- ☆60Updated 5 years ago
- ☆23Updated 2 years ago
- Tool for optimize CNN blocking☆94Updated 5 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆48Updated 7 months ago
- An MLIR Complier for PyTorch/C/C++ Codes into HLS Dataflow Designs☆47Updated last month
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).☆84Updated last year
- Open Source Compiler Framework using ONNX as Frontend and IR☆33Updated 3 years ago
- ☆20Updated 7 months ago
- GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTo…☆26Updated 11 months ago
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆58Updated 2 months ago
- A Spatial Accelerator Generation Framework for Tensor Algebra.☆59Updated 3 years ago
- PyLog: An Algorithm-Centric FPGA Programming and Synthesis Flow☆68Updated 2 years ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆47Updated 3 years ago
- ☆58Updated 6 months ago
- PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi…☆53Updated last year
- Performance and resource models for fpgaConvNet: a Streaming-Architecture-based CNN Accelerator.☆30Updated 10 months ago
- Docker container with tools for the Timeloop/Accelergy tutorial☆22Updated last year
- A fast, accurate trace-based simulator for High-Level Synthesis.☆69Updated 5 months ago
- ☆34Updated 6 years ago
- ☆15Updated last week