rehohoho / onnx2versal
Generate a Versal system design from an ONNX model. AI Engine kernels. Sub-microsecond speeds for autoencoders.
☆14Updated 6 months ago
Alternatives and similar repositories for onnx2versal
Users who are interested in onnx2versal are comparing it to the libraries listed below.
- NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions☆37Updated 3 months ago
- SAMO: Streaming Architecture Mapping Optimisation☆33Updated last year
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆21Updated last year
- ☆71Updated 2 years ago
- ☆23Updated 2 years ago
- ☆20Updated 5 months ago
- ☆29Updated 6 years ago
- A DSL for Systolic Arrays☆80Updated 6 years ago
- ☆58Updated 5 years ago
- Train and deploy LUT-based neural networks on FPGAs☆97Updated last year
- PyTorch model to RTL flow for low latency inference☆128Updated last year
- Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ☆58Updated 3 years ago
- HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond☆38Updated this week
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).☆81Updated 11 months ago
- ☆102Updated this week
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆59Updated 9 months ago
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆54Updated last week
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆46Updated 4 months ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated 7 months ago
- ☆56Updated 3 months ago
- Performance and resource models for fpgaConvNet: a Streaming-Architecture-based CNN Accelerator.☆29Updated 8 months ago
- ☆35Updated 3 months ago
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency☆32Updated 4 years ago
- A Spatial Accelerator Generation Framework for Tensor Algebra.☆58Updated 3 years ago
- Implementation of Microscaling data formats in SystemVerilog.☆21Updated last week
- GoldenEye is a functional simulator with fault injection capabilities for common and emerging numerical formats, implemented for the PyTo…☆25Updated 8 months ago
- ☆41Updated last year
- Xilinx Modifications to Halide☆13Updated 4 years ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆45Updated 3 years ago
- A research shell for Alveo V80☆17Updated last week