rehohoho / onnx2versal
Generate a Versal system design from an ONNX model. AI Engine kernels. Sub-microsecond speeds for autoencoders.
☆14Updated 4 months ago
Alternatives and similar repositories for onnx2versal
Users that are interested in onnx2versal are comparing it to the libraries listed below
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆20Updated last year
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆23Updated 5 months ago
- ☆29Updated 6 years ago
- NeuraLUT: Hiding Neural Network Density in Boolean Synthesizable Functions☆31Updated last month
- ☆14Updated 3 years ago
- ☆51Updated last month
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆45Updated 3 years ago
- ☆23Updated 2 years ago
- An MLIR Compiler for PyTorch/C/C++ Codes into HLS Dataflow Designs☆32Updated this week
- ☆71Updated 2 years ago
- HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond☆35Updated 3 weeks ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆46Updated 2 months ago
- ☆27Updated 6 months ago
- A DSL for Systolic Arrays☆79Updated 6 years ago
- ☆59Updated 2 weeks ago
- A fast, accurate trace-based simulator for High-Level Synthesis.☆44Updated last month
- ☆57Updated 5 years ago
- A general framework for optimizing DNN dataflow on systolic array☆35Updated 4 years ago
- An Open-Hardware CGRA for accelerated computation on the edge.☆25Updated 8 months ago
- FRAME: Fast Roofline Analytical Modeling and Estimation☆34Updated last year
- ARIES: An Agile MLIR-Based Compilation Flow for Reconfigurable Devices with AI Engines (FPGA 2025 Best Paper Nominee)☆26Updated this week
- EQueue Dialect☆40Updated 3 years ago
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency☆32Updated 4 years ago
- Dynamically Reconfigurable Architecture Template and Cycle-level Microarchitecture Simulator for Dataflow AcCelerators☆28Updated last year
- Provides the hardware code for the paper "EBPC: Extended Bit-Plane Compression for Deep Neural Network Inference and Training Accelerator…☆24Updated 4 years ago
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆53Updated 3 weeks ago
- Implementation of Microscaling data formats in SystemVerilog.☆18Updated 8 months ago
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year
- ☆97Updated last week
- ☆13Updated 4 years ago