rehohoho / onnx2versalLinks
Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.
☆16Updated 11 months ago
Alternatives and similar repositories for onnx2versal
Users that are interested in onnx2versal are comparing it to the libraries listed below
Sorting:
- Xilinx Modifications to Halide☆14Updated 4 years ago
- MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)☆22Updated last year
- NeuraLUT-Assemble☆43Updated 3 months ago
- SAMO: Streaming Architecture Mapping Optimisation☆34Updated 2 years ago
- ☆63Updated 5 years ago
- ☆22Updated 3 years ago
- A DSL for Systolic Arrays☆82Updated 6 years ago
- ☆30Updated 6 years ago
- Train and deploy LUT-based neural networks on FPGAs☆101Updated last year
- ☆72Updated 2 years ago
- A Spatial Accelerator Generation Framework for Tensor Algebra.☆60Updated 3 years ago
- ☆19Updated 9 months ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆49Updated 9 months ago
- Resource Utilization and Latency Estimation for ML on FPGA.☆17Updated 2 months ago
- PyTorch model to RTL flow for low latency inference☆130Updated last year
- An implementation of a BinaryConnect network for cifar10☆11Updated 6 years ago
- ☆61Updated 8 months ago
- An MLIR dialect to enable the efficient acceleration of ML model on CGRAs.☆64Updated last year
- A research shell for Alveo V80☆19Updated last month
- Repository for work on on Xilinx's matrix vector activation unit's RTL implementation. Documentation available at: https://asadalam.githu…☆18Updated 3 years ago
- Performance and resource models for fpgaConvNet: a Streaming-Architecture-based CNN Accelerator.☆30Updated last year
- A fast, accurate trace-based simulator for High-Level Synthesis.☆72Updated 8 months ago
- ☆16Updated this week
- Multi-core HW accelerator mapping optimization framework for layer-fused ML workloads.☆64Updated 4 months ago
- Multi-target compiler for Sum-Product Networks, based on MLIR and LLVM.☆24Updated last year
- An MLIR Complier for PyTorch/C/C++ Codes into HLS Dataflow Designs☆54Updated 3 months ago
- An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE☆16Updated 3 years ago
- An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).☆91Updated last year
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆47Updated 3 years ago
- PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi…☆54Updated last year