enyac-group / MaxEVA
MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as a full paper at FPT'23)
☆18 · Updated 9 months ago
Alternatives and similar repositories for MaxEVA:
Users who are interested in MaxEVA are comparing it to the libraries listed below.
- ☆27 · Updated 5 years ago
- ☆22 · Updated 2 years ago
- HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond ☆23 · Updated last month
- ☆18 · Updated 2 years ago
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency ☆32 · Updated 3 years ago
- ☆33 · Updated 5 years ago
- RISC-V ISA based 32-bit processor written in HLS ☆17 · Updated 5 years ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators ☆45 · Updated 2 years ago
- ☆71 · Updated last year
- ☆25 · Updated last month
- ☆3 · Updated 3 years ago
- SAMO: Streaming Architecture Mapping Optimisation ☆32 · Updated last year
- ☆33 · Updated 3 years ago
- Implementation of Microscaling data formats in SystemVerilog. ☆13 · Updated 4 months ago
- A fast, accurate trace-based simulator for High-Level Synthesis. ☆39 · Updated 8 months ago
- ☆9 · Updated last year
- ☆23 · Updated 4 years ago
- A general framework for optimizing DNN dataflow on systolic array ☆33 · Updated 4 years ago
- ☆13 · Updated 4 years ago
- ☆33 · Updated last week
- SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24) ☆27 · Updated 5 months ago
- Contains FPGA benchmarks for Vivado HLS and Catapult HLS ☆26 · Updated 4 years ago
- PolyLUT is the first quantized neural network training methodology that maps a neuron to a LUT while using multivariate polynomial functi… ☆45 · Updated 11 months ago
- ☆15 · Updated 3 years ago
- High-Performance Sparse Linear Algebra on HBM-Equipped FPGAs Using HLS ☆84 · Updated 3 months ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks ☆42 · Updated 2 years ago
- Designs for finalist teams of the DAC System Design Contest ☆36 · Updated 4 years ago
- Implementation of paper "GraphACT: Accelerating GCN Training on CPU-FPGA Heterogeneous Platform". ☆10 · Updated 4 years ago
- HLS implemented systolic array structure ☆41 · Updated 7 years ago
- ☆12 · Updated last year