enyac-group / MaxEVA
MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)
☆18Updated 10 months ago
Alternatives and similar repositories for MaxEVA:
Users that are interested in MaxEVA are comparing it to the libraries listed below
- ☆33Updated 5 years ago
- ☆34Updated 3 years ago
- [TECS'23] A project on the co-design of Accelerators and CNNs.☆20Updated 2 years ago
- ☆22Updated 2 years ago
- HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond☆28Updated 3 weeks ago
- ☆33Updated 3 weeks ago
- Contains FPGA benchmarks for Vivado HLS and Catapult HLS☆26Updated 4 years ago
- A Spatial Accelerator Generation Framework for Tensor Algebra.☆55Updated 3 years ago
- dMazeRunner: Dataflow acceleration optimization infrastructure for coarse-grained programmable accelerators☆45Updated 2 years ago
- ☆25Updated 2 months ago
- A systolic array simulator for multi-cycle MACs and varying-byte words, with the paper accepted to HPCA 2022.☆68Updated 3 years ago
- ☆27Updated 5 years ago
- ☆18Updated 2 years ago
- Linux docker for the DNN accelerator exploration infrastructure composed of Accelergy and Timeloop☆48Updated 3 weeks ago
- [DAC 2020] Analysis and Optimization of the Implicit Broadcasts in FPGA HLS to Improve Maximum Frequency☆32Updated 4 years ago
- ☆71Updated 2 years ago
- ☆3Updated 3 years ago
- FlexASR: A Reconfigurable Hardware Accelerator for Attention-based Seq-to-Seq Networks☆42Updated 2 years ago
- ☆9Updated 2 years ago
- ☆19Updated last year
- ☆13Updated 4 years ago
- An LSTM template and a few examples using Vivado HLS☆44Updated 9 months ago
- A fast, accurate trace-based simulator for High-Level Synthesis.☆42Updated last week
- RISC-V ISA based 32-bit processor written in HLS☆17Updated 5 years ago
- Designs for finalist teams of the DAC System Design Contest☆36Updated 4 years ago
- A general framework for optimizing DNN dataflow on systolic array☆33Updated 4 years ago
- SAMO: Streaming Architecture Mapping Optimisation☆32Updated last year
- Domain-Specific Architecture Generator 2☆21Updated 2 years ago
- An HBM FPGA based SpMV Accelerator☆12Updated 5 months ago
- Systolic array implementations for Cholesky, LU, and QR decomposition☆39Updated 3 months ago