enyac-group/MaxEVA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/enyac-group/MaxEVA)

enyac-group / MaxEVA

MaxEVA: Maximizing the Efficiency of Matrix Multiplication on Versal AI Engine (accepted as full paper at FPT'23)

☆22

Alternatives and similar repositories for MaxEVA

Users that are interested in MaxEVA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

arc-research-lab / CHARM
View on GitHub
CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture
☆169Mar 12, 2026Updated last month
hanchenye / polyaie
View on GitHub
An MLIR-based compiler from C/C++ to AMD-Xilinx Versal AIE
☆17Aug 5, 2022Updated 3 years ago
Xilinx / xup_aie_training
View on GitHub
Hands-on experience programming AI Engines using Vitis Unified Software Platform
☆40Jul 24, 2024Updated last year
rehohoho / onnx2versal
View on GitHub
Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.
☆17Dec 29, 2024Updated last year
Xilinx / aie-rt
View on GitHub
☆25Jan 7, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
pierpaolomori / SemanticSegmentationFPGA
View on GitHub
☆11Sep 3, 2022Updated 3 years ago
arc-research-lab / SSR
View on GitHub
SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)
☆35Mar 12, 2026Updated last month
fchirono / cyclostationarity_analysis
View on GitHub
Python functions and scripts to analyse cyclostationary signals
☆26Feb 14, 2023Updated 3 years ago
Xilinx / logicnets
View on GitHub
Train and deploy LUT-based neural networks on FPGAs
☆112Jun 12, 2024Updated last year
alinxalinx / VD100_2023.2
View on GitHub
The VD100 development board is based on the Xilinx Versal AI Edge series chip xcve2302 and is designed with a core board and a bottom boa…
☆19Jul 9, 2024Updated last year
template-hls / template-hls-float
View on GitHub
☆30Apr 26, 2019Updated 7 years ago
Xilinx / mlir-aie
View on GitHub
An MLIR-based toolchain for AMD AI Engine-enabled devices.
☆630Updated this week
maestro-project / AIrchitect-v2
View on GitHub
[DATE 2025] Official implementation and dataset of AIrchitect v2: Learning the Hardware Accelerator Design Space through Unified Represen…
☆19Jan 17, 2025Updated last year
PannenetsF / TQT
View on GitHub
TQT's pytorch implementation.
☆21Dec 17, 2021Updated 4 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
fffasttime / AnyPackingNet
View on GitHub
☆32Mar 31, 2025Updated last year
sharc-lab / Edge-MoE
View on GitHub
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
☆138May 10, 2024Updated last year
CAS-CLab / BlockConv
View on GitHub
[TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA
☆17Jul 7, 2022Updated 3 years ago
Xilinx / xup_compute_acceleration
View on GitHub
Hands-on experience using the Vitis unified software platform with Xilinx FPGA hardware
☆49Jul 24, 2024Updated last year
uwsampa / mcpat
View on GitHub
McPAT modeling framework
☆12Oct 18, 2014Updated 11 years ago
synergy-noc-generators / Proteus
View on GitHub
☆10Jan 25, 2023Updated 3 years ago
Xilinx / llvm-aie
View on GitHub
Fork of LLVM to support AMD AIEngine processors
☆196Updated this week
nqdtan / vck5000_vivado_ulp
View on GitHub
An alternative Vivado custom design example (to fully Vitis) for the User Logic Partition targeting VCK5000
☆13Jul 16, 2024Updated last year
kiabuzz / CompressedLUT
View on GitHub
A tool to generate optimized hardware files for univariate functions.
☆29Apr 5, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
jerry-D / 64-bit-Universal-Floating-Point-ISA-Compute-Engine
View on GitHub
RISC-V Rocket Chip Strap-on-Booster with Fused Universal Neural Network (FuNN) eNNgine
☆21Mar 17, 2022Updated 4 years ago
GATECH-EIC / Auto-NBA
View on GitHub
[ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…
☆16Jan 3, 2022Updated 4 years ago
ZSusskind / BTHOWeN
View on GitHub
Code to accompany "Weightless Neural Networks for Efficient Edge Inference", PACT 2022
☆22Nov 15, 2022Updated 3 years ago
cornell-zhang / FracBNN
View on GitHub
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
☆99Oct 2, 2021Updated 4 years ago
ttambe / AdaptivFloat
View on GitHub
Adaptive floating-point based numerical format for resilient deep learning
☆14Apr 11, 2022Updated 4 years ago
scikit-hep / hepconvert
View on GitHub
☆14Mar 3, 2025Updated last year
PSCLab-ASU / Systolic-CNN
View on GitHub
☆17Feb 13, 2021Updated 5 years ago
zhexinli / Q-ViT-DeiT
View on GitHub
DeiT implementation for Q-ViT
☆25Apr 21, 2025Updated last year
cornell-zhang / allo
View on GitHub
Allo Accelerator Design and Programming Framework (PLDI'24)
☆373Mar 13, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
UCLA-VAST / Stream-HLS
View on GitHub
An MLIR Complier for PyTorch/C/C++ Codes into HLS Dataflow Designs
☆66Aug 1, 2025Updated 9 months ago
dicecco1 / fpga_cpfp
View on GitHub
HLS Custom-Precision Floating-Point Library
☆13Nov 6, 2017Updated 8 years ago
GATECH-EIC / ViTCoD
View on GitHub
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆130Jun 27, 2023Updated 2 years ago
os-hxfan / Static_BFP_HW
View on GitHub
This repository contains the hardware implementation for Static BFP convolution on FPGA
☆10Oct 15, 2019Updated 6 years ago
yanghr / BSQ
View on GitHub
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆42Jan 12, 2021Updated 5 years ago
Elrori / OV_camera_on_FPGA
View on GitHub
OV7670 (Verilog HDL)Drive for FPGA
☆19Mar 4, 2019Updated 7 years ago
linghaosong / Sextans
View on GitHub
An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).
☆94Jul 26, 2024Updated last year