puccinic/Transformer_dataflow

C++ code for HLS FPGA implementation of transformer

☆23

Alternatives and similar repositories for Transformer_dataflow

Users that are interested in Transformer_dataflow are comparing it to the libraries listed below.

RakeshUIUC / multihead_attn_accelerator
Accelerate multihead attention transformer model using HLS for FPGA
☆12 · Dec 7, 2023 · Updated 2 years ago
gl9544 / vit_transformer_fpga
☆15 · Aug 10, 2023 · Updated 2 years ago
cjg91 / trans-fat
An FPGA Accelerator for Transformer Inference
☆93 · Apr 29, 2022 · Updated 4 years ago
zhengchen3 / HLS_Transformer
C++ version of ViT
☆12 · Nov 13, 2022 · Updated 3 years ago
gnodipac886 / ViT-FPGA-TPU
FPGA based Vision Transformer accelerator (Harvard CS205)
☆155 · Feb 11, 2025 · Updated last year
qyw123 / transformer_core
A student training project for HLS and transformer
☆11 · Oct 19, 2022 · Updated 3 years ago
Buck008 / Transformer-Accelerator-Based-on-FPGA
You can run it on pynq z1. The repository contains the relevant Verilog code, Vivado configuration and C code for sdk testing. The size o…
☆247 · Mar 24, 2024 · Updated 2 years ago
arc-research-lab / SSR
SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)
☆35 · Mar 12, 2026 · Updated last month
ShixiangLi / Transformer_FPGA
☆15 · Mar 22, 2024 · Updated 2 years ago
ECASLab / hls-fpga-accelerators
Collection of kernel accelerators optimised for LLM execution
☆30 · Feb 26, 2026 · Updated 2 months ago
lllibano / SystolicArray
A parametric RTL code generator of an efficient integer MxM Systolic Array implementation for Xilinx FPGAs.
☆36 · Aug 28, 2025 · Updated 8 months ago
Fiwo735 / Transformer_Neural_Network_HLS
☆14 · Jun 22, 2022 · Updated 3 years ago
maestro-project / AIrchitect-v2
[DATE 2025] Official implementation and dataset of AIrchitect v2: Learning the Hardware Accelerator Design Space through Unified Represen…
☆19 · Jan 17, 2025 · Updated last year
jgoeders / dac_sdc_2021_designs
☆19 · Mar 16, 2022 · Updated 4 years ago
kabazoka / ViT-Accelerator
(Not actively updating) Vision Transformer Accelerator implemented in Vivado HLS for Xilinx FPGAs.
☆20 · Dec 29, 2024 · Updated last year
sharc-lab / Edge-MoE
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
☆138 · May 10, 2024 · Updated last year
embedeep / FREE-TPU-V3plus-for-FPGA
FREE TPU V3plus for FPGA is the free version of a commercial AI processor (EEP-TPU) for Deep Learning EDGE Inference
☆175 · Jun 9, 2023 · Updated 2 years ago
YutaPic / FPU
For CPU experiment
☆14 · Feb 23, 2021 · Updated 5 years ago
sherylll / lenet5-accelerator
FPGA and GPU acceleration of LeNet5
☆36 · Jul 9, 2019 · Updated 6 years ago
EthanBnntt / tinygrad-vit
A minimalist implementation of the ViT (Vision Transformer) model, using tinygrad
☆16 · Sep 2, 2024 · Updated last year
AlexMontgomerie / fpgaconvnet-hls
☆33 · Nov 7, 2024 · Updated last year
ribesstefano / Mapping-Multiple-LSTM-Models-on-FPGAs
Includes the SVD-based approximation algorithms for compressing deep learning models and the FPGA accelerators exploiting such approximat…
☆16 · Mar 3, 2023 · Updated 3 years ago
aliemo / transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
☆306 · Feb 28, 2025 · Updated last year
PCov3r / FPGA_Handwritten_digit_recognition
A Verilog implementation of a hand-written digit recognition Neural Network
☆11 · Nov 16, 2024 · Updated last year
BoChen-Ye / Tiny_LeViT_Hardware_Accelerator
This is my hobby project in SystemVerilog to accelerate the LeViT network, which contains CNN and attention layers.
☆36 · Aug 13, 2024 · Updated last year
amsacks / OV7670-camera
An RTL-based project in Verilog that shows real-time video captured by a CMOS camera OV7670 and displayed on a monitor through VGA at 640 …
☆30 · Mar 18, 2023 · Updated 3 years ago
hguq / HG-PIPE
FPGA-based hardware accelerator for Vision Transformer (ViT), with Hybrid-Grained Pipeline.
☆140 · Jan 20, 2025 · Updated last year
bselimoglu / SoC-ZedBoard-Zynq-7000-Labs
Hardware and Software Co-design implementations
☆15 · Dec 5, 2019 · Updated 6 years ago
spcl / gemm_hls
Scalable systolic array-based matrix-matrix multiplication implemented in Vivado HLS for Xilinx FPGAs.
☆384 · Jan 20, 2025 · Updated last year
shihuihong214 / P2-ViT
☆11 · Jun 4, 2024 · Updated last year
kookma / Persian-Beamer-Templates
A collection of Beamer samples in Persian
☆16 · May 7, 2024 · Updated last year
ChengZhang-98 / QERA
Official implementation of the ICLR'25 paper "QERA: an Analytical Framework for Quantization Error Reconstruction".
☆14 · Feb 4, 2025 · Updated last year
godfather991 / UniNDP
Artifact material for [HPCA 2025] #2108 "UniNDP: A Unified Compilation and Simulation Tool for Near DRAM Processing Architectures"
☆54 · Sep 1, 2025 · Updated 8 months ago
kaiiiz / hls-spmv
High-level synthesis (HLS) implementation of Sparse Matrix Vector Multiplication
☆19 · Feb 17, 2022 · Updated 4 years ago
PKUZHOU / Vibe-GPU
Vibe Coding A GPGPU via Cursor + Gemini3 Pro
☆82 · Nov 23, 2025 · Updated 5 months ago
ingur / bitlinear-pytorch
Implementation of the BitLinear layer from: The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
☆14 · Sep 11, 2024 · Updated last year
Xilinx / finn-hlslib
Vitis HLS Library for FINN
☆220 · Feb 25, 2026 · Updated 2 months ago
amirata051 / Bearing-DiffRUL-XJTU-SY
☆14 · Mar 5, 2025 · Updated last year
MedChaabane / Autonomous-flight-of-the-drone-AR.Drone-using-OpenCV
Autonomous drone using detected ball to command the direction of the drone
☆26 · Nov 1, 2018 · Updated 7 years ago