cjg91 / trans-fatLinks

An FPGA Accelerator for Transformer Inference

☆85

Alternatives and similar repositories for trans-fat

Users that are interested in trans-fat are comparing it to the libraries listed below

Sorting:

gnodipac886 / ViT-FPGA-TPU
FPGA based Vision Transformer accelerator (Harvard CS205)
☆125Updated 5 months ago
CASR-HKU / MSD-FCCM23
Open-source of MSD framework
☆16Updated last year
xliu0709 / WinoCNN
An HLS based winograd systolic CNN accelerator
☆53Updated 3 years ago
hguq / HG-PIPE
FPGA-based hardware accelerator for Vision Transformer (ViT), with Hybrid-Grained Pipeline.
☆73Updated 5 months ago
albertomarchisio / SwiftTron
☆44Updated 2 years ago
arasi15 / CNN-Accelerator-Implementation-based-on-Eyerissv2
☆113Updated 4 years ago
jha-lab / acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
☆49Updated last year
arc-research-lab / CHARM
CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture
☆147Updated this week
karthisugumar / CSE240D-Hierarchical_Mesh_NoC-Eyeriss_v2
A SystemVerilog implementation of Row-Stationary dataflow and Hierarchical Mesh Network-on-Chip Architecture based on Eyeriss CNN Acceler…
☆163Updated 5 years ago
AlexMontgomerie / fpgaconvnet-tutorial
A collection of tutorials for the fpgaConvNet framework.
☆42Updated 9 months ago
cornell-zhang / FracBNN
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
☆94Updated 3 years ago
Buck008 / Transformer-Accelerator-Based-on-FPGA
You can run it on pynq z1. The repository contains the relevant Verilog code, Vivado configuration and C code for sdk testing. The size o…
☆179Updated last year
maeri-project / FEATHER
A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching
☆54Updated 3 months ago
BoooC / CNN-Accelerator-Based-on-Eyeriss-v2
A Flexible and Energy Efficient Accelerator For Sparse Convolution Neural Network
☆83Updated 4 months ago
Dazhuzhu-github / systolic-array
verilog实现TPU中的脉动阵列计算卷积的module
☆121Updated 2 months ago
taoyilee / clacc
Deep Learning Accelerator (Convolution Neural Networks)
☆188Updated 7 years ago
fffasttime / AnyPackingNet
☆27Updated 3 months ago
georgia-tech-synergy-lab / SIGMA
RTL implementation of Flex-DPE.
☆106Updated 5 years ago
sharc-lab / Edge-MoE
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
☆122Updated last year
arc-research-lab / SSR
SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)
☆32Updated this week
BUAA-CI-LAB / Literatures-on-SRAM-based-CIM
A reading list for SRAM-based Compute-In-Memory (CIM) research.
☆71Updated last month
KULeuven-MICAS / DeFiNES
A framework for fast exploration of the depth-first scheduling space for DNN accelerators
☆39Updated 2 years ago
8krisv / CNN-ACCELERATOR
Hardware accelerator for convolutional neural networks
☆47Updated 2 years ago
groupsada / DeepBurning
Automatic generation of FPGA-based learning accelerators for the neural network family
☆67Updated 5 years ago
sunxt99 / PIMCOMP-NN
☆65Updated 5 months ago
pku-liang / Sanger
A co-design architecture on sparse attention
☆52Updated 3 years ago
Xilinx / ResNet50-PYNQ
Quantized ResNet50 Dataflow Acceleration on Alveo, with PYNQ
☆58Updated 3 years ago
Zhu-Zixuan / Bitlet-PE
A bit-level sparsity-awared multiply-accumulate process element.
☆16Updated last year
maomran / softmax
Verilog implementation of Softmax function
☆67Updated 2 years ago
Xilinx / finn-hlslib
Vitis HLS Library for FINN
☆202Updated 3 weeks ago