sharc-lab / Edge-MoE
Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts
☆82Updated 4 months ago
Related projects: ⓘ
- An FPGA Accelerator for Transformer Inference☆69Updated 2 years ago
- [HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design☆88Updated last year
- The codes and artifacts associated with our MICRO'22 paper titled: "Adaptable Butterfly Accelerator for Attention-based NNs via Hardware …☆103Updated last year
- [TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers☆30Updated 9 months ago
- [HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning☆64Updated 3 weeks ago
- You can run it on pynq z1. The repository contains the relevant Verilog code, Vivado configuration and C code for sdk testing. The size o…☆92Updated 5 months ago
- An HLS based winograd systolic CNN accelerator☆46Updated 3 years ago
- SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)☆23Updated last month
- FPGA based Vision Transformer accelerator (Harvard CS205)☆74Updated 9 months ago
- A co-design architecture on sparse attention☆41Updated 3 years ago
- Open-source of MSD framework☆14Updated last year
- ☆28Updated last year
- ☆12Updated last year
- FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations☆87Updated 2 years ago
- A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC☆28Updated 2 years ago
- ☆75Updated 10 months ago
- Research and Materials on Hardware implementation of Transformer Model☆195Updated last month
- A collection of tutorials for the fpgaConvNet framework.☆28Updated last month
- ☆67Updated 4 years ago
- ☆27Updated 4 years ago
- ☆37Updated 3 years ago
- [TVLSI'23] This repository contains the source code for the paper "FireFly: A High-Throughput Hardware Accelerator for Spiking Neural Net…☆15Updated 5 months ago
- Verilog implementation of Softmax function☆45Updated 2 years ago
- [ASAP 2020; FPGA 2020] Hardware architecture to accelerate GNNs (common IP modules for minibatch training and full batch inference)☆41Updated 3 years ago
- ☆23Updated 6 months ago
- FPGA-based neural network inference project for 2020 DAC System Design Contest☆109Updated 3 years ago
- ☆38Updated last week
- An open source Verilog Based LeNet-1 Parallel CNNs Accelerator for FPGAs in Vivado 2017☆12Updated 5 years ago
- 2020 xilinx summer school☆16Updated 4 years ago
- A framework for fast exploration of the depth-first scheduling space for DNN accelerators☆29Updated last year