sharc-lab / Edge-MoELinks

Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts

☆129

Alternatives and similar repositories for Edge-MoE

Users that are interested in Edge-MoE are comparing it to the libraries listed below

Sorting:

GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆124Updated 2 years ago
jha-lab / acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
☆54Updated 2 years ago
hguq / HG-PIPE
FPGA-based hardware accelerator for Vision Transformer (ViT), with Hybrid-Grained Pipeline.
☆108Updated 10 months ago
cjg91 / trans-fat
An FPGA Accelerator for Transformer Inference
☆92Updated 3 years ago
albertomarchisio / SwiftTron
☆46Updated 2 years ago
arc-research-lab / SSR
SSR: Spatial Sequential Hybrid Architecture for Latency Throughput Tradeoff in Transformer Acceleration (Full Paper Accepted in FPGA'24)
☆35Updated last week
CASR-HKU / MSD-FCCM23
Open-source of MSD framework
☆16Updated 2 years ago
gnodipac886 / ViT-FPGA-TPU
FPGA based Vision Transformer accelerator (Harvard CS205)
☆139Updated 9 months ago
mit-han-lab / spatten
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆115Updated last year
mit-emze / cimloop
☆74Updated 2 months ago
pku-liang / Sanger
A co-design architecture on sparse attention
☆54Updated 4 years ago
KULeuven-MICAS / DeFiNES
A framework for fast exploration of the depth-first scheduling space for DNN accelerators
☆42Updated 2 years ago
AlexMontgomerie / fpgaconvnet-tutorial
A collection of tutorials for the fpgaConvNet framework.
☆46Updated last year
xliu0709 / WinoCNN
An HLS based winograd systolic CNN accelerator
☆54Updated 4 years ago
cornell-zhang / FracBNN
FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
☆95Updated 4 years ago
aliemo / transfomers-silicon-research
Research and Materials on Hardware implementation of Transformer Model
☆292Updated 9 months ago
fffasttime / AnyPackingNet
☆31Updated 8 months ago
clevercool / ANT-Quantization
☆112Updated 2 years ago
hatsu3 / Sanger
☆47Updated 4 years ago
maeri-project / FEATHER
A Reconfigurable Accelerator with Data Reordering Support for Low-Cost On-Chip Dataflow Switching
☆71Updated last month
hazooree / LeNet-CNN-Accelerator-Hardware-for-FPGA
An open source Verilog Based LeNet-1 Parallel CNNs Accelerator for FPGAs in Vivado 2017
☆19Updated 6 years ago
ECASLab / hls-fpga-accelerators
Collection of kernel accelerators optimised for LLM execution
☆25Updated 3 weeks ago
isakedo / DNNsim
☆35Updated 5 years ago
linghaosong / Sextans
An FPGA accelerator for general-purpose Sparse-Matrix Dense-Matrix Multiplication (SpMM).
☆91Updated last year
wangxy-2000 / pimsim-nn
☆58Updated last year
ebby-s / MX-for-FPGA
Implementation of Microscaling data formats in SystemVerilog.
☆28Updated 5 months ago
hisrg / Neural-Network-Compression-and-Accelerator-on-Hardware
My name is Fang Biao. I'm currently pursuing my Master degree with the college of Computer Science and Engineering, Si Chuan University, …
☆53Updated 2 years ago
IBM / 3D-CiM-LLM-Inference-Simulator
Simulator for LLM inference on an abstract 3D AIMC-based accelerator
☆25Updated 2 months ago
arc-research-lab / CHARM
CHARM: Composing Heterogeneous Accelerators on Heterogeneous SoC Architecture
☆163Updated this week
jeffreyyu0602 / quantized-training
☆32Updated this week