ok858ok / CP-ViT

Code for "CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction" on CIFAR-10/100.

☆14

Alternatives and similar repositories for CP-ViT:

Users that are interested in CP-ViT are comparing it to the libraries listed below

Cydia2018 / ViT-cifar10-pruning
Vision Transformer Pruning
☆56Updated 3 years ago
jeffreyyu0602 / quantized-training
☆26Updated this week
jha-lab / acceltran
[TCAD'23] AccelTran: A Sparsity-Aware Accelerator for Transformers
☆40Updated last year
GATECH-EIC / ViTALiTy
ViTALiTy (HPCA'23) Code Repository
☆22Updated 2 years ago
aojunzz / DominoSearch
☆18Updated 3 years ago
mit-han-lab / spatten
[HPCA'21] SpAtten: Efficient Sparse Attention Architecture with Cascade Token and Head Pruning
☆84Updated 7 months ago
hatsu3 / Sanger
☆43Updated 3 years ago
MXHX7199 / ICCV_2021_AFP
AFP is a hardware-friendly quantization framework for DNNs, which is contributed by Fangxin Liu and Wenbo Zhao.
☆12Updated 3 years ago
sjtu-zhao-lab / SALO
An efficient spatial accelerator enabling hybrid sparse attention mechanisms for long sequences
☆26Updated last year
fffasttime / AnyPackingNet
☆26Updated 3 weeks ago
pku-liang / Sanger
A co-design architecture on sparse attention
☆52Updated 3 years ago
yanghr / BSQ
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆40Updated 4 years ago
CASR-HKU / MSD-FCCM23
Open-source of MSD framework
☆16Updated last year
isakedo / DNNsim
☆34Updated 4 years ago
zhexinli / Q-ViT-DeiT
DeiT implementation for Q-ViT
☆24Updated this week
zqu1992 / ALQ
☆15Updated 2 years ago
CASR-HKU / DPACS
☆18Updated 2 years ago
GATECH-EIC / ViTCoD
[HPCA 2023] ViTCoD: Vision Transformer Acceleration via Dedicated Algorithm and Accelerator Co-Design
☆105Updated last year
heymesut / SJTU_microe
A FPGA-based neural network inference accelerator, which won the third place in DAC-SDC
☆28Updated 2 years ago
zhutmost / neuralzip
A Out-of-box PyTorch Scaffold for Neural Network Quantization-Aware-Training (QAT) Research. Website: https://github.com/zhutmost/neuralz…
☆26Updated 2 years ago
peiswang / BitSplit
BitSplit Post-trining Quantization
☆49Updated 3 years ago
kabazoka / ViT-Accelerator
Vision Transformer Accelerator implemented in Vivado HLS for Xilinx FPGAs.
☆11Updated 3 months ago
I-Doctor / RTL_library_of_basic_hardware_units
Here are some implementations of basic hardware units in RTL language (verilog for now), which can be used for area/power evaluation and …
☆11Updated last year
KULeuven-MICAS / zigzag-llm
Model LLM inference on single-core dataflow accelerators
☆10Updated 2 months ago
CASR-HKU / AGNA-FCCM2023
☆12Updated last year
clevercool / ANT-Quantization
☆95Updated last year
sfox14 / block_minifloat
Training with Block Minifloat number representation
☆14Updated 3 years ago
albertomarchisio / SwiftTron
☆43Updated 2 years ago
Zhu-Zixuan / Bitlet-PE
A bit-level sparsity-awared multiply-accumulate process element.
☆14Updated 9 months ago
Alexstrasza98 / Transformer-Quantization
The final project repository for 2022 Spring COMS6998-009 Deep Learning System Performance in Columbia University.
☆7Updated 2 years ago