dongkuanx27 / SparseBERT

(SparseBERT) Rethinking Network Pruning -- under the Pre-train and Fine-tune Paradigm (NAACL'21)

☆9

Alternatives and similar repositories for SparseBERT

Users who are interested in SparseBERT compare it to the repositories listed below.

bywmm / Bi-GCN
Implementation of "Binary Graph Convolutional Network", CVPR 2021, and TPAMI 2024.
☆26 Updated last year
WoosukKwon / retraining-free-pruning
[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers
☆191 Updated 2 years ago
yuchaoli / PST
Source code for IJCAI 2022 Long paper: Parameter-Efficient Sparsity for Large Language Models Fine-Tuning.
☆15 Updated 3 years ago
ShunLu91 / PINAT
[AAAI '23] PINAT: A Permutation INvariance Augmented Transformer for NAS Predictor
☆31 Updated 2 years ago
KwangHoonAn / PACT
Reproducing Quantization paper PACT
☆64 Updated 3 years ago
camlsys / degree-quant
ICLR 2021
☆48 Updated 4 years ago
kssteven418 / LTP
[KDD'22] Learned Token Pruning for Transformers
☆98 Updated 2 years ago
Kelvinyu1117 / LSQ-implementation
This is an unofficial implementation of the paper - LEARNED STEP SIZE QUANTIZATION at ICLR 2020
☆8 Updated 4 years ago
htqin / BiBERT
This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.
☆88 Updated 2 years ago
han-shi / SparseBERT
☆13 Updated 2 years ago
kssteven418 / I-BERT
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
☆255 Updated 2 years ago
VITA-Group / GraNet
[NeurIPS 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration
☆31 Updated 2 years ago
QingruZhang / PLATON
This PyTorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).
☆46 Updated 2 years ago
Cydia2018 / ViT-cifar10-pruning
Vision Transformer Pruning
☆57 Updated 3 years ago
yanghr / BSQ
BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021)
☆41 Updated 4 years ago
zhexinli / Q-ViT-DeiT
DeiT implementation for Q-ViT
☆24 Updated 3 months ago
VITA-Group / UVC
[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…
☆54 Updated last year
yxli2123 / LoSparse
☆59 Updated last year
GATECH-EIC / ShiftAddViT
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆31 Updated last year
cvlab-yonsei / EWGS
An official implementation of "Network Quantization with Element-wise Gradient Scaling" (CVPR 2021) in PyTorch.
☆93 Updated 2 years ago
hustzxd / LSQuantization
The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)
☆137 Updated 4 years ago
Andrew-Tierno / QuantizedTransformer
Implementation of a Quantized Transformer Model
☆19 Updated 6 years ago
wimh966 / outlier_suppression
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆48 Updated 2 years ago
cornell-zhang / dnn-gating
Conditional channel- and precision-pruning on neural networks
☆72 Updated 5 years ago
z-hXu / ReCU
PyTorch implementation of our paper accepted by ICCV 2021 -- ReCU: Reviving the Dead Weights in Binary Neural Networks http://arxiv.org/a…
☆39 Updated 3 years ago
HuangOwen / Quantization-Variation
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…
☆45 Updated 10 months ago
htqin / DSG
This project is the official implementation of our accepted IEEE TPAMI paper Diverse Sample Generation: Pushing the Limit of Data-free Qu…
☆14 Updated 2 years ago
zyxxmu / Bi-Mask
PyTorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"
☆12 Updated 2 years ago
shaoyiHusky / SparseProgressiveDistillation
☆12 Updated last year
aojunzz / DominoSearch
☆19 Updated 3 years ago