dongkuanx27 / SparseBERT
(SparseBERT) Rethinking Network Pruning -- under the Pre-train and Fine-tune Paradigm (NAACL'21)
☆8 Updated 3 years ago
Alternatives and similar repositories for SparseBERT
Users who are interested in SparseBERT compare it to the libraries listed below.
- Implementation of "Binary Graph Convolutional Network", CVPR 2021, and TPAMI 2024. ☆26 Updated last year
- This PyTorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022). ☆46 Updated 2 years ago
- ICLR 2021 ☆48 Updated 4 years ago
- ☆64 Updated 4 years ago
- Paper lists of neural architecture search (NAS) ☆134 Updated 3 years ago
- ☆12 Updated last year
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers ☆190 Updated 2 years ago
- [NeurIPS 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration ☆31 Updated 2 years ago
- Open Source Neural Machine Translation in PyTorch ☆17 Updated 6 years ago
- [ACL 2022] Structured Pruning Learns Compact and Accurate Models https://arxiv.org/abs/2204.00408 ☆196 Updated 2 years ago
- ☆10 Updated last year
- [ICLR-2020] Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers. ☆31 Updated 5 years ago
- Conditional channel- and precision-pruning on neural networks ☆72 Updated 5 years ago
- Vision Transformer Pruning ☆57 Updated 3 years ago
- PyTorch implementation of "Cluster-GCN: An Efficient Algorithm for Training Deep and Large Graph Convolutional Networks" ☆14 Updated 2 years ago
- [KDD'22] Learned Token Pruning for Transformers ☆98 Updated 2 years ago
- [ICML'21 Oral] I-BERT: Integer-only BERT Quantization ☆253 Updated 2 years ago
- ☆13 Updated 2 years ago
- BSQ: Exploring Bit-Level Sparsity for Mixed-Precision Neural Network Quantization (ICLR 2021) ☆40 Updated 4 years ago
- ☆15 Updated 3 years ago
- Implementation of a Quantized Transformer Model ☆19 Updated 6 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing ☆335 Updated last year
- A simple reimplementation of "Online Knowledge Distillation via Collaborative Learning" in PyTorch ☆49 Updated 2 years ago
- Source code for IJCAI 2022 Long paper: Parameter-Efficient Sparsity for Large Language Models Fine-Tuning. ☆14 Updated 3 years ago
- A PyTorch & Keras implementation and demo of Fastformer. ☆189 Updated 2 years ago
- Training models with ternary quantized weights using PyTorch ☆13 Updated 6 years ago
- Code for our paper "Binary Graph Neural Networks", CVPR 2021 ☆37 Updated 4 years ago
- [AAAI '23] PINAT: A Permutation INvariance Augmented Transformer for NAS Predictor ☆31 Updated 2 years ago
- ☆13 Updated last year
- This PyTorch package implements MoEBERT: from BERT to Mixture-of-Experts via Importance-Guided Adaptation (NAACL 2022). ☆108 Updated 3 years ago