yaozhewei / MLPruningLinks

MLPruning, PyTorch, NLP, BERT, Structured Pruning

☆20

Alternatives and similar repositories for MLPruning

Users that are interested in MLPruning are comparing it to the libraries listed below

Sorting:

VITA-Group / Structure-LTH
[ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…
☆33Updated 2 years ago
huggingface / block_movement_pruning
Block Sparse movement pruning
☆80Updated 4 years ago
VITA-Group / SFW-Once-for-All-Pruning
[ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…
☆30Updated 3 years ago
varunnair18 / FISH
Code for "Training Neural Networks with Fixed Sparse Masks" (NeurIPS 2021).
☆58Updated 3 years ago
htqin / BiBERT
This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.
☆88Updated 2 years ago
VITA-Group / SMC-Bench
[ICLR 2023] "Sparsity May Cry: Let Us Fail (Current) Sparse Neural Networks Together!" Shiwei Liu, Tianlong Chen, Zhenyu Zhang, Xuxi Chen…
☆28Updated last year
dguo98 / DiffPruning
Parameter Efficient Transfer Learning with Diff Pruning
☆73Updated 4 years ago
yxli2123 / LoSparse
☆57Updated last year
QingruZhang / PLATON
This pytorch package implements PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance (ICML 2022).
☆46Updated 2 years ago
VITA-Group / ToST
[ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, ying Ding, and Zhangyang Wang
☆28Updated 2 years ago
yaozhewei / HAP
☆43Updated last year
IST-DASLab / WoodFisher
Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)
☆52Updated 4 years ago
microsoft / Stochastic-Mixture-of-Experts
This package implements THOR: Transformer with Stochastic Experts.
☆65Updated 3 years ago
VITA-Group / UVC
[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…
☆53Updated last year
Zhen-Dong / BitPack
BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.
☆54Updated 2 years ago
BradMcDanel / sdgp
☆10Updated 3 years ago
AIoT-MLSys-Lab / CATE
[ICML 2021 Oral] "CATE: Computation-aware Neural Architecture Encoding with Transformers" by Shen Yan, Kaiqiang Song, Fei Liu, Mi Zhang
☆19Updated 4 years ago
wimh966 / outlier_suppression
The official PyTorch implementation of the NeurIPS2022 (spotlight) paper, Outlier Suppression: Pushing the Limit of Low-bit Transformer L…
☆47Updated 2 years ago
kssteven418 / LTP
[KDD'22] Learned Token Pruning for Transformers
☆98Updated 2 years ago
gilshm / sparq
Post-training sparsity-aware quantization
☆34Updated 2 years ago
JingtongSu / sanity-checking-pruning
Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot
☆42Updated 4 years ago
chenjoya / dropit
DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)
☆31Updated 2 years ago
asappresearch / flop
Pytorch library for factorized L0-based pruning.
☆45Updated last year
lushleaf / Network-Pruning-Greedy-Forward-Selection
Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection
☆21Updated 4 years ago
Shiweiliuiiiiiii / In-Time-Over-Parameterization
[ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…
☆45Updated last year
microsoft / fnl_paper
Factorized Neural Layers
☆29Updated last year
LiuXiaoxuanPKU / GACT-ICML
☆42Updated 2 years ago
varun19299 / rigl-reproducibility
Reproducing RigL (ICML 2020) as a part of ML Reproducibility Challenge 2020
☆28Updated 3 years ago
kssteven418 / BigLittleDecoder
[NeurIPS'23] Speculative Decoding with Big Little Decoder
☆92Updated last year
mlpen / LookupFFN
☆20Updated last year