microsoft / only_train_onceLinks

OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM

☆47

Alternatives and similar repositories for only_train_once

Users that are interested in only_train_once are comparing it to the libraries listed below

Sorting:

HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆33Updated 10 months ago
ModelTC / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆39Updated last year
hikvision-research / Unified-Normalization
# Unified Normalization (ACM MM'22) By Qiming Yang, Kai Zhang, Chaoxiang Lan, Zhi Yang, Zheyang Li, Wenming Tan, Jun Xiao, and Shiliang P…
☆34Updated 2 years ago
facebookresearch / DepthShrinker
[ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …
☆71Updated 3 years ago
Qualcomm-AI-research / oscillations-qat
☆76Updated 2 years ago
MingSun-Tse / TPP
[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)
☆33Updated 2 years ago
ziplab / EcoFormer
[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"
☆72Updated 2 years ago
NVlabs / SMCP
☆21Updated 2 years ago
htqin / BiBERT
This project is the official implementation of our accepted ICLR 2022 paper BiBERT: Accurate Fully Binarized BERT.
☆88Updated 2 years ago
MingSun-Tse / Regularization-Pruning
[ICLR'21] Neural Pruning via Growing Regularization (PyTorch)
☆83Updated 4 years ago
VITA-Group / UVC
[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…
☆53Updated last year
ZiweiWangTHU / GMPQ
This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…
☆25Updated 3 years ago
htqin / BiBench
[ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…
☆56Updated last year
jaeho-lee / layer-adaptive-sparsity
In progress.
☆65Updated last year
HuangOwen / Quantization-Variation
[TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…
☆45Updated 9 months ago
snap-research / F8Net
[ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization
☆95Updated 3 years ago
facebookresearch / bit
Code repo for the paper BiT Robustly Binarized Multi-distilled Transformer
☆109Updated 2 years ago
ziplab / QTool
Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)
☆71Updated 3 years ago
MingSun-Tse / Why-the-State-of-Pruning-so-Confusing
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…
☆40Updated 2 years ago
Jangho-Kim / PSG-pytorch
Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)
☆26Updated 4 years ago
htqin / DSG
This project is the official implementation of our accepted IEEE TPAMI paper Diverse Sample Generation: Pushing the Limit of Data-free Qu…
☆14Updated 2 years ago
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆133Updated 3 years ago
VITA-Group / SFW-Once-for-All-Pruning
[ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…
☆30Updated 3 years ago
bytedance / MRECG
☆35Updated 2 years ago
fmfi-compbio / admm-pruning
☆28Updated 11 months ago
VainF / Isomorphic-Pruning
[ECCV 2024] Isomorphic Pruning for Vision Models
☆70Updated 11 months ago
ModelTC / L2_Compression
☆13Updated last year
kriskrisliu / NoisyQuant
An official implement of CVPR 2023 paper - NoisyQuant: Noisy Bias-Enhanced Post-Training Activation Quantization for Vision Transformers
☆21Updated last year
ModelTC / Outlier_Suppression_Plus
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆46Updated last year
GATECH-EIC / SuperTickets
[ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning
☆20Updated 3 years ago