papers-submission / structured_transposable_masks
Code for ICML 2021 submission
☆35Updated 4 years ago
Alternatives and similar repositories for structured_transposable_masks:
Users that are interested in structured_transposable_masks are comparing it to the libraries listed below
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆28Updated 4 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆25Updated 3 years ago
- Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019☆54Updated 4 years ago
- BitSplit Post-trining Quantization☆49Updated 3 years ago
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆15Updated 5 years ago
- ☆43Updated last year
- ☆25Updated 3 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆40Updated 2 years ago
- ☆75Updated 2 years ago
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆12Updated last year
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆47Updated last year
- This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"☆20Updated 3 years ago
- ☆44Updated 3 years ago
- [NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network☆71Updated 4 years ago
- ☆39Updated 2 years ago
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf☆58Updated 4 years ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆32Updated last year
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated last year
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆30Updated 7 months ago
- Binary neural networks developed by Huawei Noah's Ark Lab☆29Updated 4 years ago
- ☆35Updated 5 years ago
- An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…☆10Updated 5 years ago
- ☆28Updated 4 years ago
- Official PyTorch Implementation of "Learning Architectures for Binary Networks" (ECCV2020)☆26Updated 4 years ago
- BitPack is a practical tool to efficiently save ultra-low precision/mixed-precision quantized models.☆51Updated 2 years ago
- The code for Joint Neural Architecture Search and Quantization☆13Updated 5 years ago
- [ICML 2022] ShiftAddNAS: Hardware-Inspired Search for More Accurate and Efficient Neural Networks☆16Updated 2 years ago
- [CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu…☆57Updated 3 years ago
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆69Updated 3 years ago