mlzxy / qsparse
Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules
☆40Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for qsparse
- Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019☆54Updated 4 years ago
- Post-training sparsity-aware quantization☆33Updated last year
- ☆35Updated 4 years ago
- 3rd place solution for NeurIPS 2019 MicroNet challenge☆35Updated 5 years ago
- Code for ICML 2021 submission☆35Updated 3 years ago
- [CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy☆156Updated 4 years ago
- ☆46Updated 4 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- [CVPR 2021] Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator☆38Updated 2 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆37Updated last year
- [ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vi…☆30Updated 8 months ago
- Official PyTorch Implementation of "Learning Architectures for Binary Networks" (ECCV2020)☆26Updated 4 years ago
- All about acceleration and compression of Deep Neural Networks☆33Updated 5 years ago
- Implementation for the paper "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization"☆73Updated 4 years ago
- ☆11Updated 2 years ago
- ☆28Updated 4 years ago
- This repository represents training examples for the CVPR 2018 paper "SYQ:Learning Symmetric Quantization For Efficient Deep Neural Netwo…☆32Updated 5 years ago
- Simple Training and Deployment of Fast End-to-End Binary Networks☆159Updated 2 years ago
- ☆47Updated 4 years ago
- Collections of model quantization algorithms. Any issues, please contact Peng Chen (blueardour@gmail.com)☆68Updated 3 years ago
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆15Updated 4 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 2 years ago
- BitSplit Post-trining Quantization☆47Updated 2 years ago
- A Unified, Systematic Framework of Structured Weight Pruning for DNNs☆22Updated 6 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆95Updated 2 years ago
- [NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network☆69Updated 4 years ago
- This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"☆20Updated 3 years ago
- ☆53Updated 5 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆20Updated 4 years ago
- Code for the ICLR2020 "Training Binary Neural Networks with Real-to-Binary Convolutions☆35Updated 4 years ago