uber-research / permute-quantize-finetune
Using ideas from product quantization for state-of-the-art neural network compression.
☆146 · Updated 3 years ago
Alternatives and similar repositories for permute-quantize-finetune:
Users who are interested in permute-quantize-finetune are comparing it to the libraries listed below.
- A PyTorch implementation of Neural Network Compression (pruning, deep compression, channel pruning) ☆154 · Updated 9 months ago
- CNN channel pruning, LeGR, MorphNet, AMC. Codebase for paper "LeGR: Filter Pruning via Learned Global Ranking" ☆114 · Updated 5 years ago
- A PyTorch implementation of "Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights" ☆167 · Updated 5 years ago
- [CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy ☆157 · Updated 4 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework ☆276 · Updated last year
- Repository to track the progress in model compression and acceleration ☆105 · Updated 3 years ago
- Source code for the paper "Robust Quantization: One Model to Rule Them All" ☆40 · Updated 2 years ago
- PyTorch reimplementation of RegNet (Design Space Design, CVPR2020) on CIFAR10 and ImageNet ☆47 · Updated 4 years ago
- 3rd place solution for NeurIPS 2019 MicroNet challenge ☆35 · Updated 5 years ago
- Code for the accepted NeurIPS 2019 paper "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" ☆54 · Updated 4 years ago
- Class Project for 18663 - Implementation of FBNet (Hardware-Aware DNAS) ☆34 · Updated 5 years ago
- AlphaNet: Improved Training of Supernet with Alpha-Divergence ☆98 · Updated 3 years ago
- Code for "AttentiveNAS: Improving Neural Architecture Search via Attentive Sampling" ☆104 · Updated 3 years ago
- Implementation of knapsack pruning ☆27 · Updated 4 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan … ☆71 · Updated 2 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin… ☆30 · Updated last year
- Some recent quantization techniques in PyTorch ☆72 · Updated 5 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ☆94 · Updated 2 years ago
- Algorithm-hardware Co-design for Deformable Convolution ☆24 · Updated 4 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks ☆242 · Updated 2 years ago
- Graph Transforms to Quantize and Retrain Deep Neural Nets in TensorFlow ☆168 · Updated 5 years ago
- PyTorch implementation of the NeurIPS 2020 paper "Rotated Binary Neural Network" ☆81 · Updated 2 years ago
- BMXNet 2: An Open-Source Binary Neural Network Implementation Based on MXNet ☆232 · Updated 2 years ago
- Example of applying Gaussian and Laplace clipping to CNN activations ☆34 · Updated 6 years ago