uber-research / permute-quantize-finetune
Using ideas from product quantization for state-of-the-art neural network compression.
☆146Updated 3 years ago
Alternatives and similar repositories for permute-quantize-finetune:
Users that are interested in permute-quantize-finetune are comparing it to the libraries listed below
- code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"☆104Updated 3 years ago
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆276Updated last year
- A PyTorch implementation of "Incremental Network Quantization: Towards Lossless CNNs with Low-Precision Weights"☆167Updated 5 years ago
- AlphaNet Improved Training of Supernet with Alpha-Divergence☆98Updated 3 years ago
- Class Project for 18663 - Implementation of FBNet (Hardware-Aware DNAS)☆34Updated 5 years ago
- A research library for pytorch-based neural network pruning, compression, and more.☆160Updated 2 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization☆94Updated 2 years ago
- Code for https://arxiv.org/abs/1810.04622☆140Updated 5 years ago
- CNN channel pruning, LeGR, MorphNet, AMC. Codebase for paper "LeGR: Filter Pruning via Learned Global Ranking"☆114Updated 5 years ago
- [CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy☆157Updated 4 years ago
- ☆35Updated 5 years ago
- [ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, …☆167Updated 3 years ago
- ☆56Updated 4 years ago
- Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019☆54Updated 4 years ago
- [CVPR 2021] Searching by Generating: Flexible and Efficient One-Shot NAS with Architecture Generator☆38Updated 2 years ago
- A Pytorch implementation of Neural Network Compression (pruning, deep compression, channel pruning)☆155Updated 8 months ago
- ☆67Updated 5 years ago
- Pytorch implementation of our paper accepted by NeurIPS 2020 -- Rotated Binary Neural Network☆81Updated 2 years ago
- A package to make do Network Slimming a little easier☆47Updated 3 years ago
- ☆70Updated 5 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆40Updated last year
- Some recent Quantizing techniques on PyTorch☆72Updated 5 years ago
- Neural Architecture Transfer (Arxiv'20), PyTorch Implementation☆156Updated 4 years ago
- Official implementation of "UNAS: Differentiable Architecture Search Meets Reinforcement Learning", CVPR 2020 Oral☆60Updated last year
- Algorithm-hardware Co-design for Deformable Convolution☆24Updated 4 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated last year
- 3rd place solution for NeurIPS 2019 MicroNet challenge☆35Updated 5 years ago
- Batch normalization fusion for PyTorch☆196Updated 4 years ago
- PyTorch reimplementation of RegNet (Design Space Design, CVPR2020) on CIFAR10 and ImageNet☆47Updated 4 years ago
- Soft Threshold Weight Reparameterization for Learnable Sparsity☆88Updated 2 years ago