[CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
☆160 · Jun 16, 2020 · Updated 5 years ago
Alternatives and similar repositories for apq
Users who are interested in apq compare it to the libraries listed below.
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision ☆404 · Feb 26, 2021 · Updated 5 years ago
- ☆28 · Oct 21, 2020 · Updated 5 years ago
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf ☆60 · Jul 27, 2020 · Updated 5 years ago
- BitSplit Post-training Quantization ☆50 · Dec 20, 2021 · Updated 4 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices ☆449 · Nov 22, 2023 · Updated 2 years ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment ☆1,940 · Dec 14, 2023 · Updated 2 years ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware ☆1,449 · Aug 30, 2024 · Updated last year
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆98 · Jun 10, 2021 · Updated 4 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing ☆336 · Jul 14, 2024 · Updated last year
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020 ☆22 · Nov 15, 2020 · Updated 5 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga… ☆16 · Jan 3, 2022 · Updated 4 years ago
- [SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search ☆13 · Nov 3, 2021 · Updated 4 years ago
- [CVPR 2020] This project is the PyTorch implementation of our accepted CVPR 2020 paper: forward and backward information retention for a… ☆181 · Mar 14, 2020 · Updated 5 years ago
- ☆265 · Oct 30, 2019 · Updated 6 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020) ☆26 · Nov 12, 2020 · Updated 5 years ago
- XNAS: An effective, modular, and flexible Neural Architecture Search (NAS) framework. ☆47 · Jun 29, 2022 · Updated 3 years ago
- ☆42 · Dec 15, 2022 · Updated 3 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is… ☆24 · Aug 17, 2021 · Updated 4 years ago
- Model Quantization Benchmark ☆858 · Apr 20, 2025 · Updated 10 months ago
- ☆166 · Mar 25, 2023 · Updated 2 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration" ☆16 · Apr 28, 2021 · Updated 4 years ago
- (CVPR 2021, Oral) Dynamic Slimmable Network ☆231 · Dec 31, 2021 · Updated 4 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019) ☆1,516 · Jun 7, 2020 · Updated 5 years ago
- SPOS (Single Path One-Shot Neural Architecture Search with Uniform Sampling) rebuilt in Pytorch with single GPU. ☆244 · Dec 17, 2021 · Updated 4 years ago
- [JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion ☆41 · Feb 26, 2021 · Updated 5 years ago
- [ICLR 2022 Oral] F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization ☆93 · May 5, 2022 · Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming ☆35 · Jun 29, 2023 · Updated 2 years ago
- Code for paper "Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach" ☆18 · Jul 9, 2020 · Updated 5 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM. ☆453 · May 15, 2023 · Updated 2 years ago
- ☆57 · Dec 8, 2020 · Updated 5 years ago
- code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling" ☆105 · Sep 29, 2021 · Updated 4 years ago
- [CVPR 2021] OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection ☆45 · Feb 21, 2024 · Updated 2 years ago
- Neural Network Quantization With Fractional Bit-widths ☆11 · Feb 19, 2021 · Updated 5 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing ☆149 · Dec 25, 2019 · Updated 6 years ago
- Binary Neural Network on IceStick FPGA. ☆54 · Jul 11, 2018 · Updated 7 years ago
- Automated deep learning algorithms implemented in PyTorch. ☆1,586 · Apr 24, 2022 · Updated 3 years ago
- (CVPR 2020) Block-wisely Supervised Neural Architecture Search with Knowledge Distillation ☆237 · Sep 23, 2021 · Updated 4 years ago
- The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial) ☆139 · Nov 19, 2020 · Updated 5 years ago
- pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks" ☆128 · Jan 2, 2020 · Updated 6 years ago