[CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
☆161Jun 16, 2020Updated 5 years ago
Alternatives and similar repositories for apq
Users that are interested in apq are comparing it to the libraries listed below.
- [CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆405Feb 26, 2021Updated 5 years ago
- ☆28Oct 21, 2020Updated 5 years ago
- PyTorch implementation of EdMIPS: https://arxiv.org/pdf/2004.05795.pdf☆60Jul 27, 2020Updated 5 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆450Nov 22, 2023Updated 2 years ago
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,947Dec 14, 2023Updated 2 years ago
- Train neural networks with joint quantization and pruning on both weights and activations using any pytorch modules☆42Sep 19, 2022Updated 3 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆21Nov 15, 2020Updated 5 years ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware☆1,448Aug 30, 2024Updated last year
- BitSplit Post-training Quantization☆50Dec 20, 2021Updated 4 years ago
- XNAS: An effective, modular, and flexible Neural Architecture Search (NAS) framework.☆47Jun 29, 2022Updated 3 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆98Jun 10, 2021Updated 4 years ago
- ☆267Oct 30, 2019Updated 6 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Nov 12, 2020Updated 5 years ago
- [ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing☆338Jul 14, 2024Updated last year
- Code for paper "Automatic Neural Network Compression by Sparsity-Quantization Joint Learning: A Constrained Optimization-based Approach"☆18Jul 9, 2020Updated 5 years ago
- ☆42Dec 15, 2022Updated 3 years ago
- [SIGMETRICS 2022] One Proxy Device Is Enough for Hardware-Aware Neural Architecture Search☆13Nov 3, 2021Updated 4 years ago
- [CVPR 2020] This project is the PyTorch implementation of our accepted CVPR 2020 paper : forward and backward information retention for a…☆180Mar 14, 2020Updated 6 years ago
- ☆166Mar 25, 2023Updated 3 years ago
- The code for our paper "Neural Architecture Search as Program Transformation Exploration"☆16Apr 28, 2021Updated 4 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Jan 3, 2022Updated 4 years ago
- Model Quantization Benchmark☆864Apr 20, 2025Updated 11 months ago
- SPOS (Single Path One-Shot Neural Architecture Search with Uniform Sampling) rebuilt in PyTorch with a single GPU.☆244Dec 17, 2021Updated 4 years ago
- code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"☆105Sep 29, 2021Updated 4 years ago
- BISMO: A Scalable Bit-Serial Matrix Multiplication Overlay for Reconfigurable Computing☆149Dec 25, 2019Updated 6 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,516Jun 7, 2020Updated 5 years ago
- (CVPR 2021, Oral) Dynamic Slimmable Network☆231Dec 31, 2021Updated 4 years ago
- [CVPR 2021] OPANAS: One-Shot Path Aggregation Network Architecture Search for Object Detection☆45Feb 21, 2024Updated 2 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆460May 15, 2023Updated 2 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆36Jun 29, 2023Updated 2 years ago
- pytorch implementation of "Differentiable Soft Quantization: Bridging Full-Precision and Low-Bit Neural Networks"☆129Jan 2, 2020Updated 6 years ago
- ☆19May 28, 2020Updated 5 years ago
- Binary Neural Network on IceStick FPGA.☆55Jul 11, 2018Updated 7 years ago
- The PyTorch implementation of Learned Step size Quantization (LSQ) in ICLR2020 (unofficial)☆138Nov 19, 2020Updated 5 years ago
- Single-Path NAS: Designing Hardware-Efficient ConvNets in less than 4 Hours☆395Dec 14, 2020Updated 5 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Jul 7, 2022Updated 3 years ago
- Automated deep learning algorithms implemented in PyTorch.☆1,589Apr 24, 2022Updated 3 years ago
- (CVPR 2020) Block-wisely Supervised Neural Architecture Search with Knowledge Distillation☆235Sep 23, 2021Updated 4 years ago
- (ECCV'2020 Oral) EagleEye: Fast Sub-net Evaluation for Efficient Neural Network Pruning☆309Dec 8, 2022Updated 3 years ago