jakc4103 / piecewise-quantization
PyTorch implementation of Near-Lossless Post-Training Quantization of Deep Neural Networks via a Piecewise Linear Approximation
☆21Updated 4 years ago
Related projects:
- ☆28Updated 3 years ago
- ☆54Updated 3 years ago
- ☆37Updated this week
- This is an implementation of YOLO using the LSQ network quantization method.☆18Updated 2 years ago
- Code for "Fast Sparse ConvNets" CVPR2020 submissions☆13Updated 4 years ago
- Cheng-Hao Tu, Jia-Hong Lee, Yi-Ming Chan and Chu-Song Chen, "Pruning Depthwise Separable Convolutions for MobileNet Compression," Interna…☆16Updated 3 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆37Updated last year
- ☆10Updated this week
- ☆35Updated 4 years ago
- This repository contains the PyTorch scripts to train mixed-precision networks for microcontroller deployment, based on the memory contr…☆47Updated 4 months ago
- PyTorch implementation of our paper accepted by IEEE TNNLS, 2021 -- Network Pruning using Adaptive Exemplar Filters☆21Updated 3 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 3 years ago
- PyTorch implementation of Towards Efficient Training for Neural Network Quantization☆15Updated 4 years ago
- Post-training sparsity-aware quantization☆32Updated last year
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 3 years ago
- Official PyTorch Implementation of "Learning Architectures for Binary Networks" (ECCV2020)☆26Updated 3 years ago
- Class Project for 18663 - Implementation of FBNet (Hardware-Aware DNAS)☆32Updated 4 years ago
- Batch Normalization Auto-fusion for PyTorch☆32Updated 4 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆26Updated 2 years ago
- This repository represents training examples for the CVPR 2018 paper "SYQ:Learning Symmetric Quantization For Efficient Deep Neural Netwo…☆32Updated 5 years ago
- This is the PyTorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆24Updated 3 years ago
- Paper list on model compression and acceleration☆26Updated 5 years ago
- A Unified, Systematic Framework of Structured Weight Pruning for DNNs☆21Updated 6 years ago
- 3rd place solution for NeurIPS 2019 MicroNet challenge☆35Updated 4 years ago
- ☆28Updated 2 years ago
- BitSplit Post-training Quantization☆46Updated 2 years ago
- Explained QNNPACK Implementation☆20Updated 4 years ago
- Repository containing pruned models and related information☆35Updated 3 years ago
- ☆47Updated 4 years ago