ciodar / deep-compression
PyTorch Lightning implementation of the paper Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. This repository allows to reproduce the main findings of the paper on MNIST and Imagenette datasets.
☆25Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for deep-compression
- ☆24Updated 2 years ago
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆76Updated 3 years ago
- Soft Threshold Weight Reparameterization for Learnable Sparsity☆87Updated last year
- [ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…☆29Updated 2 years ago
- tiny-imagenet dataset downloader & reader using tensorflow_datasets (tfds) api☆21Updated 5 years ago
- Recent Advances on Efficient Vision Transformers☆48Updated last year
- Turning float tensors to binary tensors according to IEEE-754 standard.☆37Updated 5 years ago
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.☆56Updated last year
- A collection of research papers on efficient training of DNNs☆68Updated 2 years ago
- A Plug-and-play Lightweight tool for the Inference Optimization of Deep Neural networks☆37Updated 3 weeks ago
- Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022☆47Updated last year
- Post-training sparsity-aware quantization☆33Updated last year
- Binarize convolutional neural networks using pytorch☆134Updated 2 years ago
- CMix-NN: Mixed Low-Precision CNN Library for Memory-Constrained Edge Devices☆39Updated 4 years ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆26Updated 3 years ago
- Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]☆30Updated 2 years ago
- In progress.☆65Updated 7 months ago
- ☆17Updated 2 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 4 years ago
- ☆68Updated 2 years ago
- Model compression by constrained optimization, using the Learning-Compression (LC) algorithm☆69Updated 2 years ago
- ☆24Updated last year
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆31Updated last year
- Reproducing Quantization paper PACT☆56Updated 2 years ago
- ☆19Updated 3 years ago
- Code to implement the experiments in "Post-training Quantization for Neural Networks with Provable Guarantees" by Jinjie Zhang, Yixuan Zh…☆12Updated last year
- [ICLR-2020] Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers.☆31Updated 4 years ago
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆10Updated last year
- Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.☆134Updated 2 months ago
- [ICLR 2021] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yining Ding, Vi…☆30Updated 8 months ago