ciodar / deep-compressionLinks
PyTorch Lightning implementation of the paper Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. This repository allows to reproduce the main findings of the paper on MNIST and Imagenette datasets.
☆35Updated last year
Alternatives and similar repositories for deep-compression
Users that are interested in deep-compression are comparing it to the libraries listed below
Sorting:
- Binarize convolutional neural networks using pytorch☆149Updated 3 years ago
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.☆59Updated 2 years ago
- Bibtex for Sparsity in Deep Learning paper (https://arxiv.org/abs/2102.00554) - open for pull requests☆46Updated 3 years ago
- ☆13Updated 6 months ago
- A collection of research papers on efficient training of DNNs☆70Updated 3 years ago
- ☆44Updated 2 years ago
- We have implemented a framework that supports developers to structured prune neural networks of Tensorflow Models☆28Updated last year
- Torch2Chip (MLSys, 2024)☆55Updated 10 months ago
- The official implementation of TinyTrain [ICML '24]☆24Updated last year
- Recent Advances on Efficient Vision Transformers☆55Updated 3 years ago
- Model compression by constrained optimization, using the Learning-Compression (LC) algorithm☆71Updated 4 years ago
- Soft Threshold Weight Reparameterization for Learnable Sparsity☆91Updated 2 years ago
- Reproducing Quantization paper PACT☆65Updated 3 years ago
- [CVPR 2024] Official implementation for "A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network"☆24Updated 2 months ago
- ☆78Updated 3 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆62Updated 5 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆61Updated 2 years ago
- ☆43Updated 2 years ago
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆82Updated 5 years ago
- A runtime fault injection tool for PyTorch☆126Updated last year
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorch☆17Updated last year
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Updated 2 years ago
- In progress.☆68Updated last year
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆64Updated last year
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Updated 4 years ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆70Updated last year
- Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]☆34Updated 4 years ago
- ☆25Updated last year
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆33Updated 2 years ago
- ☆25Updated 4 years ago