ciodar / deep-compression
PyTorch Lightning implementation of the paper Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. This repository allows to reproduce the main findings of the paper on MNIST and Imagenette datasets.
☆27Updated last month
Alternatives and similar repositories for deep-compression:
Users that are interested in deep-compression are comparing it to the libraries listed below
- Binarize convolutional neural networks using pytorch☆139Updated 2 years ago
- ☆74Updated 2 years ago
- A collection of research papers on efficient training of DNNs☆69Updated 2 years ago
- ☆19Updated 10 months ago
- Torch2Chip (MLSys, 2024)☆51Updated this week
- Post-training sparsity-aware quantization☆34Updated last year
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.☆58Updated last year
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆78Updated 4 years ago
- Recent Advances on Efficient Vision Transformers☆49Updated 2 years ago
- Reproducing Quantization paper PACT☆59Updated 2 years ago
- FPGA-based hardware acceleration for dropout-based Bayesian Neural Networks.☆22Updated last year
- Code to implement the experiments in "Post-training Quantization for Neural Networks with Provable Guarantees" by Jinjie Zhang, Yixuan Zh…☆11Updated last year
- [ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…☆29Updated 3 years ago
- Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022☆51Updated last year
- [Neurips 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration☆31Updated last year
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆54Updated 10 months ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- The official implementation of TinyTrain [ICML '24]☆21Updated 6 months ago
- Turning float tensors to binary tensors according to IEEE-754 standard.☆38Updated 5 years ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated 10 months ago
- [TMLR] Official PyTorch implementation of paper "Quantization Variation: A New Perspective on Training Transformers with Low-Bit Precisio…☆39Updated 4 months ago
- ☆20Updated 2 years ago
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 4 years ago
- Code for "Structured Sparsity Inducing Adaptive Optimizers for Deep Learning" in PyTorch☆18Updated 3 years ago
- ☆25Updated 3 years ago
- Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.☆142Updated 5 months ago
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆10Updated last year
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆79Updated 2 years ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆32Updated last year
- TBNv2: Convolutional Neural Network With Ternary Inputs and Binary Weights☆17Updated 4 years ago