ciodar / deep-compression
PyTorch Lightning implementation of the paper Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. This repository allows to reproduce the main findings of the paper on MNIST and Imagenette datasets.
☆22Updated 3 months ago
Related projects: ⓘ
- ☆30Updated 9 months ago
- Torch2Chip (MLSys, 2024)☆49Updated 3 weeks ago
- ☆23Updated 2 years ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa…☆30Updated last year
- ☆10Updated last year
- ☆15Updated 3 months ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆54Updated 6 months ago
- Official PyTorch implementation of paper "Variation-aware Vision Transformer Quantization"☆33Updated 3 months ago
- Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment☆23Updated 4 months ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆19Updated 2 years ago
- Official implementation for paper LIMPQ, "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance", ECCV 2022☆44Updated last year
- ☆18Updated last year
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆29Updated 9 months ago
- Quantization in the Jagged Loss Landscape of Vision Transformers☆11Updated 11 months ago
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.☆57Updated 11 months ago
- ☆42Updated 7 months ago
- Official implementation of Neurips 2020 "Sparse Weight Activation Training" paper.☆26Updated 3 years ago
- BinaryViT: Pushing Binary Vision Transformers Towards Convolutional Models☆26Updated 7 months ago
- [ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…☆29Updated 2 years ago
- A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).☆22Updated 2 years ago
- ☆17Updated last year
- Position-based Scaled Gradient for Model Quantization and Pruning Code (NeurIPS 2020)☆26Updated 3 years ago
- A collection of research papers on efficient training of DNNs☆67Updated 2 years ago
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆90Updated last year
- We have implemented a framework that supports developers to structured prune neural networks of Tensorflow Models☆26Updated 2 years ago
- Reproducing Quantization paper PACT☆55Updated 2 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆14Updated 2 years ago
- [CVPR 2024] Code for A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network☆12Updated 3 months ago
- Personal Digest of NAS (Under Construction 🛠)☆25Updated 3 years ago
- TBNv2: Convolutional Neural Network With Ternary Inputs and Binary Weights☆15Updated 4 years ago