ciodar / deep-compressionLinks
PyTorch Lightning implementation of the paper Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding. This repository allows to reproduce the main findings of the paper on MNIST and Imagenette datasets.
☆35Updated last year
Alternatives and similar repositories for deep-compression
Users that are interested in deep-compression are comparing it to the libraries listed below
Sorting:
- μNAS is a neural architecture search (NAS) system that designs small-yet-powerful microcontroller-compatible neural networks.☆82Updated 5 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆61Updated 2 years ago
- Binarize convolutional neural networks using pytorch☆149Updated 3 years ago
- Torch2Chip (MLSys, 2024)☆55Updated 10 months ago
- [CVPR 2024] Official implementation for "A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network"☆24Updated 2 months ago
- [ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark☆114Updated 2 years ago
- ☆13Updated 6 months ago
- ☆78Updated 3 years ago
- A Plug-and-play Lightweight tool for the Inference Optimization of Deep Neural networks☆47Updated 3 months ago
- Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment☆27Updated last year
- ☆25Updated last year
- Implementation of "NITI: Training Integer Neural Networks Using Integer-only Arithmetic" on arxiv☆90Updated 3 years ago
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆10Updated 2 years ago
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization.☆59Updated 2 years ago
- Differentiable Weightless Neural Networks☆31Updated 10 months ago
- [NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer☆30Updated 2 years ago
- ☆17Updated 3 years ago
- ☆43Updated 2 years ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆56Updated last year
- ☆25Updated 4 years ago
- [ICML 2021] "Auto-NBA: Efficient and Effective Search Over the Joint Space of Networks, Bitwidths, and Accelerators" by Yonggan Fu, Yonga…☆16Updated 4 years ago
- Any-Precision Deep Neural Networks (AAAI 2021)☆62Updated 5 years ago
- Recent Advances on Efficient Vision Transformers☆55Updated 3 years ago
- Reproducing Quantization paper PACT☆65Updated 3 years ago
- An open-sourced PyTorch library for developing energy efficient multiplication-less models and applications.☆14Updated last year
- Code to implement the experiments in "Post-training Quantization for Neural Networks with Provable Guarantees" by Jinjie Zhang, Yixuan Zh…☆11Updated 2 years ago
- Post-training sparsity-aware quantization☆34Updated 2 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆64Updated last year
- Soft Threshold Weight Reparameterization for Learnable Sparsity☆91Updated 2 years ago
- Bibtex for Sparsity in Deep Learning paper (https://arxiv.org/abs/2102.00554) - open for pull requests☆46Updated 3 years ago