Nota-NetsPresso / SNPLinks
Structured Neuron Level Pruning to compress Transformer-based models [ECCV'24]
☆16Updated last year
Alternatives and similar repositories for SNP
Users that are interested in SNP are comparing it to the libraries listed below
Sorting:
- A library for training, compressing and deploying computer vision models (including ViT) with edge devices☆73Updated 3 months ago
- ☆90Updated last year
- The official NetsPresso Python package.☆47Updated last month
- Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]☆89Updated last year
- OwLite is a low-code AI model compression toolkit for AI models.☆51Updated last month
- ☆56Updated 3 years ago
- [ICLR 2024] The Need for Speed: Pruning Transformers with One Recipe☆31Updated last year
- It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher [CVPR 2022 Oral]☆29Updated 3 years ago
- A performance library for machine learning applications.☆185Updated 2 years ago
- Awesome Pruning. ✅ Curated Resources for Neural Network Pruning.☆172Updated last year
- Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]☆33Updated 4 years ago
- Layer-wise Pruning of Transformer Heads for Efficient Language Modeling☆22Updated 3 years ago
- In progress.☆67Updated last year
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Updated 2 months ago
- Implement of Dynamic Model Pruning with Feedback with pytorch☆41Updated 3 years ago
- Recent Advances on Efficient Vision Transformers☆55Updated 2 years ago
- Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.☆48Updated 4 years ago
- 2022_AAAI accepted paper, NaturalInversion:Data-Free Image Synthesis Improving Real-World Consistency☆10Updated 3 years ago
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers☆192Updated 2 years ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆66Updated last year
- Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)☆13Updated last year
- Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter Pruning☆19Updated 3 years ago
- Official PyTorch Implementation of HELP: Hardware-adaptive Efficient Latency Prediction for NAS via Meta-Learning (NeurIPS 2021 Spotlight…☆63Updated last year
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆50Updated last year
- Official Pytorch implementation of Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference (ICLR …☆54Updated 3 years ago
- [CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"☆45Updated 9 months ago
- read 1 paper everyday (only weekday)☆56Updated 4 years ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆78Updated last year
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆123Updated 6 months ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆34Updated 2 years ago