tianyic / only_train_once_personal_footprintView external linksLinks
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM
☆310Sep 16, 2024Updated last year
Alternatives and similar repositories for only_train_once_personal_footprint
Users that are interested in only_train_once_personal_footprint are comparing it to the libraries listed below
Sorting:
- OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM☆50Oct 10, 2024Updated last year
- [T-PAMI'23] PAGCP for the compression of YOLOv5☆122Apr 13, 2023Updated 2 years ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,257Sep 7, 2025Updated 5 months ago
- [ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.☆15Aug 25, 2023Updated 2 years ago
- An implementation of <Group Fisher Pruning for Practical Network Compression> based on pytorch and mmcv☆18Nov 21, 2021Updated 4 years ago
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers☆192Feb 28, 2023Updated 2 years ago
- A curated list of neural network pruning resources.☆2,490Apr 4, 2024Updated last year
- [NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Support Llama-3/3.1, Llama-2, LLaMA, BLOOM, Vicuna, Baich…☆1,106Oct 7, 2024Updated last year
- To appear in the 11th International Conference on Learning Representations (ICLR 2023).☆18Feb 24, 2023Updated 2 years ago
- Model Quantization Benchmark☆858Apr 20, 2025Updated 9 months ago
- A generic code base for neural network pruning, especially for pruning at initialization.☆31Sep 3, 2022Updated 3 years ago
- Structural Pruning for LLaMA☆54May 20, 2023Updated 2 years ago
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆31Aug 15, 2024Updated last year
- Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"☆321Mar 4, 2025Updated 11 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆81Jul 23, 2024Updated last year
- [ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…☆33Jan 20, 2022Updated 4 years ago
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated last year
- ☆21Oct 1, 2024Updated last year
- Awesome Pruning. ✅ Curated Resources for Neural Network Pruning.☆174Aug 30, 2024Updated last year
- [NeurIPS 2023] Structural Pruning for Diffusion Models☆216Jul 8, 2024Updated last year
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,661Jun 11, 2024Updated last year
- ☆29Jun 11, 2023Updated 2 years ago
- Improving Post Training Neural Quantization: Layer-wise Calibration and Integer Programming☆35Jun 29, 2023Updated 2 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆360Apr 11, 2023Updated 2 years ago
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight)☆18Dec 5, 2024Updated last year
- [ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment☆1,939Dec 14, 2023Updated 2 years ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,781Mar 28, 2024Updated last year
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆55Dec 1, 2023Updated 2 years ago
- An onnx-based quantitation tool.☆71Jan 8, 2024Updated 2 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆51Jul 10, 2023Updated 2 years ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆864Dec 24, 2025Updated last month
- Knowledge Distillation Toolbox for Semantic Segmentation☆17Nov 20, 2022Updated 3 years ago
- Group Fisher Pruning for Practical Network Compression(ICML2021)☆161May 24, 2023Updated 2 years ago
- PyTorch implementation of "Dynamic Structure Pruning for Compressing CNNs" (AAAI 2023 Oral)☆27Jan 15, 2024Updated 2 years ago
- [ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization☆28Dec 6, 2023Updated 2 years ago
- A model compression and acceleration toolbox based on pytorch.☆333Jan 12, 2024Updated 2 years ago
- This is a collection of our zero-cost NAS and efficient vision applications.☆448Aug 21, 2023Updated 2 years ago
- An official implementation of the paper "How Sparse Can We Prune A Deep Network: A Fundamental Limit Viewpoint".☆29Nov 13, 2024Updated last year
- IntLLaMA: A fast and light quantization solution for LLaMA☆18Jul 21, 2023Updated 2 years ago