tianyic / only_train_once_personal_footprint
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM
☆296Updated 4 months ago
Alternatives and similar repositories for only_train_once_personal_footprint:
Users that are interested in only_train_once_personal_footprint are comparing it to the libraries listed below
- This is a collection of our zero-cost NAS and efficient vision applications.☆393Updated last year
- Pytorch implementation of BRECQ, ICLR 2021☆261Updated 3 years ago
- [NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers☆177Updated last year
- ☆197Updated 3 years ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆317Updated last year
- ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting (ICCV 2021)☆293Updated 2 years ago
- 针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库☆245Updated last year
- A simple network quantization demo using pytorch from scratch.☆517Updated last year
- Model Quantization Benchmark☆779Updated this week
- Post-Training Quantization for Vision transformers.☆199Updated 2 years ago
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".☆109Updated last year
- ☆223Updated 3 years ago
- Offline Quantization Tools for Deploy.☆119Updated last year
- ☆220Updated 2 years ago
- Code repo for the paper "LLM-QAT Data-Free Quantization Aware Training for Large Language Models"☆265Updated 4 months ago
- Official repo of RepOptimizers and RepOpt-VGG☆260Updated last year
- EfficientFormerV2 [ICCV 2023] & EfficientFormer [NeurIPs 2022]☆1,006Updated last year
- Official code for our CVPR'22 paper “Vision Transformer Slimming: Multi-Dimension Searching in Continuous Optimization Space”☆247Updated last year
- A model compression and acceleration toolbox based on pytorch.☆328Updated last year
- Awasome Papers and Resources in Deep Neural Network Pruning with Source Code.☆141Updated 4 months ago
- A library for researching neural networks compression and acceleration methods.☆138Updated 4 months ago
- ☆206Updated 4 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆60Updated 5 months ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…☆114Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆422Updated last year
- [CVPR'20] ZeroQ: A Novel Zero Shot Quantization Framework☆275Updated last year
- TensorRT 2022复赛方案: 首个基于Transformer的图像重建模型MST++的TensorRT模型推断优化☆137Updated 2 years ago
- A parser, editor and profiler tool for ONNX models.☆411Updated last week
- The official implementation of the NeurIPS 2022 paper Q-ViT.☆85Updated last year