snu-mllab / Efficient-CNN-Depth-Compression

Official PyTorch implementation of "Efficient Latency-Aware CNN Depth Compression via Two-Stage Dynamic Programming" (ICML'23)

☆13

Alternatives and similar repositories for Efficient-CNN-Depth-Compression

Users who are interested in Efficient-CNN-Depth-Compression are comparing it to the libraries listed below.

hikvision-research / SAViT
☆13 Updated 2 years ago
jaeho-lee / layer-adaptive-sparsity
In progress.
☆66 Updated last year
ModelTC / L2_Compression
☆13 Updated last year
MingSun-Tse / TPP
[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)
☆34 Updated 2 years ago
MAC-AutoML / OMPQ
☆25 Updated 3 years ago
NVlabs / SMCP
☆22 Updated 3 years ago
1hunters / LIMPQ
Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"
☆62 Updated 2 years ago
megvii-research / TPS-CVPR2023
☆47 Updated 2 years ago
shawnricecake / search-llm
[NeurIPS 2024] Search for Efficient LLMs
☆15 Updated 10 months ago
VITA-Group / Random_Pruning
[ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo…
☆77 Updated 2 years ago
HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆35 Updated last year
ziplab / SAQ
This is the official PyTorch implementation for "Sharpness-aware Quantization for Deep Neural Networks".
☆43 Updated 3 years ago
GATECH-EIC / ShiftAddViT
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30 Updated last year
CownowAn / DaSS
Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)
☆26 Updated last year
Qualcomm-AI-research / oscillations-qat
☆78 Updated 3 years ago
ThisisBillhe / torch_quantizer
torch_quantizer is an out-of-the-box quantization tool for PyTorch models on the CUDA backend, specially optimized for diffusion models.
☆22 Updated last year
yaozhewei / HAP
☆43 Updated last year
ziplab / EcoFormer
[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"
☆74 Updated 3 years ago
MingSun-Tse / Why-the-State-of-Pruning-so-Confusing
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…
☆40 Updated 2 months ago
sseung0703 / EKG
Ensemble Knowledge Guided Sub-network Search and Fine-tuning for Filter Pruning
☆19 Updated 3 years ago
zysxmu / IntraQ
PyTorch implementation of our paper accepted by CVPR 2022 -- IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Sh…
☆33 Updated 3 years ago
aaronserianni / training-free-nas
[ACL'22] Training-free Neural Architecture Search for RNNs and Transformers
☆13 Updated last year
OpenGVLab / LLMPrune-BESA
BESA is a differentiable weight pruning technique for large language models.
☆17 Updated last year
ModelTC / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆39 Updated last year
MingSun-Tse / Awesome-Efficient-ViT
Recent Advances on Efficient Vision Transformers
☆55 Updated 2 years ago
facebookresearch / DepthShrinker
[ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …
☆72 Updated 3 years ago
chenbong / PSS-Net
☆17 Updated 3 years ago
lliai / EMQ-series
[ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
☆28 Updated last year
IST-DASLab / spdy
Code for ICML 2022 paper "SPDY: Accurate Pruning with Speedup Guarantees"
☆20 Updated 2 years ago
thu-nics / MBQ
The code repository of "MBQ: Modality-Balanced Quantization for Large Vision-Language Models"
☆66 Updated 8 months ago