mary-phuong / multiexit-distillationLinks

☆23

Alternatives and similar repositories for multiexit-distillation

Users that are interested in multiexit-distillation are comparing it to the libraries listed below

Sorting:

ArchipLab-LinfengZhang / pytorch-scalable-neural-networks
A pytorch implement of scalable neural netowrks.
☆23Updated 5 years ago
kalviny / IMTA
☆47Updated 5 years ago
dwang181 / active-mixup
Code for Active Mixup in 2020 CVPR
☆23Updated 3 years ago
blackfeather-wang / InfoPro-Pytorch
Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better perfo…
☆90Updated 3 years ago
xuguodong03 / UNIXKD
☆27Updated 2 years ago
szkocot / Adapting-Auxiliary-Losses-Using-Gradient-Similarity
Implementation "Adapting Auxiliary Losses Using Gradient Similarity" article
☆32Updated 6 years ago
VITA-Group / Nasty-Teacher
[ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…
☆82Updated 3 years ago
VITA-Group / triple-wins
[ICLR 2020] ”Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference“
☆24Updated 3 years ago
bellymonster / Weighted-Soft-Label-Distillation
☆58Updated 4 years ago
haolibai / APS-channel-search
Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020
☆21Updated 5 years ago
MingSun-Tse / Smile-Pruning
A generic code base for neural network pruning, especially for pruning at initialization.
☆31Updated 3 years ago
xiusu / ViTAS
Code for ViTAS_Vision Transformer Architecture Search
☆50Updated 4 years ago
sundw2014 / DCM
Knowledge Transfer via Dense Cross-layer Mutual-distillation (ECCV'2020)
☆30Updated 5 years ago
EkdeepSLubana / BeyondBatchNorm
Codebase for the paper "Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning"
☆17Updated 4 years ago
megvii-model / RLNAS
☆20Updated 2 years ago
shuyao95 / Understanding-NAS
Codes for Understanding Architectures Learnt by Cell-based Neural Architecture Search
☆27Updated 5 years ago
lliai / Teacher-free-Distillation
TF-FD
☆20Updated 3 years ago
tianyizhou / DoCL
Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021)
☆19Updated 4 years ago
EkdeepSLubana / flowandprune
Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"
☆20Updated 4 years ago
VITA-Group / CV_LTH_Pre-training
[CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…
☆68Updated 2 years ago
meijieru / fast_advprop
[ICLR 2022]: Fast AdvProp
☆35Updated 3 years ago
chenyaofo / CTNAS
[CVPR 2021] Contrastive Neural Architecture Search with Neural Architecture Comparators
☆40Updated 3 years ago
Piyush-555 / GaussianDistillation
Data-free knowledge distillation using Gaussian noise (NeurIPS paper)
☆15Updated 2 years ago
AnTuo1998 / AE-KD
☆27Updated 4 years ago
LTH14 / FSKD
☆31Updated 5 years ago
VITA-Group / Once-for-All-Adversarial-Training
[NeurIPS 2020] "Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free" by Haotao Wang*, Tianlong C…
☆44Updated 3 years ago
VITA-Group / WeakNAS
[NeurIPS 2021] “Stronger NAS with Weaker Predictors“, Junru Wu, Xiyang Dai, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Ye Yu, Zhangyang W…
☆27Updated 3 years ago
haolibai / Cross-Distillation
Codes for paper "Few Shot Network Compression via Cross Distillation", AAAI 2020.
☆31Updated 5 years ago
szq0214 / S2-BNN
S2-BNN: Bridging the Gap Between Self-Supervised Real and 1-bit Neural Networks via Guided Distribution Calibration (CVPR 2021)
☆64Updated 4 years ago
zju-vipa / TransferbilityFromAttributionMaps
(NeurIPS 2019) Deep Model Transferbility from Attribution Maps
☆20Updated 6 years ago