mary-phuong / multiexit-distillation
☆23Updated 6 years ago
Alternatives and similar repositories for multiexit-distillation
Users who are interested in multiexit-distillation are comparing it to the repositories listed below
- ☆47Updated 6 years ago
- Code for ViTAS: Vision Transformer Architecture Search☆51Updated 4 years ago
- Code for Active Mixup (CVPR 2020)☆23Updated 3 years ago
- [NeurIPS 2021] “Stronger NAS with Weaker Predictors”, Junru Wu, Xiyang Dai, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Ye Yu, Zhangyang W…☆27Updated 3 years ago
- A PyTorch implementation of scalable neural networks.☆23Updated 5 years ago
- [ICLR 2020] “Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference”☆24Updated 4 years ago
- [CVPR 2021] Contrastive Neural Architecture Search with Neural Architecture Comparators☆40Updated 3 years ago
- Code for our ICLR'2021 paper "DrNAS: Dirichlet Neural Architecture Search"☆43Updated 4 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…☆82Updated 4 years ago
- Knowledge Transfer via Dense Cross-layer Mutual-distillation (ECCV'2020)☆30Updated 5 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆22Updated 5 years ago
- ☆20Updated 2 years ago
- Code for the paper "Few Shot Network Compression via Cross Distillation", AAAI 2020.☆31Updated 5 years ago
- A generic code base for neural network pruning, especially for pruning at initialization.☆31Updated 3 years ago
- ☆27Updated 3 years ago
- Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021)☆19Updated 4 years ago
- [NeurIPS'2019] Shupeng Gui, Haotao Wang, Haichuan Yang, Chen Yu, Zhangyang Wang, Ji Liu, “Model Compression with Adversarial Robustness: …☆49Updated 4 years ago
- [NeurIPS 2020] "Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free" by Haotao Wang*, Tianlong C…☆44Updated 4 years ago
- Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better perfo…☆90Updated 3 years ago
- Code for "Understanding Architectures Learnt by Cell-based Neural Architecture Search"☆28Updated 5 years ago
- TF-FD☆20Updated 3 years ago
- Implementation for NAT.☆58Updated 5 years ago
- ☆57Updated 4 years ago
- Paper collection about model compression and acceleration: Pruning, Quantization, Knowledge Distillation, Low Rank Factorization, etc☆25Updated 5 years ago
- Code for the paper "Training CNNs with Selective Allocation of Channels" (ICML 2019)☆25Updated 6 years ago
- "Understanding and Accelerating Neural Architecture Search with Training-Free and Theory-Grounded Metrics" by Wuyang Chen, Xinyu Gong, Yu…☆27Updated 2 years ago
- Implementation of the article "Adapting Auxiliary Losses Using Gradient Similarity"☆32Updated 6 years ago
- ☆27Updated 4 years ago
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Updated 2 years ago
- Accompanying code for the paper "Zero-shot Knowledge Transfer via Adversarial Belief Matching"☆144Updated 5 years ago