mary-phuong / multiexit-distillationLinks
☆22Updated 5 years ago
Alternatives and similar repositories for multiexit-distillation
Users that are interested in multiexit-distillation are comparing it to the libraries listed below
Sorting:
- A pytorch implement of scalable neural netowrks.☆23Updated 5 years ago
- ☆47Updated 5 years ago
- Knowledge Transfer via Dense Cross-layer Mutual-distillation (ECCV'2020)☆29Updated 5 years ago
- [ICLR 2020] ”Triple Wins: Boosting Accuracy, Robustness and Efficiency Together by Enabling Input-Adaptive Inference“☆24Updated 3 years ago
- A generic code base for neural network pruning, especially for pruning at initialization.☆31Updated 2 years ago
- Implementation "Adapting Auxiliary Losses Using Gradient Similarity" article☆32Updated 6 years ago
- Code for Active Mixup in 2020 CVPR☆23Updated 3 years ago
- TF-FD☆20Updated 2 years ago
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…☆82Updated 3 years ago
- ☆20Updated 2 years ago
- ☆31Updated 5 years ago
- [CVPR 2021] Contrastive Neural Architecture Search with Neural Architecture Comparators☆41Updated 3 years ago
- ☆57Updated 4 years ago
- ☆27Updated 4 years ago
- Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better perfo…☆90Updated 2 years ago
- Codes for Understanding Architectures Learnt by Cell-based Neural Architecture Search☆27Updated 5 years ago
- Code for ViTAS_Vision Transformer Architecture Search☆50Updated 4 years ago
- Codebase for the paper "Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning"☆17Updated 4 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆68Updated 2 years ago
- Codes for paper "Few Shot Network Compression via Cross Distillation", AAAI 2020.☆32Updated 5 years ago
- ☆27Updated 2 years ago
- Code for our ICLR'2021 paper "DrNAS: Dirichlet Neural Architecture Search"☆43Updated 4 years ago
- [NeurIPS 2020] "Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free" by Haotao Wang*, Tianlong C…☆44Updated 3 years ago
- PyTorch implementation of Weighted Batch-Normalization layers☆37Updated 5 years ago
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"☆21Updated 4 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆21Updated 4 years ago
- Code for CVPR2021 paper: MOOD: Multi-level Out-of-distribution Detection☆38Updated last year
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Updated 2 years ago
- Source code for 'Knowledge Distillation via Instance Relationship Graph'☆30Updated 6 years ago
- Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021)☆19Updated 4 years ago