sutd-visual-computing-group / LS-KD-compatibilityLinks
[ICML 2022] This work investigates the compatibility between label smoothing (LS) and knowledge distillation (KD). We suggest to use an LS-trained teacher with a low-temperature transfer to render high performance students.
☆11Updated 2 years ago
Alternatives and similar repositories for LS-KD-compatibility
Users that are interested in LS-KD-compatibility are comparing it to the libraries listed below
Sorting:
- ☆14Updated 2 years ago
- Information Bottleneck Approach to Spatial Attention Learning, IJCAI2021☆15Updated 4 years ago
- [ICLR 2023] “ Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian…☆24Updated 2 years ago
- TF-FD☆20Updated 2 years ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Updated 2 years ago
- Dual Adaptive Representation Alignment for Cross-domain Few-shot Learning TPAMI 2023☆20Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- ☆29Updated 3 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Updated last year
- Repository containing code for blockwise SSL training☆29Updated 7 months ago
- Prior Knowledge Guided Unsupervised Domain Adaptation (ECCV 2022)☆17Updated 2 years ago
- ☆19Updated 3 years ago
- ☆12Updated 3 years ago
- Switchable Online Knowledge Distillation☆18Updated 7 months ago
- ☆10Updated last year
- NeurIPS 2022: Estimating Noise Transition Matrix with Label Correlations for Noisy Multi-Label Learning☆17Updated 2 years ago
- Pytorch implementation (TPAMI 2023) - Training Compact CNNs for Image Classification using Dynamic-coded Filter Fusion☆19Updated 2 years ago
- Learning with Noisy Labels, Label Noise, ICML 2021☆44Updated 2 years ago
- Official codebase for our paper "Joslim: Joint Widths and Weights Optimization for Slimmable Neural Networks"☆12Updated 3 years ago
- ☆13Updated 2 years ago
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]☆21Updated last year
- Code for the ICML 2021 paper "Sharing Less is More: Lifelong Learning in Deep Networks with Selective Layer Transfer"☆11Updated 3 years ago
- Learning from Limited and Imperfect Data (L2ID): Classification Challenges☆18Updated 4 years ago
- [BMVC 2022] Information Theoretic Representation Distillation☆18Updated last year
- ☆13Updated 3 years ago
- Code for "Dual Focal Loss for Calibration" (ICML 2023)☆32Updated last month
- This is the implementation of our CVPR'23 paper "Class-Conditional Sharpness-Aware Minimization for Deep Long-Tailed Recognition".☆20Updated last year
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆29Updated last year
- This repo is the official megengine implementation of the ECCV2022 paper: Efficient One Pass Self-distillation with Zipf's Label Smoothin…☆26Updated 2 years ago
- [ICASSP 2020] Code release of paper 'Heterogeneous Domain Generalization via Domain Mixup'☆26Updated 4 years ago