ZuchniakK / MTKDLinks
Multi-Teacher Knowledge Distillation, code for my PhD dissertation. I used knowledge distillation as a decision-fusion and compressing mechanism for ensemble models.
☆22Updated 2 years ago
Alternatives and similar repositories for MTKD
Users that are interested in MTKD are comparing it to the libraries listed below
Sorting:
- This is the implementation for the ICME-2023 paper (Adaptive Multi-Teacher Knowledge Distillation with Meta-Learning).☆29Updated 2 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆84Updated last year
- [AAAI 2023] Official PyTorch Code for "Curriculum Temperature for Knowledge Distillation"☆179Updated 10 months ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆149Updated 2 years ago
- Wavelet-Attention CNN for Image Classification☆31Updated 3 years ago
- An official codebase of paper "Revisiting Sparse Convolutional Model for Visual Recognition"☆125Updated 2 years ago
- Re-implementation of Online Label Smoothing.☆20Updated 4 years ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆84Updated 2 years ago
- This resposity maintains a collection of important papers on knowledge distillation (awesome-knowledge-distillation)).☆80Updated 6 months ago
- This is the implementation for the ICASSP-2022 paper (Confidence-Aware Multi-Teacher Knowledge Distillation).☆63Updated 3 years ago
- iFormer: Inception Transformer☆247Updated 2 years ago
- AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation☆62Updated 4 years ago
- Unofficial Implementation of MLP-Mixer, gMLP, resMLP, Vision Permutator, S2MLP, S2MLPv2, RaftMLP, HireMLP, ConvMLP, AS-MLP, SparseMLP, Co…☆169Updated 3 years ago
- Probabilistic Contrastive Learning for Domain Adaptation☆14Updated last year
- Code for "Dual Focal Loss for Calibration" (ICML 2023)☆32Updated 5 months ago
- A pytorch implementation of paper 'Be Your Own Teacher: Improve the Performance of Convolutional Neural Networks via Self Distillation', …☆181Updated 3 years ago
- 对卷积神经网络提取的每一层特征用t-SNE进行降维可视化☆22Updated 3 years ago
- ☆16Updated 4 years ago
- ☆151Updated last year
- the code for WaveCNet☆134Updated last year
- [NeurIPS2023]Lightweight Vision Transformer with Bidirectional Interaction☆26Updated last year
- AAAI 2022 papers with code☆37Updated 3 years ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆26Updated last year
- ☆63Updated 4 years ago
- Elsevier Templates-Latex☆63Updated 4 months ago
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆69Updated last year
- Code for Paper "Self-Distillation from the Last Mini-Batch for Consistency Regularization"☆43Updated 3 years ago
- Vision Transformers with Hierarchical Attention☆102Updated last month
- ☆56Updated 4 years ago
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier".☆100Updated 3 years ago