zju-SWJ / RLDLinks
Official implementation for "Knowledge Distillation with Refined Logits".
☆21Updated last year
Alternatives and similar repositories for RLD
Users that are interested in RLD are comparing it to the libraries listed below
Sorting:
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆71Updated last year
- Official code for Scale Decoupled Distillation☆43Updated last year
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation☆46Updated 2 years ago
- The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…☆49Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆34Updated 3 years ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆69Updated 6 months ago
- ☆28Updated 2 years ago
- ☆17Updated 4 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Updated 2 years ago
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors☆19Updated last year
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier".☆101Updated 3 years ago
- ☆25Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆57Updated last year
- The official implementation for ALOFT (CVPR 2023).☆57Updated 2 years ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Updated 3 years ago
- 🔥MixPro: Data Augmentation with MaskMix and Progressive Attention Labeling for Vision Transformer [Official, ICLR 2023]☆22Updated 2 years ago
- [TPAMI-2023] Official implementations of L-MCL: Online Knowledge Distillation via Mutual Contrastive Learning for Visual Recognition☆26Updated 2 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆87Updated last year
- This is the official code for paper: Token Summarisation for Efficient Vision Transformers via Graph-based Token Propagation☆32Updated 2 years ago
- AMTML-KD: Adaptive Multi-teacher Multi-level Knowledge Distillation☆65Updated 4 years ago
- official source code for the Paper: **Long-tailed Visual Recognition via Gaussian Clouded Logit Adjustment** based on Pytorch.☆45Updated 8 months ago
- Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)☆71Updated last year
- [CVPR 2024] Open-Set Domain Adaptation for Semantic Segmentation☆54Updated last year
- [AAAI 2023] Official PyTorch Code for "Curriculum Temperature for Knowledge Distillation"☆182Updated last year
- Switchable Online Knowledge Distillation☆19Updated last year
- PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]☆19Updated last year
- [IJCV2025] https://arxiv.org/abs/2304.04521☆15Updated last year
- [NeurIPS'22] Projector Ensemble Feature Distillation☆30Updated 2 years ago
- You Only Condense Once: Two Rules for Pruning Condensed Datasets (NeurIPS 2023)☆15Updated 2 years ago
- [CVPR-2024] Official implementations of CLIP-KD: An Empirical Study of CLIP Model Distillation☆142Updated 5 months ago