Zzzzz1 / CSKD
Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICCV2023/html/Zhao_Cumulative_Spatial_Knowledge_Distillation_for_Vision_Transformers_ICCV_2023_paper.html
☆15 · Updated last year
Alternatives and similar repositories for CSKD:
Users who are interested in CSKD are comparing it to the repositories listed below:
- Official code for Scale Decoupled Distillation ☆37 · Updated 9 months ago
- Official PyTorch implementation of PS-KD ☆83 · Updated 2 years ago
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier". ☆92 · Updated 2 years ago
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation ☆37 · Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR2023) ☆59 · Updated 4 months ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition". ☆51 · Updated 2 years ago
- Convolutional Initialization for Data-Efficient Vision Transformers ☆14 · Updated last year
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022 ☆140 · Updated 2 years ago
- ☆25 · Updated last year
- ☆51 · Updated last year
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023) ☆27 · Updated last year
- Official implementation of the paper "Masked Autoencoders are Efficient Class Incremental Learners" ☆39 · Updated 7 months ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer" ☆103 · Updated last year
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444 ☆110 · Updated 9 months ago
- [ECCV-2022] Official implementation of MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition && Pytorch Implementations of… ☆102 · Updated 2 years ago
- Switchable Online Knowledge Distillation ☆18 · Updated 3 months ago
- (CVPR2023) official code of Decompose, Adjust, Compose: Effective Normalization by Playing with Frequency for Domain Generalization ☆28 · Updated last year
- The official implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar… ☆26 · Updated last month
- Codes for ECCV2022 paper - contrastive deep supervision ☆68 · Updated 2 years ago
- Official implementation of PCS from the paper "Prompt Vision Transformer for Domain Generalization" ☆51 · Updated 2 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc. ☆82 · Updated 10 months ago
- ☆25 · Updated 2 years ago
- ☆58 · Updated 2 years ago
- (NeurIPS 2022) Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty ☆33 · Updated 10 months ago
- ☆47 · Updated last year
- Official code of "Generating Instance-level Prompts for Rehearsal-free Continual Learning (ICCV 2023)" ☆42 · Updated last year
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023 ☆78 · Updated last year
- Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight) ☆65 · Updated 2 months ago
- [CVPR2023] Global and Local Mixture Consistency Cumulative Learning for Long-tailed Visual Recognitions ☆70 · Updated last year
- The official implementation for ALOFT (CVPR 2023). ☆53 · Updated last year