Zzzzz1 / CSKD
Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICCV2023/html/Zhao_Cumulative_Spatial_Knowledge_Distillation_for_Vision_Transformers_ICCV_2023_paper.html
☆15Updated last year
Alternatives and similar repositories for CSKD:
Users who are interested in CSKD are comparing it to the repositories listed below
- Official code for Scale Decoupled Distillation☆40Updated last year
- Official PyTorch implementation of PS-KD☆85Updated 2 years ago
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier".☆93Updated 2 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆63Updated 6 months ago
- Convolutional Initialization for Data-Efficient Vision Transformers☆14Updated last year
- Switchable Online Knowledge Distillation☆18Updated 5 months ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆103Updated last year
- Official implementation of the paper "Function-Consistent Feature Distillation" (ICLR 2023)☆29Updated last year
- (NeurIPS 2022) Understanding Cross-Domain Few-Shot Learning Based on Domain Similarity and Few-Shot Difficulty☆33Updated last year
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation☆43Updated last year
- Codes for ECCV2022 paper - contrastive deep supervision☆69Updated 2 years ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆52Updated 2 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆74Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆143Updated 2 years ago
- [CVPR 2023] This repository includes the official implementation of our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆105Updated last year
- Code of "Robustifying Token Attention for Vision Transformers"☆17Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆52Updated 5 months ago
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444☆119Updated last year
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆82Updated last year
- [ECCV-2022] Official implementation of MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition && Pytorch Implementations of…☆106Updated 2 years ago
- Pytorch Implementation of Task Adaptive Parameter Sharing for Multi-Task Learning (CVPR 2022)☆24Updated last year
- The official implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…☆34Updated 4 months ago
- [NeurIPS'22] Projector Ensemble Feature Distillation☆29Updated last year
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆96Updated 2 years ago
- Code release for Your "An Erudite Fine-Grained Visual Classification Model" (CVPR 2023)☆17Updated last year
- Adaptive Token Sampling for Efficient Vision Transformers (ECCV 2022 Oral Presentation)☆101Updated 11 months ago