WangYZ1608 / Knowledge-Distillation-via-ND
The official implementation for paper: Improving Knowledge Distillation via Regularizing Feature Norm and Direction
☆14Updated last year
Alternatives and similar repositories for Knowledge-Distillation-via-ND:
Users that are interested in Knowledge-Distillation-via-ND are comparing it to the libraries listed below
- ☆83Updated last year
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444☆108Updated 9 months ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆82Updated 10 months ago
- [CVPR-2022] Official implementation for "Knowledge Distillation with the Reused Teacher Classifier".☆92Updated 2 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆73Updated last year
- [CVPR 2023] This repository includes the official implementation our paper "Masked Autoencoders Enable Efficient Knowledge Distillers"☆103Updated last year
- Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954)☆46Updated last year
- [AAAI 2024] Understanding the Role of the Projector in Knowledge Distillation☆14Updated 11 months ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆140Updated 2 years ago
- 'NKD and USKD' (ICCV 2023) and 'ViTKD' (CVPRW 2024)☆221Updated last year
- [AAAI 2023] Official PyTorch Code for "Curriculum Temperature for Knowledge Distillation"☆163Updated last month
- [ECCV 2024] Isomorphic Pruning for Vision Models☆61Updated 5 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆79Updated 11 months ago
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation☆37Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆59Updated 3 months ago
- Official implementation of paper "Masked Distillation with Receptive Tokens", ICLR 2023.☆68Updated last year
- [ECCV-2022] Official implementation of MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition && Pytorch Implementations of…☆102Updated 2 years ago
- 1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…☆216Updated 4 months ago
- ☆58Updated last year
- ☆23Updated 2 years ago
- ☆25Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆50Updated 2 months ago
- Official code for Scale Decoupled Distillation☆37Updated 9 months ago
- The official implementation of LumiNet: The Bright Side of Perceptual Knowledge Distillation https://arxiv.org/abs/2310.03669☆20Updated 10 months ago
- [CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".☆50Updated 2 years ago
- [ICCV 23]An approach to enhance the efficiency of Vision Transformer (ViT) by concurrently employing token pruning and token merging tech…☆92Updated last year
- [CVPR 2023 Highlight] This is the official implementation of "Stitchable Neural Networks".☆246Updated last year
- CrossKD: Cross-Head Knowledge Distillation for Dense Object Detection☆145Updated last year
- ImageNet-1K data download, processing for using as a dataset☆77Updated last year
- Official implementation for paper "LightViT: Towards Light-Weight Convolution-Free Vision Transformers"☆138Updated 2 years ago