JiamingLv / WKDLinks
The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://arxiv.org/abs/2412.08139
☆49Updated last year
Alternatives and similar repositories for WKD
Users that are interested in WKD are comparing it to the libraries listed below
Sorting:
- Official code for Scale Decoupled Distillation☆43Updated last year
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆71Updated last year
- Official PyTorch(MMCV) implementation of “Adversarial AutoMixup” (ICLR 2024 spotlight)☆71Updated last year
- [CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections☆57Updated last year
- The official implementation for ALOFT (CVPR 2023).☆57Updated 2 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆87Updated last year
- The offical implement of ImbSAM (Imbalanced-SAM)☆26Updated last year
- This is an official implementation of our NeurIPS 2022 paper "Bridging the Gap Between Vision Transformers and Convolutional Neural Netwo…☆63Updated 5 months ago
- The official codes of our CVPR-2023 paper: Sharpness-Aware Gradient Matching for Domain Generalization☆79Updated 2 years ago
- [ICLR 2023] Masked Frequency Modeling for Self-Supervised Visual Pre-Training☆81Updated 2 years ago
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆155Updated 3 years ago
- Codes for ECCV2022 paper - contrastive deep supervision☆69Updated 3 years ago
- Pytorch implementation of Split to Merge: Unifying Separated Modalities for Unsupervised Domain Adaptation (CVPR'24)☆35Updated 4 months ago
- PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444☆135Updated last year
- Official implementation of FullMatch (CVPR2023)☆44Updated 6 months ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆21Updated last year
- ☆45Updated 2 years ago
- ☆56Updated last year
- Switchable Online Knowledge Distillation☆19Updated last year
- CVPR 2023, Class Attention Transfer Based Knowledge Distillation☆46Updated 2 years ago
- Official implementation of the paper "Masked Autoencoders are Efficient Class Incremental Learners"☆45Updated last year
- Convolutional Initialization for Data-Efficient Vision Transformers☆16Updated last month
- 【NeurIPS 2024】Official implementation of "Visual Fourier Prompt Tuning"☆39Updated last year
- [CVPR 2024] Code for our Paper "DeiT-LT: Distillation Strikes Back for Vision Transformer training on Long-Tailed Datasets"☆47Updated last year
- ☆17Updated 4 years ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- Official implementation of PCS in essay "Prompt Vision Transformer for Domain Generalization"☆50Updated 3 years ago
- ☆28Updated 2 years ago
- SimMatchV2: Semi-Supervised Learning with Graph Consistency☆22Updated 2 years ago