roymiles / Simple-Recipe-DistillationLinks

[AAAI 2024] Understanding the Role of the Projector in Knowledge Distillation

☆18

Alternatives and similar repositories for Simple-Recipe-Distillation

Users that are interested in Simple-Recipe-Distillation are comparing it to the libraries listed below

Sorting:

Hao840 / OFAKD
PyTorch code and checkpoints release for OFA-KD: https://arxiv.org/abs/2310.19444
☆129Updated last year
WangYZ1608 / Knowledge-Distillation-via-ND
The official implementation for paper: Improving Knowledge Distillation via Regularizing Feature Norm and Direction
☆22Updated 2 years ago
hunto / image_classification_sota
Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.
☆82Updated last year
roymiles / VkD
[CVPR 2024] VkD : Improving Knowledge Distillation using Orthogonal Projections
☆55Updated 9 months ago
VainF / Isomorphic-Pruning
[ECCV 2024] Isomorphic Pruning for Vision Models
☆73Updated last year
Hao840 / vanillaKD
PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781
☆75Updated last year
Jin-Ying / Multi-Level-Logit-Distillation
Code for 'Multi-level Logit Distillation' (CVPR2023)
☆67Updated 10 months ago
hunto / DiffKD
Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023
☆88Updated last year
shicaiwei123 / SDD-CVPR2024
Official code for Scale Decoupled Distillation
☆41Updated last year
guoyang9 / PELA
PELA: Learning Parameter-Efficient Models with Low-Rank Approximation [CVPR 2024]
☆18Updated last year
LeapLabTHU / EfficientTrain
1.5−3.0× lossless training or pre-training speedup. An off-the-shelf, easy-to-implement algorithm for the efficient training of foundatio…
☆222Updated 11 months ago
xinghaochen / SLAB
[ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…
☆107Updated 11 months ago
hunto / DIST_KD
Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022
☆147Updated 2 years ago
rayleizhu / GLMix
[NeurIPS 2024] official code release for our paper "Revisiting the Integration of Convolution and Attention for Vision Backbone".
☆40Updated 6 months ago
ChenhongyiYang / PlainMamba
[BMVC 2024] PlainMamba: Improving Non-hierarchical Mamba in Visual Recognition
☆79Updated 4 months ago
altair199797 / LowFormer
☆29Updated 4 months ago
zhangxiaosong18 / hivit
☆67Updated 2 years ago
Jasonlee1995 / ImageNet-1K
ImageNet-1K data download, processing for using as a dataset
☆106Updated 2 years ago
OPTML-Group / Robust-MoE-CNN
[ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Hua…
☆60Updated last year
MengLcool / AdaViT
[CVPR-22] This is the official implementation of the paper "Adavit: Adaptive vision transformers for efficient image recognition".
☆55Updated 2 years ago
aleemsidra / ConvLoRA
This repository contains the pytorch code for our work IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…
☆80Updated 9 months ago
uzh-rpg / svit
Official implementation of "SViT: Revisiting Token Pruning for Object Detection and Instance Segmentation"
☆34Updated last year
JieShibo / PETL-ViT
[ICCV 2023 & AAAI 2023] Binary Adapters & FacT, [Tech report] Convpass
☆192Updated 2 years ago
IemProg / MiMi
🔥 🔥 [WACV2024] Mini but Mighty: Finetuning ViTs with Mini Adapters
☆20Updated last year
kostas1515 / AGLU
[ECCV2024 - Oral] Adaptive Parametric Activation
☆53Updated 5 months ago
GATECH-EIC / Castling-ViT
[CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference
☆30Updated last year
NUS-HPC-AI-Lab / Dynamic-Tuning
The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"
☆46Updated 7 months ago
YuqiYang213 / MLoRE
Project Page for "Multi-Task Dense Prediction via Mixture of Low-Rank Experts"
☆82Updated 2 months ago
GzyAftermath / CAT-KD
CVPR 2023, Class Attention Transfer Based Knowledge Distillation
☆44Updated 2 years ago
khawar-islam / diffuseMix
Official PyTorch implementation of DiffuseMix : Label-Preserving Data Augmentation with Diffusion Models (CVPR'2024)
☆120Updated 4 months ago