tobna / WhatTransformerToFavor
GitHub repository for the paper "Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers".
☆31 · Updated 8 months ago
Alternatives and similar repositories for WhatTransformerToFavor
Users interested in WhatTransformerToFavor are comparing it to the libraries listed below.
- The official implementation of [NeurIPS 2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar… ☆45 · Updated 11 months ago
- ☆28 · Updated 2 years ago
- Code for 'Multi-level Logit Distillation' (CVPR 2023) ☆70 · Updated last year
- Training ImageNet / CIFAR models with SOTA strategies and fancy techniques such as ViT, KD, Rep, etc. ☆86 · Updated last year
- [ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets" ☆82 · Updated 3 years ago
- (AAAI 2023 Oral) PyTorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer" ☆106 · Updated 2 years ago
- [ICML 2024] DetKDS: Knowledge Distillation Search for Object Detectors ☆17 · Updated last year
- ☆63 · Updated 4 years ago
- ☆47 · Updated 2 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC… ☆20 · Updated 2 years ago
- ☆23 · Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022. ☆32 · Updated 3 years ago
- Official implementation of the paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022 ☆153 · Updated 2 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training ☆45 · Updated 4 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer" ☆66 · Updated 3 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781 ☆76 · Updated 2 years ago
- Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022 ☆105 · Updated 2 years ago
- Official implementation of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer ☆73 · Updated 3 years ago
- Official code for Scale Decoupled Distillation ☆43 · Updated last year
- Convolutional Initialization for Data-Efficient Vision Transformers ☆16 · Updated last year
- ☆43 · Updated 2 years ago
- [ECCV 2022] Official implementation of MixSKD: Self-Knowledge Distillation from Mixup for Image Recognition && PyTorch Implementations of… ☆110 · Updated 3 years ago
- [AAAI 2022] Official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers" ☆97 · Updated 3 years ago
- [CVPR 2023] Castling-ViT: Compressing Self-Attention via Switching Towards Linear-Angular Attention During Vision Transformer Inference ☆30 · Updated last year
- A Close Look at Spatial Modeling: From Attention to Convolution ☆91 · Updated 2 years ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete… ☆109 · Updated last year
- Official PyTorch implementation of Super Vision Transformer (IJCV) ☆43 · Updated 2 years ago
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution" ☆12 · Updated 2 years ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?" ☆166 · Updated 2 years ago
- [ECCV 2022] EdgeViT: Competing Light-weight CNNs on Mobile Devices with Vision Transformers ☆112 · Updated 2 years ago