tobna / WhatTransformerToFavorLinks
Github repository for the paper Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers.
☆30Updated 6 months ago
Alternatives and similar repositories for WhatTransformerToFavor
Users that are interested in WhatTransformerToFavor are comparing it to the libraries listed below
Sorting:
- (AAAI 2023 Oral) Pytorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- [ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"☆81Updated 3 years ago
- Training ImageNet / CIFAR models with sota strategies and fancy techniques such as ViT, KD, Rep, etc.☆84Updated last year
- The offical implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…☆44Updated 9 months ago
- ☆44Updated 2 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Updated 2 years ago
- ☆23Updated last year
- Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022.☆105Updated 2 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆91Updated 2 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training☆45Updated 4 years ago
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors☆15Updated last year
- ☆28Updated last year
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Updated last year
- Scattering Vision Transformer☆53Updated last year
- Learning Efficient Vision Transformers via Fine-Grained Manifold Distillation. NeurIPS 2022.☆32Updated 2 years ago
- ☆27Updated 2 years ago
- The official implementation for ALOFT (CVPR 2023).☆56Updated 2 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆66Updated 3 years ago
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆45Updated last year
- ☆47Updated 2 years ago
- [BMVC 2022] Official repository for "How to Train Vision Transformer on Small-scale Datasets?"☆162Updated last year
- This repository contains the pytorch code for our work IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…☆86Updated 11 months ago
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆69Updated last year
- Official Pytorch implementation of Super Vision Transformer (IJCV)☆43Updated 2 years ago
- ☆63Updated 4 years ago
- Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022☆149Updated 2 years ago
- Official code for Scale Decoupled Distillation☆41Updated last year
- ☆42Updated last year
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago
- Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer