tobna / WhatTransformerToFavor
GitHub repository for the paper Which Transformer to Favor: A Comparative Analysis of Efficiency in Vision Transformers.
☆33Updated 9 months ago
Alternatives and similar repositories for WhatTransformerToFavor
Users who are interested in WhatTransformerToFavor are comparing it to the repositories listed below.
- EATFormer: Improving Vision Transformer Inspired by Evolutionary Algorithm☆35Updated 3 years ago
- Training ImageNet / CIFAR models with SOTA strategies and fancy techniques such as ViT, KD, Rep, etc.☆87Updated last year
- ☆28Updated 2 years ago
- The official implementation of [NeurIPS2024] Wasserstein Distance Rivals Kullback-Leibler Divergence for Knowledge Distillation https://ar…☆49Updated last year
- (AAAI 2023 Oral) PyTorch implementation of "CF-ViT: A General Coarse-to-Fine Method for Vision Transformer"☆106Updated 2 years ago
- A Close Look at Spatial Modeling: From Attention to Convolution☆92Updated 3 years ago
- PyTorch implementation of our paper accepted by ECCV2022 -- Knowledge Condensation Distillation https://arxiv.org/abs/2207.05409☆30Updated 3 years ago
- ☆23Updated last year
- ☆48Updated 2 years ago
- Denoising Masked Autoencoders Help Robust Classification.☆67Updated 2 years ago
- [ECCV 2022] Implementation of the paper "Locality Guidance for Improving Vision Transformers on Tiny Datasets"☆82Updated 3 years ago
- Official PyTorch implementation of Super Vision Transformer (IJCV)☆43Updated 2 years ago
- Official PyTorch implementation of our ECCV 2022 paper "Sliced Recursive Transformer"☆66Updated 3 years ago
- ☆43Updated 2 years ago
- [ICML2024] DetKDS: Knowledge Distillation Search for Object Detectors☆19Updated last year
- [ECCV 2022] AMixer: Adaptive Weight Mixing for Self-attention Free Vision Transformers☆29Updated 3 years ago
- PyTorch code and checkpoints release for VanillaKD: https://arxiv.org/abs/2305.15781☆76Updated 2 years ago
- [ICML 2024] Official PyTorch implementation of "SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-paramete…☆108Updated last year
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Updated 2 years ago
- Code for You Only Cut Once: Boosting Data Augmentation with a Single Cut, ICML 2022.☆105Updated 2 years ago
- ☆36Updated 2 years ago
- ResMLP: Feedforward networks for image classification with data-efficient training☆45Updated 4 years ago
- Code for 'Multi-level Logit Distillation' (CVPR2023)☆71Updated last year
- This repository contains the PyTorch code for our IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…☆94Updated last year
- Unified Architecture Search with Convolution, Transformer, and MLP (ECCV 2022)☆53Updated 3 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆55Updated 8 months ago
- [ECCV 2022] EdgeViT: Competing Light-weight CNNs on Mobile Devices with Vision Transformers☆114Updated 2 years ago
- We propose a lossless compression algorithm based on the NTK matrix for DNN. The compressed network yields asymptotically the same NTK a…☆26Updated 2 years ago
- LoMaR (Efficient Self-supervised Vision Pretraining with Local Masked Reconstruction)☆66Updated 9 months ago
- [AAAI 2022] This is the official PyTorch implementation of "Less is More: Pay Less Attention in Vision Transformers"☆97Updated 3 years ago