sail-sg / dualformer
☆31Updated 2 years ago
Alternatives and similar repositories for dualformer
Users who are interested in dualformer are comparing it to the libraries listed below.
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 3 years ago
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25Updated last year
- ☆20Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆16Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 3 years ago
- ☆31Updated 3 years ago
- ☆16Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 2 months ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 3 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- Tests different pooling methods used in CNNs for computer vision tasks☆35Updated 4 years ago
- Turning to Video for Transcript Sorting☆48Updated last year
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆78Updated 2 years ago
- i-mae Pytorch Repo☆20Updated last year
- Teach-DETR: Better Training DETR with Teachers☆31Updated last year
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated 2 years ago
- ☆33Updated 2 years ago
- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022.☆18Updated 3 years ago
- Official pytorch implementation of NeurIPS 2022 paper, TokenMixup☆48Updated 2 years ago
- ☆72Updated last year
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆56Updated last year
- Official Code Release for Container : Context Aggregation Network☆46Updated 3 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆79Updated last year
- [CVPR 2022] Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging☆48Updated last year
- ☆22Updated 3 years ago
- Learning to Mask and Permute Visual Tokens for Vision Transformer Pre-Training☆16Updated 2 weeks ago
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆53Updated last year