sail-sg / dualformer
☆29Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for dualformer
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- ☆16Updated 3 years ago
- ☆16Updated last year
- Teach-DETR: Better Training DETR with Teachers☆29Updated 8 months ago
- ☆32Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 3 years ago
- Benchmarking Attention Mechanism in Vision Transformers.☆16Updated 2 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆45Updated 2 years ago
- Turning to Video for Transcript Sorting☆46Updated last year
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 2 years ago
- ☆27Updated 2 years ago
- Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".☆16Updated last year
- open source the research work for published on arxiv. https://arxiv.org/abs/2106.02689☆51Updated 2 years ago
- [ICLR2024] Exploring Target Representations for Masked Autoencoders☆51Updated 10 months ago
- code base for vision transformers☆36Updated 2 years ago
- ☆20Updated last year
- AFNet(NeurIPS 2022)☆19Updated 2 years ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 2 years ago
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆22Updated 2 years ago
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆37Updated 2 years ago
- ☆32Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated last year
- [ICLR2024] The official implementation of paper "UniAdapter: Unified Parameter-Efficient Transfer Learning for Cross-modal Modeling", by …☆70Updated 9 months ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆14Updated last year
- PyTorch reimplementation of "A simple, efficient and scalable contrastive masked autoencoder for learning visual representations".☆36Updated last year