sail-sg / dualformerLinks
☆31Updated 2 years ago
Alternatives and similar repositories for dualformer
Users that are interested in dualformer are comparing it to the libraries listed below
Sorting:
- ☆33Updated 2 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Updated 3 years ago
- ☆47Updated 2 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 3 years ago
- ☆16Updated last year
- ☆55Updated 2 years ago
- ☆72Updated last year
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆47Updated 2 years ago
- [WACV2025 Oral] DeepMIM: Deep Supervision for Masked Image Modeling☆53Updated 3 weeks ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆49Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 3 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 4 years ago
- Code for paper "Point and Ask: Incorporating Pointing into Visual Question Answering"☆19Updated 2 years ago
- CVPR2022:Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency☆17Updated 2 years ago
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆23Updated 2 years ago
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Updated last year
- Turning to Video for Transcript Sorting☆48Updated last year
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- ☆52Updated 2 years ago
- Code for CVPR2023 paper "Collaborative Noisy Label Cleaner: Learning Scene-aware Trailers for Multi-modal Highlight Detection in Movies"☆17Updated 2 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆46Updated 7 months ago
- code base for vision transformers☆36Updated 3 years ago
- ☆23Updated 2 years ago
- [CVPR 2022] "The Principle of Diversity: Training Stronger Vision Transformers Calls for Reducing All Levels of Redundancy" by Tianlong C…☆25Updated 3 years ago
- ICCV2023: Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning☆41Updated last year
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Updated last year
- ☆20Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆35Updated last year
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 2 years ago