sail-sg / dualformer
☆30Updated 2 years ago
Alternatives and similar repositories for dualformer:
Users that are interested in dualformer are comparing it to the libraries listed below
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- ☆33Updated 2 years ago
- Test different pooling method used in CNN for Computer Vision Task☆35Updated 4 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021)☆36Updated 2 years ago
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- ☆47Updated 2 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆46Updated 2 years ago
- ☆20Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- ☆52Updated 2 years ago
- Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(ICCV, 2021) paper☆30Updated 2 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆48Updated 2 years ago
- ☆16Updated last year
- Turning to Video for Transcript Sorting☆48Updated last year
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Updated last year
- ☆70Updated last year
- ☆32Updated 2 years ago
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆77Updated last year
- ☆16Updated 3 months ago
- A PyTorch implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"☆18Updated 3 years ago
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)☆30Updated 3 years ago
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 2 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆42Updated 5 months ago
- Distribution-Aware Prompt Tuning for Vision-Language Models (ICCV 2023)☆38Updated last year
- 📍 Official pytorch implementation of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS)☆52Updated last year
- Towards a Unified View on Visual Parameter-Efficient Transfer Learning☆26Updated 2 years ago
- [ICLR 23] Contrastive Aligned of Vision to Language Through Parameter-Efficient Transfer Learning☆38Updated last year
- repo for paper titled: Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment (AAAI'24 Oral)☆25Updated 10 months ago
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆23Updated 2 years ago
- Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification. ECCV 2022.☆19Updated 2 years ago