sail-sg / dualformer
☆30Updated 2 years ago
Alternatives and similar repositories for dualformer:
Users who are interested in dualformer are comparing it to the libraries listed below.
- Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.☆28Updated 2 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆46Updated 2 years ago
- [WACV'22] Code repository for the paper "Self-supervised Video Representation Learning with Cross-Stream Prototypical Contrasting", https…☆36Updated 2 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV 2021)☆36Updated 2 years ago
- ☆20Updated 2 years ago
- Code for Motion-aware Contrastive Video Representation Learning via Foreground-background Merging (CVPR 2022)☆46Updated last year
- [CVPR 2022] Cross-Architecture Self-supervised Video Representation Learning☆23Updated 2 years ago
- The Curious Layperson: Fine-Grained Image Recognition without Expert Labels (BMVC 2021 best student paper)☆23Updated 2 years ago
- Tests of different pooling methods used in CNNs for computer vision tasks☆35Updated 4 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆41Updated 5 months ago
- ☆47Updated 2 years ago
- ☆16Updated last year
- [ECCV 2022] Official PyTorch implementation of the paper "Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning"…☆18Updated 2 years ago
- ☆27Updated 2 years ago
- code base for vision transformers☆36Updated 3 years ago
- [CVPR 2022 Oral] The official code for "TransRank: Self-supervised Video Representation Learning via Ranking-based Transformation Recognit…☆18Updated 2 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆52Updated last year
- [arXiv 2023] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Updated last year
- TokenMix: Rethinking Image Mixing for Data Augmentation in Vision Transformers (ECCV 2022)☆93Updated 2 years ago
- Code repo for "FASA: Feature Augmentation and Sampling Adaptation for Long-Tailed Instance Segmentation" (ICCV 2021)☆30Updated 3 years ago
- ☆70Updated last year
- Code and models for the paper Glance-and-Gaze Vision Transformer☆28Updated 3 years ago
- PyTorch Implementation of "Your ViT is Secretly a Hybrid Discriminative-Generative Diffusion Model"☆48Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- ☆54Updated 2 years ago
- Lightweight Transformer for Multi-modal Tasks☆15Updated 2 years ago
- Official PyTorch implementation of "Semantic Diversity Learning for Zero-Shot Multi-label Classification" (ICCV 2021)☆30Updated 2 years ago
- Teach-DETR: Better Training DETR with Teachers☆30Updated 11 months ago
- Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention (CVPR 2023)☆32Updated last year
- Beyond Masking: Demystifying Token-Based Pre-Training for Vision Transformers☆26Updated 2 years ago