saic-fi / xvit_video_transformers
[NeurIPS 2021] Space-time Mixing Attention for Video Transformer
☆15 Updated 2 years ago
Related projects
Alternatives and complementary repositories for xvit_video_transformers
- AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition (ICLR 2021) ☆33 Updated 3 years ago
- [CVPR 2023] Code for action prediction from videos ☆23 Updated 8 months ago
- PyTorch implementation of AAAI 2021 paper: A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization ☆41 Updated 3 years ago
- A simple but efficient transformer model for video action recognition ☆55 Updated 2 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022) ☆26 Updated last year
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization ☆50 Updated last year
- Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 20… ☆26 Updated 3 years ago
- Official PyTorch implementation of "Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning" (CVPR 2021 Oral) ☆86 Updated 3 years ago
- [BMVC 2021] A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark ☆41 Updated 2 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection. ☆45 Updated 2 years ago
- AFNet (NeurIPS 2022) ☆19 Updated 2 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023) ☆36 Updated last month
- Semi-Supervised Action Recognition with Temporal Contrastive Learning ☆56 Updated 8 months ago
- [CVPR 2021] 3D CNNs with Adaptive Temporal Feature Resolutions https://arxiv.org/abs/2011.08652 ☆26 Updated 3 years ago
- The official implementation of 'Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation' (CVPR 2… ☆45 Updated 2 years ago
- Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771 ☆33 Updated last year
- ☆32 Updated 3 years ago
- TCPNet ☆30 Updated 2 years ago
- Implementation of paper "Modeling Multi-Label Action Dependencies for Temporal Action Localization" ☆49 Updated last year
- 【ACMMM'2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning ☆49 Updated 3 years ago
- [CVPR 2022] Official Pytorch Implementation for "Spatio-temporal Relation Modeling for Few-shot Action Recognition". SOTA Results for Few… ☆98 Updated 2 years ago
- Reducing spatial redundancy in video recognition. SOTA computational efficiency. ☆122 Updated 2 years ago
- Implementations of some few-shot action recognition methods. ☆42 Updated 3 years ago
- [CVPR2022] MS-TCT ☆54 Updated 2 years ago
- ☆32 Updated 11 months ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition ☆26 Updated last year
- Code for our CVPR 2022 Paper "Hybrid Relation Guided Set Matching for Few-shot Action Recognition". ☆26 Updated last year
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition ☆48 Updated 2 years ago
- This is the official implementation of Elaborative Rehearsal for Zero-shot Action Recognition (ICCV2021) ☆36 Updated 2 years ago
- ☆25 Updated last year