saic-fi / xvit_video_transformers
[NeurIPS 2021] Space-time Mixing Attention for Video Transformer
☆15Updated 2 years ago
Alternatives and similar repositories for xvit_video_transformers:
Users who are interested in xvit_video_transformers are comparing it to the repositories listed below
- AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition (ICLR 2021)☆33Updated 3 years ago
- [CVPR 2023] Code for action prediction from videos☆23Updated 11 months ago
- AFNet (NeurIPS 2022)☆19Updated 2 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆41Updated 4 months ago
- Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 20…☆26Updated 3 years ago
- A simple but efficient transformer model for video action recognition☆58Updated 2 years ago
- Code and model for "Multi-dataset Training of Transformers for Robust Action Recognition", NeurIPS 2022 Spotlight☆20Updated last year
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆49Updated last year
- Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771☆33Updated last year
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆48Updated 2 years ago
- Official PyTorch implementation of "Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning" (CVPR 2021 Oral)☆88Updated 3 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆46Updated 2 years ago
- ☆52Updated 2 years ago
- Official PyTorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Updated 3 years ago
- ☆33Updated 3 years ago
- Official Implementation of the paper "Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Transl…☆33Updated 2 years ago
- [CVPR 2023] Enlarge Instance-specific and Class-specific Information for Open-set Action Recognition☆28Updated last year
- Implementation of paper "Modeling Multi-Label Action Dependencies for Temporal Action Localization"☆50Updated last year
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆33Updated 2 weeks ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆33Updated 3 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection☆83Updated 2 years ago
- [CVPR 2021] 3D CNNs with Adaptive Temporal Feature Resolutions https://arxiv.org/abs/2011.08652☆26Updated 3 years ago
- Code associated with "M2A: Motion Aware Attention for Accurate Video Action Recognition"☆12Updated 3 years ago
- Unsupervised Film Genre Classification using Spatio-Temporal Contrastive Learning☆32Updated last year
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Updated 2 years ago
- Code for CVPR'21 paper "Learning Asynchronous and Sparse Human-Object Interaction in Videos".☆24Updated 3 years ago
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆34Updated last year
- 【ACMMM'2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning☆42Updated 3 years ago
- Repository for ECCV 2022 paper "Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition"☆24Updated last year
- [ECCV 2022] Official PyTorch Implementation of the paper: "Semi-Supervised Temporal Action Detection with Proposal-Free Masking"☆21Updated last year