saic-fi / xvit_video_transformers
[NeurIPS 2021] Space-time Mixing Attention for Video Transformer
☆15Updated 3 years ago
Alternatives and similar repositories for xvit_video_transformers:
Users that are interested in xvit_video_transformers are comparing it to the libraries listed below
- AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition (ICLR 2021)☆34Updated 4 years ago
- [CVPR 2023] Code for action prediction from videos☆25Updated last year
- Official PyTorch implementation of "Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning" (CVPR 2021 Oral)☆89Updated 3 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆44Updated 6 months ago
- A simple but efficient transformer model for video action recognition☆58Updated 2 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆29Updated 2 years ago
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆48Updated 2 years ago
- ☆33Updated 4 years ago
- TCPNet☆30Updated 3 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆47Updated 2 years ago
- [CVPR 2022 Oral] Towards Open Set Temporal Action Localization☆53Updated last year
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Updated 3 years ago
- ☆16Updated 2 years ago
- ☆61Updated 4 years ago
- [CVPR 2022] An Empirical Study of End-to-end Temporal Action Detection☆83Updated 2 years ago
- ☆44Updated 3 years ago
- Reducing spatial redundancy in video recognition. SOTA computational efficiency.☆124Updated 4 months ago
- Replace the MS-TCN with ASFormer in asrf☆21Updated 3 years ago
- Implementation of paper "Modeling Multi-Label Action Dependencies for Temporal Action Localization"☆50Updated 2 years ago
- Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"☆57Updated 3 years ago
- ☆52Updated 2 years ago
- Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 20…☆26Updated 3 years ago
- [CVPR 2021] 3D CNNs with Adaptive Temporal Feature Resolutions https://arxiv.org/abs/2011.08652☆26Updated 3 years ago
- Code and model for "Multi-dataset Training of Transformers for Robust Action Recognition", NeurIPS 2022 Spotlight☆20Updated last year
- [CVPR2022] MS-TCT☆54Updated 2 years ago
- AFNet(NeurIPS 2022)☆19Updated 2 years ago
- 【ACMMM'2021】DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning☆42Updated 3 years ago
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆34Updated 2 months ago
- Codes for arXiv paper "Semi-supervised Few-shot Atomic Action Recognition".☆17Updated 4 years ago
- Future Transformer for Long-term Action Anticipation (CVPR 2022)☆48Updated 2 years ago