saic-fi / xvit_video_transformers
[NeurIPS 2021] Space-time Mixing Attention for Video Transformer
☆15Updated 2 years ago
Alternatives and similar repositories for xvit_video_transformers:
Users that are interested in xvit_video_transformers are comparing it to the libraries listed below
- AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition (ICLR 2021)☆33Updated 3 years ago
- [CVPR 2023] Code for action prediction from videos☆23Updated 10 months ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆39Updated 3 months ago
- [CVPR 2021] 3D CNNs with Adaptive Temporal Feature Resolutions https://arxiv.org/abs/2011.08652☆26Updated 3 years ago
- A simple but efficient transformer model for video action recognition☆56Updated 2 years ago
- Code for CVPR'21 paper "Learning Asynchronous and Sparse Human-Object Interaction in Videos".☆23Updated 3 years ago
- Official PyTorch implementation of "Video Prediction Recalling Long-term Motion Context via Memory Alignment Learning" (CVPR 2021 Oral)☆88Updated 3 years ago
- Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 20…☆26Updated 3 years ago
- ☆32Updated 3 years ago
- AFNet(NeurIPS 2022)☆19Updated 2 years ago
- Code for the ECCV'22 paper "Geometric Features Informed Multi-person Human-object Interaction Recognition in Videos".☆28Updated 11 months ago
- Code and model for "Multi-dataset Training of Transformers for Robust Action Recognition", NeurIPS 2022 Spotlight☆20Updated last year
- ☆16Updated 2 years ago
- Code for our CVPR 2021 paper "Coarse-Fine Networks for Temporal Activity Detection in Videos"☆56Updated 3 years ago
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆48Updated 2 years ago
- Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771☆33Updated last year
- TCPNet☆30Updated 3 years ago
- PyTorch demo code for "Spatial-Temporal Pyramid Based Convolutional Neural Network for Action Recognition"☆15Updated 6 years ago
- ☆16Updated 2 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆28Updated 2 years ago
- [IEEE T-IP 2022] TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning☆24Updated last year
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆227Updated 2 years ago
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Updated 3 years ago
- ☆13Updated last year
- [CVPR 2022] Official Pytorch Implementation for "Spatio-temporal Relation Modeling for Few-shot Action Recognition". SOTA Results for Few…☆99Updated 2 years ago
- Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.☆46Updated 2 years ago
- Semi-Supervised Action Recognition with Temporal Contrastive Learning☆56Updated 10 months ago
- [ICLR2021] official implementation of CT-Net☆37Updated 3 years ago
- This is an official implementation for "Making Vision Transformers Efficient from A Token Sparsification View".☆31Updated 7 months ago
- Codes for arXiv paper "Semi-supervised Few-shot Atomic Action Recognition".☆17Updated 4 years ago