JunweiLiang / MultiTrain
Code and model for "Multi-dataset Training of Transformers for Robust Action Recognition", NeurIPS 2022 Spotlight
☆20Updated last year
Alternatives and similar repositories for MultiTrain:
Users that are interested in MultiTrain are comparing it to the libraries listed below
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition".☆31Updated 2 years ago
- Future Transformer for Long-term Action Anticipation (CVPR 2022)☆48Updated 2 years ago
- Code accompanying Ego-Exo: Transferring Visual Representations from Third-person to First-person Videos (CVPR 2021)☆33Updated 3 years ago
- ☆47Updated 2 years ago
- [ICCV 2023] MGMAE: Motion Guided Masking for Video Masked Autoencoding☆20Updated last year
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Updated 3 years ago
- AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition (ICLR 2021)☆34Updated 4 years ago
- ☆44Updated 3 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Semi-Supervised Temporal Action Detection with Proposal-Free Masking "☆21Updated last year
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆43Updated 6 months ago
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆107Updated last year
- ☆90Updated 4 months ago
- ☆16Updated last month
- Reducing spatial redundancy in video recognition. SOTA computational efficiency.☆125Updated 4 months ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆36Updated 2 years ago
- Learning Representational Invariances for Data-Efficient Action Recognition☆33Updated 3 years ago
- The official project for the paper: Slot-VPS: Object-centric Representation Learning for Video Panoptic Segmentation, CVPR 2022☆14Updated 2 years ago
- This is the official released code for our paper, The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos, which has bee…☆53Updated 2 years ago
- CVPR 2021 VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild☆30Updated 2 years ago
- [WACV2021] Implementation of Pyramid Dilated Attention Network (PDAN)☆20Updated 2 years ago
- [ICCV2023] Spatio-temporal Prompting Network for Robust Video Feature Extraction☆11Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆45Updated last year
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning☆35Updated 2 years ago
- ☆27Updated 4 years ago
- Official Code Release for Container : Context Aggregation Network☆46Updated 3 years ago
- [CVPR 2023] Official repository for paper "Stare at What You See: Masked Image Modeling without Reconstruction"☆69Updated last year
- [ICCV'2023 Oral] Implicit Temporal Modeling with Learnable Alignment for Video Recognition☆35Updated last year
- Code for the VOST dataset☆25Updated last year
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆62Updated 3 years ago
- Learning from Temporal Gradient for Semi-supervised Action Recognition (CVPR 2022)☆29Updated 2 years ago