Alibaba-MIIL / STAMLinks
Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆222Updated 3 years ago
Alternatives and similar repositories for STAM
Users that are interested in STAM are comparing it to the libraries listed below
Sorting:
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020☆138Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆158Updated 3 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 4 years ago
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆288Updated 3 years ago
- PyTorch implementation of X3D models with Multigrid training.☆94Updated 3 years ago
- Implementation of the paper Video Action Transformer Network☆137Updated 4 years ago
- FingerRec / Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition[Arxiv2020] The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》 https:/…☆76Updated 4 years ago
- Official pytorch implementation of paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP…☆147Updated 4 years ago
- Official repo for ECCV 2020 paper - RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition☆99Updated 4 years ago
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance☆104Updated 5 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆100Updated 4 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020☆149Updated 5 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆134Updated 4 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆73Updated 4 years ago
- Feature Extractor module for videos using the PySlowFast framework☆79Updated 4 years ago
- [AAAI 2020] Temporal Interlacing Network☆84Updated 4 years ago
- Official Pytorch Implementation of 'Background Suppression Network for Weakly-supervised Temporal Action Localization' (AAAI-20 Spotlight…☆171Updated last year
- Efficient 3D Backbone Network for Temporal Modeling☆109Updated 4 years ago
- Transforms for video datasets in pytorch☆276Updated 4 years ago
- The official implementation of G-TAD: Sub-Graph Localization for Temporal Action Detection☆221Updated 4 years ago
- [CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)☆155Updated 4 years ago
- HACS: Human Action Clips and Segments Dataset☆194Updated 5 years ago
- Self-supervised Spatiotemporal Learning via Video Clip Order Prediction☆106Updated 2 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆165Updated 4 years ago
- Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch☆113Updated 4 years ago
- Code for : [Pattern Recognit. Lett. 2021] "Learn to cycle: Time-consistent feature discovery for action recognition" and [IJCNN 2021] "Mu…☆68Updated 2 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆230Updated 3 years ago
- [CVPR 2021] Multi-shot Temporal Event Localization: a Benchmark☆55Updated 3 years ago
- ☆71Updated 4 years ago
- Datasets, transforms and samplers for video in PyTorch☆88Updated last year