Alibaba-MIIL / STAM
Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆219Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for STAM
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆287Updated 3 years ago
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020☆139Updated 3 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆110Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆158Updated 2 years ago
- Official pytorch implementation of paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP…☆142Updated 3 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆130Updated 3 years ago
- PyTorch implementation of X3D models with Multigrid training.☆93Updated 3 years ago
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance☆102Updated 4 years ago
- The official implementation of G-TAD: Sub-Graph Localization for Temporal Action Detection☆216Updated 3 years ago
- Self-supervised Spatiotemporal Learning via Video Clip Order Prediction☆102Updated last year
- Implementation of the paper Video Action Transformer Network☆135Updated 3 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆70Updated 3 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020☆149Updated 4 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆164Updated 3 years ago
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation☆84Updated 3 years ago
- FingerRec / Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition[Arxiv2020] The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》 https:/…☆77Updated 4 years ago
- [AAAI 2020] Temporal Interlacing Network☆85Updated 3 years ago
- [ICCV 2019] Official implementation of Temporal Recurrent Networks for Online Action Detection☆83Updated 2 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆99Updated 3 years ago
- A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is ac…☆291Updated 2 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆226Updated 2 years ago
- PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529☆158Updated 2 years ago
- CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement☆70Updated 3 years ago
- I3D Nonlocal ResNets in Pytorch☆246Updated 2 years ago
- Feature Extractor module for videos using the PySlowFast framework☆77Updated 3 years ago
- TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021)☆108Updated last year
- Transforms for video datasets in pytorch☆269Updated 3 years ago
- [CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)☆154Updated 3 years ago
- Implementations of Transformers for Video☆24Updated 3 years ago
- Code for our paper "Weakly-Supervised Action Localization by Generative Attention Modeling" (CVPR2020)☆136Updated last year