Alibaba-MIIL / STAM
Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆219 · Updated 2 years ago
Alternatives and similar repositories for STAM:
Users who are interested in STAM are comparing it to the repositories listed below.
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020 ☆139 · Updated 3 years ago
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman. ☆288 · Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021 ☆158 · Updated 2 years ago
- Official PyTorch implementation of the paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP… ☆146 · Updated 3 years ago
- Code for the paper "PAN: Towards Fast Action Recognition via Learning Persistence of Appearance" ☆102 · Updated 4 years ago
- The official implementation of G-TAD: Sub-Graph Localization for Temporal Action Detection ☆217 · Updated 3 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020 ☆149 · Updated 4 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification ☆131 · Updated 3 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework' ☆111 · Updated 4 years ago
- Implementation of the paper Video Action Transformer Network ☆135 · Updated 3 years ago
- A PyTorch implementation of the paper "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is ac… ☆296 · Updated 3 years ago
- Self-supervised Spatiotemporal Learning via Video Clip Order Prediction ☆106 · Updated last year
- Code for our paper "Weakly-Supervised Action Localization by Generative Attention Modeling" (CVPR2020) ☆135 · Updated 2 years ago
- Long-Term Feature Banks for Detailed Video Understanding ☆377 · Updated 3 years ago
- [CVPR 2020] Temporal Pyramid Network for Action Recognition ☆392 · Updated 4 years ago
- [AAAI 2020] Temporal Interlacing Network ☆84 · Updated 4 years ago
- Official repo for ECCV 2020 paper - RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition ☆100 · Updated 4 years ago
- Graph Convolutional Networks for Temporal Action Localization (ICCV2019) ☆322 · Updated 4 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation) ☆72 · Updated 4 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020) ☆129 · Updated 3 years ago
- Transforms for video datasets in PyTorch ☆272 · Updated 3 years ago
- Temporal-Relational CrossTransformers (CVPR 2021) ☆108 · Updated 3 years ago
- Implementations of Transformers for Video ☆23 · Updated 4 years ago
- PyTorch implementation of X3D models with Multigrid training. ☆94 · Updated 3 years ago
- CVPR2021: Temporal Context Aggregation Network for Temporal Action Proposal Refinement ☆71 · Updated 3 years ago
- Zero-shot video classification by end-to-end training of 3D convolutional neural networks ☆146 · Updated 4 years ago
- TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks (ICCVW 2021) ☆112 · Updated last year
- Code for our ECCV 2020 paper: Self-supervised Video Representation Learning by Pace Prediction ☆99 · Updated 3 years ago
- The PyTorch code of the TEA module (Temporal Excitation and Aggregation for Action Recognition) ☆193 · Updated 2 years ago
- Implementation of "EPIC-Fusion: Audio-Visual Temporal Binding for Egocentric Action Recognition, ICCV, 2019" in PyTorch ☆111 · Updated 4 years ago