Alibaba-MIIL / STAM
Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆219Updated 2 years ago
Alternatives and similar repositories for STAM:
Users that are interested in STAM are comparing it to the libraries listed below
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020☆139Updated 3 years ago
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆288Updated 3 years ago
- Official implementation of ACMMM'20 paper 'Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework'☆111Updated 3 years ago
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance☆102Updated 4 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020☆149Updated 4 years ago
- A pytorch-version implementation codes of paper: "BMN: Boundary-Matching Network for Temporal Action Proposal Generation", which is ac…☆296Updated 3 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆131Updated 3 years ago
- Self-supervised Spatiotemporal Learning via Video Clip Order Prediction☆105Updated last year
- PyTorch implementation of X3D models with Multigrid training.☆94Updated 3 years ago
- Implementation of the paper Video Action Transformer Network☆135Updated 3 years ago
- Official pytorch implementation of paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP…☆147Updated 3 years ago
- Video Contrastive Learning with Global Context, ICCVW 2021☆158Updated 2 years ago
- Datasets, transforms and samplers for video in PyTorch☆87Updated last year
- The official implementation of G-TAD: Sub-Graph Localization for Temporal Action Detection☆217Updated 3 years ago
- [AAAI 2020] Temporal Interlacing Network☆84Updated 4 years ago
- [CVPR 2020] Action Segmentation with Joint Self-Supervised Temporal Domain Adaptation (PyTorch)☆156Updated 4 years ago
- Long-Term Feature Banks for Detailed Video Understanding☆377Updated 3 years ago
- The Holistic Video Understanding Dataset (ECCV 2020 Spotlight presentation)☆72Updated 4 years ago
- [CVPR 2020] Temporal Pyramid Network for Action Recognition☆392Updated 4 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆164Updated 3 years ago
- Transforms for video datasets in pytorch☆272Updated 3 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆228Updated 2 years ago
- Code for our paper "Weakly-Supervised Action Localization by Generative Attention Modeling" (CVPR2020)☆135Updated 2 years ago
- [ECCV 2020] Boundary-Aware Cascade Networks for Temporal Action Segmentation☆84Updated 4 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆99Updated 3 years ago
- Feature Extractor module for videos using the PySlowFast framework☆78Updated 3 years ago
- Listen to Look: Action Recognition by Previewing Audio (CVPR 2020)☆128Updated 3 years ago
- I3D Nonlocal ResNets in Pytorch☆251Updated 2 years ago
- FingerRec / Self-Supervised-Temporal-Discriminative-Representation-Learning-for-Video-Action-Recognition[Arxiv2020] The code for our paper 《Self-Supervised Temporal-Discriminative Representation Learning for Video Action Recognition》 https:/…☆76Updated 4 years ago
- Temporal-Relational CrossTransformers (CVPR 2021)☆108Updated 3 years ago