Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆221Aug 23, 2022Updated 3 years ago
Alternatives and similar repositories for STAM
Users that are interested in STAM are comparing it to the libraries listed below
Sorting:
- Official Pytorch Implementation of "PETA: Photo Albums Event Recognition using Transformers Attention" (2021)☆19Aug 23, 2022Updated 3 years ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,838Apr 9, 2024Updated last year
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance☆104Aug 12, 2020Updated 5 years ago
- The Pytorch code of the TEA module (Temporal Excitation and Aggregation for Action Recognition)☆200Apr 4, 2022Updated 3 years ago
- [CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition☆381Sep 17, 2022Updated 3 years ago
- This is an official implementation for "Video Swin Transformers".☆1,638Mar 8, 2023Updated 3 years ago
- SoT: Delving Deeper into Classification Head for Transformer☆50Dec 24, 2021Updated 4 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆49Mar 18, 2021Updated 5 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020☆149Jun 19, 2020Updated 5 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- [CVPR 2020] Temporal Pyramid Network for Action Recognition☆392Jan 12, 2021Updated 5 years ago
- Implementation of ViViT: A Video Vision Transformer☆557Jun 21, 2021Updated 4 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,195Jul 11, 2024Updated last year
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Aug 25, 2023Updated 2 years ago
- Implementations of Transformers for Video☆24Mar 26, 2021Updated 4 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,314Updated this week
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆60Jan 20, 2021Updated 5 years ago
- Official pytorch implementation of paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP…☆148Aug 18, 2021Updated 4 years ago
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆603Dec 6, 2023Updated 2 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆64Mar 9, 2022Updated 4 years ago
- Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771☆33Oct 24, 2023Updated 2 years ago
- A deep learning library for video understanding research.☆3,550Jan 12, 2026Updated 2 months ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆787Feb 9, 2023Updated 3 years ago
- Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021)☆478Dec 10, 2024Updated last year
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- Implementation of momentum^2 teacher☆121Jan 27, 2021Updated 5 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆380May 19, 2022Updated 3 years ago
- TAM: Temporal Adaptive Module for Video Recognition☆208Aug 18, 2022Updated 3 years ago
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020☆139Oct 14, 2021Updated 4 years ago
- Source code for ABMs.☆13Jul 30, 2021Updated 4 years ago
- Official Pytorch Implementation of: "Asymmetric Loss For Multi-Label Classification"(ICCV, 2021) paper☆787Aug 4, 2023Updated 2 years ago
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Sep 11, 2023Updated 2 years ago
- VMZ: Model Zoo for Video Modeling☆1,053Jun 17, 2025Updated 9 months ago
- Revisiting Anchor Mechanisms for Temporal Action Localization (TIP 2020)☆36Sep 26, 2021Updated 4 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Mar 30, 2023Updated 2 years ago
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper☆780Jan 11, 2023Updated 3 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆219Jul 5, 2022Updated 3 years ago
- An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"☆365Jul 25, 2024Updated last year
- Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 20…☆26Oct 15, 2021Updated 4 years ago