Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆222 · Aug 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for STAM
Users who are interested in STAM are comparing it to the libraries listed below
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?" ☆1,830 · Apr 9, 2024 · Updated last year
- This is an official implementation for "Video Swin Transformers". ☆1,632 · Mar 8, 2023 · Updated 2 years ago
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance ☆104 · Aug 12, 2020 · Updated 5 years ago
- [CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition ☆381 · Sep 17, 2022 · Updated 3 years ago
- The Pytorch code of the TEA module (Temporal Excitation and Aggregation for Action Recognition) ☆200 · Apr 4, 2022 · Updated 3 years ago
- Implementation for NATv2. ☆23 · Feb 20, 2021 · Updated 5 years ago
- Implementation of momentum^2 teacher ☆122 · Jan 27, 2021 · Updated 5 years ago
- Implementation of ViViT: A Video Vision Transformer ☆556 · Jun 21, 2021 · Updated 4 years ago
- Code release for SLIP Self-supervision meets Language-Image Pre-training ☆787 · Feb 9, 2023 · Updated 3 years ago
- Official pytorch implementation of paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP… ☆148 · Aug 18, 2021 · Updated 4 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning… ☆723 · Aug 8, 2023 · Updated 2 years ago
- A deep learning library for video understanding research. ☆3,544 · Jan 12, 2026 · Updated last month
- [CVPR 2020] Temporal Pyramid Network for Action Recognition ☆393 · Jan 12, 2021 · Updated 5 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding ☆2,184 · Jul 11, 2024 · Updated last year
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models. ☆7,297 · Feb 19, 2026 · Updated last week
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020 ☆139 · Oct 14, 2021 · Updated 4 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020. ☆49 · Mar 18, 2021 · Updated 4 years ago
- [ICLR2021] official implementation of CT-Net ☆37 · Dec 29, 2021 · Updated 4 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020 ☆150 · Jun 19, 2020 · Updated 5 years ago
- An official implementation for "UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation" ☆364 · Jul 25, 2024 · Updated last year
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21] ☆380 · May 19, 2022 · Updated 3 years ago
- ☆73 · Jun 3, 2022 · Updated 3 years ago
- ☆74 · Dec 8, 2022 · Updated 3 years ago
- An end-to-end PyTorch framework for image and video classification ☆1,613 · Jun 27, 2024 · Updated last year
- This is the official implementation of the paper "ActionCLIP: A New Paradigm for Action Recognition" ☆602 · Dec 6, 2023 · Updated 2 years ago
- [CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing ☆782 · Oct 3, 2023 · Updated 2 years ago
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses" (NeurIPS, 2021) paper ☆780 · Jan 11, 2023 · Updated 3 years ago
- SoT: Delving Deeper into Classification Head for Transformer ☆50 · Dec 24, 2021 · Updated 4 years ago
- Implementations of Transformers for Video ☆24 · Mar 26, 2021 · Updated 4 years ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination ☆60 · Jan 20, 2021 · Updated 5 years ago
- Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021) ☆478 · Dec 10, 2024 · Updated last year
- [CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers ☆756 · Jul 15, 2021 · Updated 4 years ago
- Use CLIP to represent video for Retrieval Task ☆70 · Mar 1, 2021 · Updated 5 years ago
- Official code for Cross-Covariance Image Transformer (XCiT) ☆674 · Sep 28, 2021 · Updated 4 years ago
- [CVPR 2021] Code for "Augmentation Strategies for Learning with Noisy Labels". ☆113 · Jan 9, 2022 · Updated 4 years ago
- VMZ: Model Zoo for Video Modeling ☆1,053 · Jun 17, 2025 · Updated 8 months ago
- Video Contrastive Learning with Global Context, ICCVW 2021 ☆162 · May 30, 2022 · Updated 3 years ago
- ☆58 · Jun 18, 2021 · Updated 4 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding ☆64 · Mar 9, 2022 · Updated 3 years ago