Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆221Aug 23, 2022Updated 3 years ago
Alternatives and similar repositories for STAM
Users that are interested in STAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆134Apr 1, 2021Updated 5 years ago
- Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification☆729Aug 25, 2021Updated 4 years ago
- Official Pytorch Implementation of "PETA: Photo Albums Event Recognition using Transformers Attention" (2021)☆19Aug 23, 2022Updated 3 years ago
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,840Apr 9, 2024Updated 2 years ago
- [Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance☆104Aug 12, 2020Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- [CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition☆381Sep 17, 2022Updated 3 years ago
- This is an official implementation for "Video Swin Transformers".☆1,646Mar 8, 2023Updated 3 years ago
- SoT: Delving Deeper into Classification Head for Transformer☆50Dec 24, 2021Updated 4 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆49Mar 18, 2021Updated 5 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020☆149Jun 19, 2020Updated 5 years ago
- ☆73Jun 3, 2022Updated 3 years ago
- [CVPR 2020] Temporal Pyramid Network for Action Recognition☆392Jan 12, 2021Updated 5 years ago
- Implementation of ViViT: A Video Vision Transformer☆556Jun 21, 2021Updated 4 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,201Jul 11, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'☆148Aug 25, 2023Updated 2 years ago
- Implementations of Transformers for Video☆24Mar 26, 2021Updated 5 years ago
- Learning Spatiotemporal Features via Video and Text Pair Discrimination☆60Jan 20, 2021Updated 5 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,336Mar 16, 2026Updated 3 weeks ago
- Official pytorch implementation of paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP…☆148Aug 18, 2021Updated 4 years ago
- This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"☆608Dec 6, 2023Updated 2 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆64Mar 9, 2022Updated 4 years ago
- Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771☆33Oct 24, 2023Updated 2 years ago
- A deep learning library for video understanding research.☆3,554Jan 12, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code release for SLIP Self-supervision meets Language-Image Pre-training☆788Feb 9, 2023Updated 3 years ago
- Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021)☆478Dec 10, 2024Updated last year
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- Implementation of momentum^2 teacher☆121Jan 27, 2021Updated 5 years ago
- Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]☆377May 19, 2022Updated 3 years ago
- TAM: Temporal Adaptive Module for Video Recognition☆208Aug 18, 2022Updated 3 years ago
- Official PyTorch Implementation of MotionSqueeze, ECCV 2020☆139Oct 14, 2021Updated 4 years ago
- Source code for ABMs.☆13Jul 30, 2021Updated 4 years ago
- Official Pytorch Implementation of: "Asymmetric Loss For Multi-Label Classification"(ICCV, 2021) paper☆790Aug 4, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- [Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant☆23Sep 11, 2023Updated 2 years ago
- VMZ: Model Zoo for Video Modeling☆1,053Jun 17, 2025Updated 9 months ago
- Revisiting Anchor Mechanisms for Temporal Action Localization (TIP 2020)☆36Sep 26, 2021Updated 4 years ago
- AViD Dataset: Anonymized Videos from Diverse Countries☆56Mar 30, 2023Updated 3 years ago
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones☆201Mar 24, 2021Updated 5 years ago
- Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper☆780Jan 11, 2023Updated 3 years ago
- ☆47Apr 14, 2022Updated 3 years ago