Alibaba-MIIL/STAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Alibaba-MIIL/STAM)

Alibaba-MIIL / STAM

Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)

☆221

Alternatives and similar repositories for STAM

Users that are interested in STAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lucidrains / STAM-pytorch
View on GitHub
Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification
☆133Apr 1, 2021Updated 5 years ago
lucidrains / TimeSformer-pytorch
View on GitHub
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
☆729Aug 25, 2021Updated 4 years ago
Alibaba-MIIL / PETA
View on GitHub
Official Pytorch Implementation of "PETA: Photo Albums Event Recognition using Transformers Attention" (2021)
☆20Aug 23, 2022Updated 3 years ago
facebookresearch / TimeSformer
View on GitHub
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,863Apr 9, 2024Updated 2 years ago
zhang-can / PAN-PyTorch
View on GitHub
[Codes of paper]: PAN: Towards Fast Action Recognition via Learning Persistence of Appearance
☆104Aug 12, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
Phoenix1327 / tea-action-recognition
View on GitHub
The Pytorch code of the TEA module (Temporal Excitation and Aggregation for Action Recognition)
☆201Apr 4, 2022Updated 4 years ago
MCG-NJU / TDN
View on GitHub
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
☆384Sep 17, 2022Updated 3 years ago
SwinTransformer / Video-Swin-Transformer
View on GitHub
This is an official implementation for "Video Swin Transformers".
☆1,667Mar 8, 2023Updated 3 years ago
jiangtaoxie / SoT
View on GitHub
SoT: Delving Deeper into Classification Head for Transformer
☆50Dec 24, 2021Updated 4 years ago
sjenni / temporal-ssl
View on GitHub
Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
☆49Mar 18, 2021Updated 5 years ago
swathikirans / GSM
View on GitHub
Gate-Shift Networks for Video Action Recognition - CVPR 2020
☆149Jun 19, 2020Updated 6 years ago
airsplay / vimpac
View on GitHub
☆73Jun 3, 2022Updated 4 years ago
decisionforce / TPN
View on GitHub
[CVPR 2020] Temporal Pyramid Network for Action Recognition
☆394Jan 12, 2021Updated 5 years ago
rishikksh20 / ViViT-pytorch
View on GitHub
Implementation of ViViT: A Video Vision Transformer
☆558Jun 21, 2021Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mit-han-lab / temporal-shift-module
View on GitHub
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
☆2,215Jul 11, 2024Updated 2 years ago
joaanna / something_else
View on GitHub
Code repository for the paper: 'Something-Else: Compositional Action Recognition with Spatial-Temporal Interaction Networks'
☆148Aug 25, 2023Updated 2 years ago
m-bain / video-transformers
View on GitHub
Implementations of Transformers for Video
☆24Mar 26, 2021Updated 5 years ago
MCG-NJU / CPD-Video
View on GitHub
Learning Spatiotemporal Features via Video and Text Pair Discrimination
☆60Jan 20, 2021Updated 5 years ago
facebookresearch / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,391Mar 16, 2026Updated 4 months ago
sallymmx / ActionCLIP
View on GitHub
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
☆613Dec 6, 2023Updated 2 years ago
tinapan-pt / VideoMoCo
View on GitHub
Official pytorch implementation of paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVP…
☆148Aug 18, 2021Updated 4 years ago
Chuhanxx / Temporal_Query_Networks
View on GitHub
The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆64Mar 9, 2022Updated 4 years ago
tianyuan168326 / EAN-Pytorch
View on GitHub
Official PyTorch Implementation of paper EAN: Event Adaptive Network for Efficient Action Recognition https://arxiv.org/abs/2107.10771
☆33Oct 24, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
facebookresearch / pytorchvideo
View on GitHub
A deep learning library for video understanding research.
☆3,565May 5, 2026Updated 2 months ago
facebookresearch / SLIP
View on GitHub
Code release for SLIP Self-supervision meets Language-Image Pre-training
☆792Feb 9, 2023Updated 3 years ago
Alibaba-MIIL / TResNet
View on GitHub
Official Pytorch Implementation of "TResNet: High-Performance GPU-Dedicated Architecture" (WACV 2021)
☆478Dec 10, 2024Updated last year
zengarden / momentum2-teacher
View on GitHub
Implementation of momentum^2 teacher
☆120Jan 27, 2021Updated 5 years ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
m-bain / frozen-in-time
View on GitHub
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval [ICCV'21]
☆377May 19, 2022Updated 4 years ago
liu-zhy / temporal-adaptive-module
View on GitHub
TAM: Temporal Adaptive Module for Video Recognition
☆207Aug 18, 2022Updated 3 years ago
arunos728 / MotionSqueeze
View on GitHub
Official PyTorch Implementation of MotionSqueeze, ECCV 2020
☆139Oct 14, 2021Updated 4 years ago
zhuxinqimac / abm-pytorch
View on GitHub
Source code for ABMs.
☆13Jul 30, 2021Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Alibaba-MIIL / ASL
View on GitHub
Official Pytorch Implementation of: "Asymmetric Loss For Multi-Label Classification"(ICCV, 2021) paper
☆803Aug 4, 2023Updated 2 years ago
StanLei52 / TQVSR
View on GitHub
[Findings of EMNLP 2022] AssistSR: Task-oriented Video Segment Retrieval for Personal AI Assistant
☆24Sep 11, 2023Updated 2 years ago
facebookresearch / VMZ
View on GitHub
VMZ: Model Zoo for Video Modeling
☆1,053Jun 17, 2025Updated last year
VividLe / A2Net
View on GitHub
Revisiting Anchor Mechanisms for Temporal Action Localization (TIP 2020)
☆36Sep 26, 2021Updated 4 years ago
piergiaj / AViD
View on GitHub
AViD Dataset: Anonymized Videos from Diverse Countries
☆54Mar 30, 2023Updated 3 years ago
lucidrains / halonet-pytorch
View on GitHub
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆199Mar 24, 2021Updated 5 years ago
Alibaba-MIIL / ImageNet21K
View on GitHub
Official Pytorch Implementation of: "ImageNet-21K Pretraining for the Masses"(NeurIPS, 2021) paper
☆779Jan 11, 2023Updated 3 years ago