haofanwang/video-swin-transformer-pytorch

haofanwang / video-swin-transformer-pytorch

Video Swin Transformer - PyTorch

☆269

Alternatives and similar repositories for video-swin-transformer-pytorch

Users that are interested in video-swin-transformer-pytorch are comparing it to the libraries listed below.

SwinTransformer / Video-Swin-Transformer
This is an official implementation for "Video Swin Transformers".
☆1,667 · Mar 8, 2023 · Updated 3 years ago
rishikksh20 / ViViT-pytorch
Implementation of ViViT: A Video Vision Transformer
☆559 · Jun 21, 2021 · Updated 5 years ago
mx-mark / VideoTransformer-pytorch
PyTorch implementation of a collection of scalable Video Transformer Benchmarks.
☆306 · May 4, 2022 · Updated 4 years ago
facebookresearch / TimeSformer
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,864 · Apr 9, 2024 · Updated 2 years ago
microsoft / SwinBERT
Research code for CVPR 2022 paper "SwinBERT: End-to-End Transformers with Sparse Attention for Video Captioning"
☆251 · May 26, 2022 · Updated 4 years ago
xyzforever / BEVT
PyTorch implementation of BEVT (CVPR 2022) https://arxiv.org/abs/2112.01529
☆161 · Jul 19, 2022 · Updated 4 years ago
Tramac / tiny-kinetics-400
Tiny Kinetics-400 for test
☆101 · Feb 21, 2024 · Updated 2 years ago
saic-fi / xvit_video_transformers
[NeurIPS 2021] Space-time Mixing Attention for Video Transformer
☆17 · Mar 18, 2022 · Updated 4 years ago
baiyang4 / D-LSG-Video-Caption
☆26 · Oct 20, 2021 · Updated 4 years ago
ylqi / GL-RG
The code of IJCAI22 paper "GL-RG: Global-Local Representation Granularity for Video Captioning".
☆18 · May 10, 2023 · Updated 3 years ago
microsoft / Swin-Transformer
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
☆16,010 · Jul 24, 2024 · Updated 2 years ago
sunlicai / SVFAP
[TAC 2024] SVFAP: Self-supervised Video Facial Affect Perceiver
☆26 · Sep 25, 2024 · Updated last year
MCG-NJU / VideoMAE
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775 · Dec 8, 2023 · Updated 2 years ago
berniwal / swin-transformer-pytorch
Implementation of the Swin Transformer in PyTorch.
☆862 · Mar 29, 2021 · Updated 5 years ago
MarcusNerva / HMN
[CVPR2022] Official code for Hierarchical Modular Network for Video Captioning. Our proposed HMN is implemented with PyTorch.
☆50 · Sep 30, 2022 · Updated 3 years ago
facebookresearch / mvit
Code Release for MViTv2 on Image Recognition.
☆456 · Nov 26, 2024 · Updated last year
MohammadRezaQaderi / Video-Swin-Transformer
This is Video Swin Transformer to recognize the video with Machine Vision
☆19 · Sep 4, 2021 · Updated 4 years ago
lucidrains / TimeSformer-pytorch
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
☆730 · Aug 25, 2021 · Updated 4 years ago
titania7777 / UCF101FewShot
Testing code for few-shot action recognition
☆11 · Jan 12, 2021 · Updated 5 years ago
drv-agwl / ViViT-pytorch
☆69 · Apr 26, 2021 · Updated 5 years ago
v-iashin / video_features
Extract video features from raw videos using multiple GPUs. We support RAFT flow frames as well as S3D, I3D, R(2+1)D, VGGish, CLIP, and T…
☆654 · Feb 1, 2026 · Updated 5 months ago
piergiaj / pytorch-i3d
☆1,051 · Jun 28, 2020 · Updated 6 years ago
open-mmlab / mmaction2
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
☆5,104 · Mar 18, 2026 · Updated 4 months ago
liu-zhy / temporal-adaptive-module
TAM: Temporal Adaptive Module for Video Recognition
☆207 · Aug 18, 2022 · Updated 3 years ago
bomri / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆87 · Sep 13, 2021 · Updated 4 years ago
bellos1203 / TCD
Code for "Class-Incremental Learning for Action Recognition in Videos", ICCV 2021
☆22 · Oct 14, 2022 · Updated 3 years ago
lucidrains / vit-pytorch
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,436 · Jun 22, 2026 · Updated last month
TengdaHan / TemporalAlignNet
[CVPR'22 Oral] Temporal Alignment Networks for Long-term Video. Tengda Han, Weidi Xie, Andrew Zisserman.
☆122 · Oct 9, 2023 · Updated 2 years ago
WingsBrokenAngel / MSR-VTT-DataCleaning
☆19 · Dec 22, 2022 · Updated 3 years ago
tianyu0207 / RTFM
Official code for 'Weakly-supervised Video Anomaly Detection with Robust Temporal Feature Magnitude Learning' [ICCV 2021]
☆345 · Oct 29, 2025 · Updated 8 months ago
wangzehui20 / OTAM-Video-via-Temporal-Alignment
Fast CUDA implementation of (differentiable) otam for PyTorch using Numba
☆16 · Jun 21, 2021 · Updated 5 years ago
SimonZeng7108 / Video-SwinUNet
☆16 · Jun 13, 2023 · Updated 3 years ago
microsoft / SimMIM
This is an official implementation for "SimMIM: A Simple Framework for Masked Image Modeling".
☆1,047 · Sep 29, 2022 · Updated 3 years ago
facebookresearch / SlowFast
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,393 · Mar 16, 2026 · Updated 4 months ago
wangjs96 / Graph-in-Graph-Neural-Network
Graph in Graph Neural Network (https://arxiv.org/abs/2407.00696)
☆16 · Sep 12, 2024 · Updated last year
Kim-Byeong-Hun / yolov9-pose
Human Pose Estimation using YOLOv9
☆18 · Mar 23, 2024 · Updated 2 years ago
Sissuire / SAMA
AAAI-2024
☆22 · Sep 18, 2025 · Updated 10 months ago
facebookresearch / Motionformer
Code + pre-trained models for the paper "Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers"
☆234 · Jun 13, 2022 · Updated 4 years ago
CRISTAL-3DSAM / WER-SSL
☆13 · Jun 5, 2023 · Updated 3 years ago