mx-mark/VideoTransformer-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mx-mark/VideoTransformer-pytorch)

mx-mark / VideoTransformer-pytorch

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

☆306

Alternatives and similar repositories for VideoTransformer-pytorch

Users that are interested in VideoTransformer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rishikksh20 / ViViT-pytorch
View on GitHub
Implementation of ViViT: A Video Vision Transformer
☆559Jun 21, 2021Updated 5 years ago
noureldien / vivit_pytorch
View on GitHub
Implementation of ViViT: A Video Vision Transformer - Zipping Coding Challenge
☆33Jun 10, 2021Updated 5 years ago
lucidrains / STAM-pytorch
View on GitHub
Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification
☆133Apr 1, 2021Updated 5 years ago
facebookresearch / TimeSformer
View on GitHub
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,863Apr 9, 2024Updated 2 years ago
haofanwang / video-swin-transformer-pytorch
View on GitHub
Video Swin Transformer - PyTorch
☆269Jan 4, 2022Updated 4 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
KSonPham / ViVit-a-Pytorch-implementation
View on GitHub
☆23Nov 18, 2022Updated 3 years ago
drv-agwl / ViViT-pytorch
View on GitHub
☆69Apr 26, 2021Updated 5 years ago
MCG-NJU / VideoMAE
View on GitHub
[NeurIPS 2022 Spotlight] VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
☆1,775Dec 8, 2023Updated 2 years ago
bomri / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆87Sep 13, 2021Updated 4 years ago
fcakyon / video-transformers
View on GitHub
Easiest way of fine-tuning HuggingFace video classification models
☆148Mar 20, 2023Updated 3 years ago
google-research / scenic
View on GitHub
Scenic: A Jax Library for Computer Vision Research and Beyond
☆3,819Jul 9, 2026Updated last week
lucidrains / TimeSformer-pytorch
View on GitHub
Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
☆730Aug 25, 2021Updated 4 years ago
rvl-lab-utoronto / video_similarity_search
View on GitHub
SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]
☆19Jan 27, 2023Updated 3 years ago
SforAiDl / vformer
View on GitHub
A modular PyTorch library for vision transformer models
☆165Oct 28, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
facebookresearch / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,392Mar 16, 2026Updated 4 months ago
wdrink / STTS
View on GitHub
Official PyTorch implementation of the ECCV 2022 paper: Efficient Video Transformers with Spatial-Temporal Token Selection.
☆52Jul 13, 2022Updated 4 years ago
ShipuLoveMili / CVPR2022-AURL
View on GitHub
This is the implementation of our AURL paper "Alignment-Uniformity aware Representation Learning for Zero-shot Video Classification".
☆15May 13, 2022Updated 4 years ago
open-mmlab / mmaction2
View on GitHub
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
☆5,099Mar 18, 2026Updated 4 months ago
cvdfoundation / kinetics-dataset
View on GitHub
☆981May 15, 2024Updated 2 years ago
ZhenxingZheng / S-TPNet
View on GitHub
PyTorch demo code for "Spatial-Temporal Pyramid Based Convolutional Neural Network for Action Recognition"
☆15Oct 17, 2018Updated 7 years ago
sangho-vision / avbert
View on GitHub
☆31Sep 20, 2021Updated 4 years ago
AzadDeihim / STTRE
View on GitHub
☆12Oct 17, 2023Updated 2 years ago
xuyu0010 / ATCoN
View on GitHub
Repository for ECCV 2022 paper "Source-free Video Domain Adaptation by Learning Temporal Consistency for Action Recognition"
☆24Mar 9, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
daniel-code / TubeViT
View on GitHub
An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"
☆95Updated this week
taeoh-kim / temporal_data_augmentation
View on GitHub
Code for Temporal Data Augmentations (ECCVW 2020)
☆37Aug 18, 2020Updated 5 years ago
facebookresearch / Motionformer
View on GitHub
Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers
☆234Jun 13, 2022Updated 4 years ago
sallymmx / ActionCLIP
View on GitHub
This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"
☆614Dec 6, 2023Updated 2 years ago
facebookresearch / pytorchvideo
View on GitHub
A deep learning library for video understanding research.
☆3,565May 5, 2026Updated 2 months ago
neda77aa / FTC
View on GitHub
This repo holds the code for: {Transformer-based Spatio-temporal Analysis for Automatic Classification of Aortic Stenosis Severity from B…
☆12Nov 29, 2022Updated 3 years ago
EnergyWeatherAI / SolarSTEPS
View on GitHub
☆14Jan 15, 2026Updated 6 months ago
Sense-X / UniFormer
View on GitHub
[ICLR2022] official implementation of UniFormer
☆906Mar 29, 2024Updated 2 years ago
alibaba-mmai-research / TAdaConv
View on GitHub
[ICLR 2022] TAda! Temporally-Adaptive Convolutions for Video Understanding. This codebase provides solutions for video classification, vi…
☆246Aug 23, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
yiyixuxu / TimeSformer-rolled-attention
View on GitHub
Visualizing the learned space-time attention using Attention Rollout
☆41Apr 1, 2022Updated 4 years ago
yuanyao366 / PRP
View on GitHub
☆40May 7, 2022Updated 4 years ago
saic-fi / xvit_video_transformers
View on GitHub
[NeurIPS 2021] Space-time Mixing Attention for Video Transformer
☆17Mar 18, 2022Updated 4 years ago
kylemin / S3D
View on GitHub
Release of the pretrained S3D Network in PyTorch (ECCV 2018)
☆138Jul 20, 2023Updated 3 years ago
jfzhang95 / pytorch-video-recognition
View on GitHub
PyTorch implemented C3D, R3D, R2Plus1D models for video activity recognition.
☆1,237Dec 27, 2023Updated 2 years ago
tobyperrett / few-shot-action-recognition
View on GitHub
Implementations of some few-shot action recognition methods.
☆43Jun 7, 2021Updated 5 years ago
happyharrycn / actionformer_release
View on GitHub
Code release for ActionFormer (ECCV 2022)
☆570Apr 11, 2024Updated 2 years ago