lucidrains/TimeSformer-pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/lucidrains/TimeSformer-pytorch)

lucidrains / TimeSformer-pytorch

Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification

☆730

Alternatives and similar repositories for TimeSformer-pytorch

Users that are interested in TimeSformer-pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

facebookresearch / TimeSformer
View on GitHub
The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"
☆1,863Apr 9, 2024Updated 2 years ago
davide-coccomini / TimeSformer-Video-Classification
View on GitHub
The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understandi…
☆42Mar 19, 2021Updated 5 years ago
lucidrains / STAM-pytorch
View on GitHub
Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification
☆133Apr 1, 2021Updated 5 years ago
Alibaba-MIIL / STAM
View on GitHub
Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)
☆221Aug 23, 2022Updated 3 years ago
SwinTransformer / Video-Swin-Transformer
View on GitHub
This is an official implementation for "Video Swin Transformers".
☆1,666Mar 8, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rishikksh20 / ViViT-pytorch
View on GitHub
Implementation of ViViT: A Video Vision Transformer
☆559Jun 21, 2021Updated 5 years ago
facebookresearch / SlowFast
View on GitHub
PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.
☆7,394Mar 16, 2026Updated 4 months ago
facebookresearch / pytorchvideo
View on GitHub
A deep learning library for video understanding research.
☆3,563May 5, 2026Updated 2 months ago
MCG-NJU / TDN
View on GitHub
[CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition
☆386Sep 17, 2022Updated 3 years ago
mit-han-lab / temporal-shift-module
View on GitHub
[ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding
☆2,215Jul 11, 2024Updated 2 years ago
decisionforce / TPN
View on GitHub
[CVPR 2020] Temporal Pyramid Network for Action Recognition
☆394Jan 12, 2021Updated 5 years ago
yitu-opensource / T2T-ViT
View on GitHub
ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
☆1,194Oct 27, 2023Updated 2 years ago
jayleicn / ClipBERT
View on GitHub
[CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…
☆730Aug 8, 2023Updated 2 years ago
sjenni / temporal-ssl
View on GitHub
Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.
☆49Mar 18, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
antoine77340 / S3D_HowTo100M
View on GitHub
S3D Text-Video model trained on HowTo100M using MIL-NCE
☆200Jul 3, 2020Updated 6 years ago
antoine77340 / MIL-NCE_HowTo100M
View on GitHub
PyTorch GPU distributed training code for MIL-NCE HowTo100M
☆221Jul 5, 2022Updated 4 years ago
lucidrains / vit-pytorch
View on GitHub
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…
☆25,444Jun 22, 2026Updated last month
kenshohara / 3D-ResNets-PyTorch
View on GitHub
3D ResNets for Action Recognition (CVPR 2018)
☆4,036Jan 20, 2021Updated 5 years ago
laura-wang / video-pace
View on GitHub
code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction
☆100May 13, 2021Updated 5 years ago
YuqingWang1029 / VisTR
View on GitHub
[CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers
☆757Jul 15, 2021Updated 5 years ago
VideoNetworks / TokShift-Transformer
View on GitHub
☆70Oct 6, 2023Updated 2 years ago
moabitcoin / ig65m-pytorch
View on GitHub
PyTorch 3D video classification models pre-trained on 65 million Instagram videos
☆265Dec 7, 2019Updated 6 years ago
swathikirans / GSM
View on GitHub
Gate-Shift Networks for Video Action Recognition - CVPR 2020
☆149Jun 19, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lucidrains / glom-pytorch
View on GitHub
An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up proc…
☆196Mar 27, 2021Updated 5 years ago
Sense-X / X-Temporal
View on GitHub
A general video understanding codebase from SenseTime X-Lab
☆444Apr 1, 2021Updated 5 years ago
lucidrains / transformer-in-transformer
View on GitHub
Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…
☆306Dec 27, 2021Updated 4 years ago
TengdaHan / CoCLR
View on GitHub
[NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.
☆288Oct 10, 2021Updated 4 years ago
TengdaHan / MemDPC
View on GitHub
[ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.
☆167Apr 29, 2021Updated 5 years ago
open-mmlab / mmaction
View on GitHub
An open-source toolbox for action understanding based on PyTorch
☆1,877Apr 8, 2022Updated 4 years ago
open-mmlab / mmaction2
View on GitHub
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
☆5,108Mar 18, 2026Updated 4 months ago
lucidrains / halonet-pytorch
View on GitHub
Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones
☆199Mar 24, 2021Updated 5 years ago
The-AI-Summer / self-attention-cv
View on GitHub
Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.
☆1,215Sep 14, 2021Updated 4 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
lucidrains / lambda-networks
View on GitHub
Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute
☆1,528Nov 18, 2020Updated 5 years ago
facebookresearch / deit
View on GitHub
Official DeiT repository
☆4,358Mar 15, 2024Updated 2 years ago
VITA-Group / TransGAN
View on GitHub
[NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang
☆1,693Nov 3, 2022Updated 3 years ago
lucidrains / bottleneck-transformer-pytorch
View on GitHub
Implementation of Bottleneck Transformer in Pytorch
☆677Sep 20, 2021Updated 4 years ago
Chuhanxx / Temporal_Query_Networks
View on GitHub
The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding
☆64Mar 9, 2022Updated 4 years ago
lucidrains / transganformer
View on GitHub
Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper
☆155Apr 27, 2021Updated 5 years ago
Andy1621 / CT-Net
View on GitHub
[ICLR2021] official implementation of CT-Net
☆37Dec 29, 2021Updated 4 years ago