Implementation of TimeSformer from Facebook AI, a pure attention-based solution for video classification
☆729Aug 25, 2021Updated 4 years ago
Alternatives and similar repositories for TimeSformer-pytorch
Users that are interested in TimeSformer-pytorch are comparing it to the libraries listed below.
- The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"☆1,845Apr 9, 2024Updated 2 years ago
- The notebook explains the various steps to obtain the results of publication: "Is Space-Time Attention All You Need for Video Understandi…☆42Mar 19, 2021Updated 5 years ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆134Apr 1, 2021Updated 5 years ago
- Official implementation of "An Image is Worth 16x16 Words, What is a Video Worth?" (2021 paper)☆221Aug 23, 2022Updated 3 years ago
- This is an official implementation for "Video Swin Transformers".☆1,647Mar 8, 2023Updated 3 years ago
- Implementation of ViViT: A Video Vision Transformer☆556Jun 21, 2021Updated 4 years ago
- PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.☆7,339Mar 16, 2026Updated last month
- A deep learning library for video understanding research.☆3,555Jan 12, 2026Updated 3 months ago
- [CVPR 2021] TDN: Temporal Difference Networks for Efficient Action Recognition☆382Sep 17, 2022Updated 3 years ago
- [ICCV 2019] TSM: Temporal Shift Module for Efficient Video Understanding☆2,202Jul 11, 2024Updated last year
- [CVPR 2020] Temporal Pyramid Network for Action Recognition☆392Jan 12, 2021Updated 5 years ago
- ICCV2021, Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet☆1,195Oct 27, 2023Updated 2 years ago
- [CVPR 2021 Best Student Paper Honorable Mention, Oral] Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning…☆730Aug 8, 2023Updated 2 years ago
- Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.☆49Mar 18, 2021Updated 5 years ago
- S3D Text-Video model trained on HowTo100M using MIL-NCE☆200Jul 3, 2020Updated 5 years ago
- PyTorch GPU distributed training code for MIL-NCE HowTo100M☆219Jul 5, 2022Updated 3 years ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆25,043Apr 6, 2026Updated last week
- 3D ResNets for Action Recognition (CVPR 2018)☆4,043Jan 20, 2021Updated 5 years ago
- code for our ECCV-2020 paper: Self-supervised Video Representation Learning by Pace Prediction☆100May 13, 2021Updated 4 years ago
- [CVPR2021 Oral] End-to-End Video Instance Segmentation with Transformers☆756Jul 15, 2021Updated 4 years ago
- ☆71Oct 6, 2023Updated 2 years ago
- PyTorch 3D video classification models pre-trained on 65 million Instagram videos☆265Dec 7, 2019Updated 6 years ago
- Gate-Shift Networks for Video Action Recognition - CVPR 2020☆149Jun 19, 2020Updated 5 years ago
- A general video understanding codebase from SenseTime X-Lab☆444Apr 1, 2021Updated 5 years ago
- An attempt at the implementation of Glom, Geoffrey Hinton's new idea that integrates concepts from neural fields, top-down-bottom-up proc…☆196Mar 27, 2021Updated 5 years ago
- [NeurIPS'20] Self-supervised Co-Training for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆289Oct 10, 2021Updated 4 years ago
- Implementation of Transformer in Transformer, pixel level attention paired with patch level attention for image classification, in Pytorc…☆310Dec 27, 2021Updated 4 years ago
- [ECCV'20 Spotlight] Memory-augmented Dense Predictive Coding for Video Representation Learning. Tengda Han, Weidi Xie, Andrew Zisserman.☆167Apr 29, 2021Updated 4 years ago
- An open-source toolbox for action understanding based on PyTorch☆1,876Apr 8, 2022Updated 4 years ago
- Implementation of the 😇 Attention layer from the paper, Scaling Local Self-Attention For Parameter Efficient Visual Backbones☆201Mar 24, 2021Updated 5 years ago
- OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark☆4,986Mar 18, 2026Updated last month
- Implementation of LambdaNetworks, a new approach to image recognition that reaches SOTA with less compute☆1,530Nov 18, 2020Updated 5 years ago
- Implementation of various self-attention mechanisms focused on computer vision. Ongoing repository.☆1,216Sep 14, 2021Updated 4 years ago
- Implementation of Bottleneck Transformer in Pytorch☆677Sep 20, 2021Updated 4 years ago
- Official DeiT repository☆4,338Mar 15, 2024Updated 2 years ago
- [NeurIPS‘2021] "TransGAN: Two Pure Transformers Can Make One Strong GAN, and That Can Scale Up", Yifan Jiang, Shiyu Chang, Zhangyang Wang☆1,692Nov 3, 2022Updated 3 years ago
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆64Mar 9, 2022Updated 4 years ago
- Implementation of TransGanFormer, an all-attention GAN that combines the finding from the recent GanFormer and TransGan paper☆155Apr 27, 2021Updated 4 years ago
- [ICLR2021] official implementation of CT-Net☆37Dec 29, 2021Updated 4 years ago