yiyixuxu / TimeSformer-rolled-attention
Visualizing the learned space-time attention using Attention Rollout
☆32Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for TimeSformer-rolled-attention
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆42Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆107Updated last year
- Code Release for MeMViT Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆144Updated last year
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆32Updated 8 months ago
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆152Updated 2 years ago
- ☆68Updated last year
- [CVPR'23] AdaMAE: Adaptive Masking for Efficient Spatiotemporal Learning with Masked Autoencoders☆72Updated 9 months ago
- ☆52Updated 2 years ago
- Implementations of Transformers for Video☆24Updated 3 years ago
- An unofficial implementation of TubeViT in "Rethinking Video ViTs: Sparse Video Tubes for Joint Image and Video Learning"☆85Updated last month
- Future Transformer for Long-term Action Anticipation (CVPR 2022)☆47Updated last year
- Official Pytorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Updated 2 years ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆114Updated 3 months ago
- Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.☆86Updated last year
- BEAR: a new BEnchmark on video Action Recognition☆42Updated 6 months ago
- ☆67Updated 10 months ago
- [AAAI 2023 (Oral)] CrissCross: Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Synchronicity☆23Updated last year
- [BMVC 2021]: Official PyTorch implementation of : "Few Shot Temporal Action Localization using Query Adaptive Transformers"☆20Updated 2 years ago
- ☆51Updated 3 years ago
- Video Test-Time Adaptation for Action Recognition (CVPR 2023)☆34Updated 3 weeks ago
- Implementation of STAM (Space Time Attention Model), a pure and simple attention model that reaches SOTA for video classification☆129Updated 3 years ago
- ☆17Updated 7 months ago
- ☆25Updated last year
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆103Updated last year
- Official PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'☆29Updated last year
- Code + pre-trained models for the paper Keeping Your Eye on the Ball Trajectory Attention in Video Transformers☆224Updated 2 years ago
- [CVPR 2023] Code for action prediction from videos☆23Updated 8 months ago
- [ECCV 2022] Is Appearance Free Action Recognition Possible?☆58Updated 7 months ago
- [CVPR 2023] HierVL Learning Hierarchical Video-Language Embeddings☆44Updated last year
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆19Updated last month