yiyixuxu / TimeSformer-rolled-attention
Visualizing the learned space-time attention using Attention Rollout
☆34Updated 2 years ago
Alternatives and similar repositories for TimeSformer-rolled-attention:
Users who are interested in TimeSformer-rolled-attention are comparing it to the repositories listed below
- ☆66Updated last year
- ☆54Updated 3 years ago
- Future Transformer for Long-term Action Anticipation (CVPR 2022)☆49Updated 2 years ago
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆152Updated 3 years ago
- [arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"☆122Updated 6 months ago
- Official code implementation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆21Updated 4 months ago
- ☆26Updated last year
- Official code repo for TCLR: Temporal Contrastive Learning for Video Representation [CVIU-2022]☆37Updated 11 months ago
- Official PyTorch Implementation of Relational Self-Attention, NeurIPS 2021☆49Updated 3 years ago
- Implementations of Transformers for Video☆23Updated 3 years ago
- "Object-Region Video Transformers", Herzig et al., CVPR 2022☆43Updated 2 years ago
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆106Updated last year
- BEAR: a new BEnchmark on video Action Recognition☆42Updated 10 months ago
- [CVPR 2023] Code for action prediction from videos☆23Updated 11 months ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆116Updated last year
- ☆70Updated last year
- ☆18Updated 10 months ago
- The official project website of "Ske2Grid: Skeleton-to-Grid Representation Learning for Action Recognition" (The paper of Ske2Grid is pub…☆20Updated last year
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆41Updated 8 months ago
- ☆16Updated 2 years ago
- Code + pre-trained models for the paper Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers☆227Updated 2 years ago
- ☆15Updated last month
- Code Release for MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition, CVPR 2022☆148Updated 2 years ago
- [ICCV 2021] MGSampler: An Explainable Sampling Strategy for Video Action Recognition☆48Updated 2 years ago
- PyTorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.☆90Updated last year
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆45Updated last year
- Official PyTorch code for the ICIP 2021 paper 'Syntactically Guided Generative Embeddings For Zero Shot Skeleton Action Recognition'☆29Updated last year
- The implementation of CVPR2021 paper Temporal Query Networks for Fine-grained Video Understanding☆61Updated 2 years ago
- ☆32Updated 2 years ago
- ☆33Updated 9 months ago