dominickrei / PoseAwareVT
Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆21Updated 8 months ago
Alternatives and similar repositories for PoseAwareVT:
Users that are interested in PoseAwareVT are comparing it to the libraries listed below
- A curated list of papers and resources linked to action anticipation and early action recognition from videos.☆9Updated 3 years ago
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆20Updated 2 months ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆56Updated 7 months ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆13Updated 2 years ago
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆23Updated 2 weeks ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆24Updated 2 months ago
- Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.☆90Updated last year
- This is the offical repository of LLAVIDAL☆14Updated last month
- Python scripts to download Assembly101 from Google Drive☆41Updated 6 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆97Updated 9 months ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Updated last year
- Inceptive Visual Representation Learning with Diverse Attention Across Heads. Image Classification, Action Recognition, and Robot Learnin…☆16Updated this week
- [CVPR 2023] Code for action prediction from videos☆25Updated last year
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆125Updated last year
- Official code implemtation of paper AntGPT: Can Large Language Models Help Long-term Action Anticipation from Videos?☆21Updated 7 months ago
- This is an official implementation of TubeR: Tubelet Transformer for Video Action Detection☆78Updated 2 years ago
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆45Updated last year
- BEAR: a new BEnchmark on video Action Recognition☆43Updated last year
- [ICCVW 2023] Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection☆20Updated last year
- MAtch, eXpand and Improve: Unsupervised Finetuning for Zero-Shot Action Recognition with Language Knowledge (ICCV 2023)☆30Updated last year
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆24Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆59Updated 3 months ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆36Updated 2 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆104Updated last year
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Updated 2 years ago
- Integrating Human Gaze into Attention for Egocentric Activity Recognition (WACV 2021)☆25Updated last year
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Updated 2 years ago
- Code for Diffusion Action Segmentation (ICCV 2023)☆61Updated last year
- The MECCANO Dataset: official repository in which we provide code and models.☆32Updated last year
- [NeurIPS 2021 Spotlight] Official implementation of Long Short-Term Transformer for Online Action Detection☆133Updated 9 months ago