dominickrei / PoseAwareVT
Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆21Updated 7 months ago
Alternatives and similar repositories for PoseAwareVT:
Users that are interested in PoseAwareVT are comparing it to the libraries listed below
- This is the offical repository of LLAVIDAL☆12Updated last week
- A curated list of papers and resources linked to action anticipation and early action recognition from videos.☆9Updated 3 years ago
- [ECCV 2022] Official Pytorch Implementation of the paper : " Zero-Shot Temporal Action Detection via Vision-Language Prompting "☆104Updated last year
- The MECCANO Dataset: official repository in which we provide code and models.☆32Updated last year
- [CVPR 2024] Code and models for pi-ViT, a video transformer for understanding activities of daily living☆19Updated last month
- BEAR: a new BEnchmark on video Action Recognition☆42Updated 11 months ago
- Inceptive Visual Representation Learning with Diverse Attention Across Heads. Image Classification, Action Recognition, and Robot Learnin…☆16Updated 5 months ago
- Pytorch code for Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning, CVPR2022.☆90Updated last year
- The official PyTorch implementation of the IEEE/CVF Computer Vision and Pattern Recognition (CVPR) '24 paper PREGO: online mistake detect…☆21Updated this week
- "Object-Region Video Transformers”, Herzig et al., CVPR 2022☆45Updated 2 years ago
- [CVPR2023] Masked Video Distillation: Rethinking Masked Feature Modeling for Self-supervised Video Representation Learning (https://arxiv…☆124Updated last year
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆13Updated 2 years ago
- [CVPR 2024 Champions][ICLR 2025] Solutions for EgoVis Chanllenges in CVPR 2024☆124Updated 2 weeks ago
- Python scripts to download Assembly101 from Google Drive☆39Updated 5 months ago
- [ICCV'23] Official PyTorch implementation for paper "Exploring Predicate Visual Context in Detecting Human-Object Interactions"☆74Updated 8 months ago
- Actor-agnostic Multi-label Action Recognition with Multi-modal Query [ICCVW '23]☆23Updated last year
- Implementation of "With a Little Help from my Temporal Context: Multimodal Egocentric Action Recognition, BMVC, 2021" in PyTorch☆19Updated 3 years ago
- Official Implementation of the paper "Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Transl…☆36Updated 2 years ago
- [ICCV 2023] How Much Temporal Long-Term Context is Needed for Action Segmentation?☆41Updated 9 months ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆96Updated 8 months ago
- [CVPR 2023] Code for action prediction from videos☆24Updated last year
- ☆11Updated last year
- Code and models for the Action Recognition benchmark of Assembly101☆10Updated 2 years ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆36Updated 2 years ago
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆55Updated 6 months ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆33Updated last year
- Official Repo for CVPR 2024 Paper "FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Fully-Supervised Action Segmentatio…☆59Updated 2 months ago
- [BMVC2022, IJCV2023, Best Student Paper, Spotlight] Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for…☆23Updated last month
- [ICCV 2023] Official implementation of Memory-and-Anticipation Transformer for Online Action Understanding☆46Updated last year
- Annotations for the public release of the EPIC-KITCHENS-100 dataset☆142Updated 2 years ago