dominickrei / PoseAwareVT
Code for the paper Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers
☆19Updated last month
Related projects: ⓘ
- ☆13Updated last month
- ☆67Updated 8 months ago
- Official codes for the paper "In the Eye of Transformer: Global-Local Correlation for Egocentric Gaze Estimation".☆19Updated 3 weeks ago
- Code release for "EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone" [ICCV, 2023]☆85Updated 2 months ago
- Annotations for the public release of the EPIC-KITCHENS-100 dataset☆127Updated 2 years ago
- Action Scene Graphs for Long-Form Understanding of Egocentric Videos (CVPR 2024)☆25Updated 2 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆27Updated last week
- Code release for ICCV 2021 paper "Anticipative Video Transformer"☆152Updated 2 years ago
- Inceptive Visual Representation Learning with Diverse Attention Across Heads. Image Classification, Action Recognition, and Robot Learnin…☆14Updated this week
- ☆77Updated 2 years ago
- [CVPR 2024 Champions] Solutions for EgoVis Chanllenges in CVPR 2024☆100Updated 2 months ago
- Benchmarking Panoptic Video Scene Graph Generation (PVSG), CVPR'23☆74Updated 4 months ago
- A curated list of papers and resources linked to action anticipation and early action recognition from videos.☆9Updated 3 years ago
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆17Updated 5 months ago
- Video + CLIP Baseline for Ego4D Long Term Action Anticipation Challenge (CVPR 2022)☆13Updated 2 years ago
- Simple PyTorch Dataset for the EPIC-Kitchens-55 and EPIC-Kitchens-100 that handles frames and features (rgb, optical flow, and objects) f…☆22Updated last year
- Code release for "Training a Large Video Model on a Single Machine in a Day"☆107Updated last month
- ☆102Updated 3 months ago
- Python scripts to download Assembly101 from Google Drive☆27Updated last year
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆33Updated last year
- [CVPR 2024] Data and benchmark code for the EgoExoLearn dataset☆43Updated 2 weeks ago
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model'☆30Updated 10 months ago
- Code for ECCV2022 "Real-time Online Video Detection with Temporal Smoothing Transformers"☆100Updated last year
- SLIC: Self-Supervised Learning with Iterative Clustering for Human Action Videos [CVPR 2022]☆19Updated last year
- HT-Step is a large-scale article grounding dataset of temporal step annotations on how-to videos☆16Updated 6 months ago
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…☆70Updated 2 months ago
- [NeurIPS2022] Egocentric Video-Language Pretraining☆222Updated 4 months ago
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆42Updated last year
- Code and models for the Action Recognition benchmark of Assembly101☆8Updated last year
- Code implementation for our ECCV, 2022 paper titled "My View is the Best View: Procedure Learning from Egocentric Videos"☆24Updated 7 months ago