DirtyHarryLYL / HAKE-AVA
☆27Updated 2 months ago
Alternatives and similar repositories for HAKE-AVA:
Users that are interested in HAKE-AVA are comparing it to the libraries listed below
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556)☆42Updated 2 months ago
- Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight)☆43Updated 2 years ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio…☆29Updated last year
- Bidirectional Mapping between Action Physical-Semantic Space☆31Updated 8 months ago
- [ICLR 2023 spotlight] Official PyTorch implementation of the paper "Stochastic Multi-Person 3D Motion Forecasting"☆53Updated last year
- [CVPR2022] SVIP: Sequence VerIfication for Procedures in Videos☆22Updated 2 years ago
- [CVPR 2023] Detecting Human-Object Contact in Images☆55Updated last year
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation☆58Updated last year
- [ICCV2023] EgoObjects: A Large-Scale Egocentric Dataset for Fine-Grained Object Understanding☆76Updated last year
- ☆41Updated last week
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆36Updated 2 years ago
- Context-Aware Sequence Alignment using 4D Skeletal Augmentation CVPR 2022☆23Updated 2 years ago
- The official code for [ACM MM 2022] 'In-N-Out Generative Learning for Dense Unsupervised Video Segmentation'.☆20Updated 2 years ago
- [ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation☆22Updated last year
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆60Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024☆51Updated last year
- ☆76Updated 2 years ago
- MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations☆34Updated 6 months ago
- TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer☆47Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- Code for recreating the HoS benchmark of VISOR☆21Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries.☆32Updated 2 years ago
- [CVPR 2022 Oral] Versatile Multi-Modal Pre-Training for Human-Centric Perception☆121Updated 2 years ago
- ☆25Updated last year
- [CVPR 2022] Understanding 3D Object Articulation in Internet Videos☆31Updated last year
- ☆34Updated last year
- HInt dataset from HaMeR: Reconstructing Hands in 3D with Transformers☆43Updated last year
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆18Updated 10 months ago
- ☆10Updated 7 months ago
- Code for the paper: F. Ragusa, G. M. Farinella, A. Furnari. StillFast: An End-to-End Approach for Short-Term Object Interaction Anticipat…☆11Updated 2 years ago