DirtyHarryLYL / HAKE-AVA
☆28 · Updated 3 months ago
Alternatives and similar repositories for HAKE-AVA
Users who are interested in HAKE-AVA are comparing it to the repositories listed below.
- Repo for "Human-Centric Foundation Models: Perception, Generation and Agentic Modeling" (https://arxiv.org/abs/2502.08556) ☆49 · Updated 4 months ago
- ☆26 · Updated 3 years ago
- Official Repository for "Diffusion HPC: Generate Synthetic Data for Human Mesh Recovery in Challenging Domains" (3DV 2024 Spotlight) ☆43 · Updated 2 years ago
- [CVPR 2023] Detecting Human-Object Contact in Images ☆55 · Updated last year
- TORE: Token Reduction for Efficient Human Mesh Recovery with Transformer ☆47 · Updated last year
- [ICCV 2023] Global Adaptation meets Local Generalization: Unsupervised Domain Adaptation for 3D Human Pose Estimation ☆23 · Updated last year
- The 1st place solution of 2022 Ego4d Natural Language Queries. ☆32 · Updated 2 years ago
- Code for the paper "Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundatio… ☆29 · Updated last year
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model" ☆62 · Updated last year
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision" ☆27 · Updated last year
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection" ☆37 · Updated 2 years ago
- Repository of our paper 'Refer-it-in-RGBD' in CVPR 2021 ☆40 · Updated last year
- Implementation of paper 'Helping Hands: An Object-Aware Ego-Centric Video Recognition Model' ☆33 · Updated last year
- Code for the paper "GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos" published at CVPR 2024 ☆52 · Updated last year
- ☆26 · Updated last year
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization ☆21 · Updated 2 months ago
- [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image" ☆22 · Updated last year
- The repository contains the official implementation of "DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery" ☆23 · Updated 11 months ago
- MotionBank: A Large-scale Video Motion Benchmark with Disentangled Rule-based Annotations ☆35 · Updated 8 months ago
- Official code of paper "PGT: A Progressive Method for Training Models on Long Videos" on CVPR2021 ☆30 · Updated 4 years ago
- CVPR2021: Detecting Human-Object Interaction via Fabricated Compositional Learning ☆15 · Updated 3 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆63 · Updated 2 years ago
- Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation ☆58 · Updated 2 years ago
- ☆48 · Updated last month
- [ACM MM 2023] Official implementation of paper "Language-guided Human Motion Synthesis with Atomic Actions". ☆29 · Updated 11 months ago
- [NeurIPS 2022] Official implementation of the paper "Rethinking Resolution in the Context of Efficient Video Recognition". ☆31 · Updated 2 years ago
- Code for recreating the HoS benchmark of VISOR ☆22 · Updated last year
- Python scripts to download Assembly101 from Google Drive ☆45 · Updated 8 months ago
- Official Code for the NeurIPS'23 paper "3D-Aware Visual Question Answering about Parts, Poses and Occlusions" ☆17 · Updated 8 months ago
- Code for our paper: Learning Camera Movement Control from Real-World Drone Videos ☆29 · Updated 2 months ago