lhc1224 / Cross-View-AG
Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022
☆45Updated 3 months ago
Related projects:
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆38Updated last month
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆30Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆51Updated 7 months ago
- An Examination of the Compositionality of Large Generative Vision-Language Models☆20Updated 5 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆70Updated 2 months ago
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆27Updated last week
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆27Updated last week
- Code implementation for paper titled "HOI-Ref: Hand-Object Interaction Referral in Egocentric Vision"☆17Updated 5 months ago
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆95Updated last year
- [ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds☆17Updated this week
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆50Updated last year
- This is the official repository of OCL (ICCV 2023).☆17Updated 5 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆115Updated 11 months ago
- 🔍 Explore Egocentric Vision: research, data, challenges, real-world apps. Stay updated & contribute to our dynamic repository! Work-in-p…☆70Updated 2 months ago
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆32Updated last year
- [ECCV2022] A PyTorch implementation of the paper "Spatial and Visual Perspective-Taking via View Rotation and Relation Reasoning for Embo…☆13Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 5 months ago
- Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"☆42Updated last year
- PyTorch implementation of One-Shot Affordance Detection☆59Updated 2 weeks ago
- For Ego4D VQ3D Task☆16Updated 8 months ago
- Code for ECCV2022 Paper "Mining Cross-Person Cues for Body-Part Interactiveness Learning in HOI Detection"☆33Updated last year
- Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2…☆11Updated 3 months ago