lhc1224 / Cross-View-AG
Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022
☆49Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Cross-View-AG
- LOCATE: Localize and Transfer Object Parts for Weakly Supervised Affordance Grounding (CVPR 2023)☆33Updated last year
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆39Updated 3 months ago
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)☆17Updated 3 months ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆72Updated 4 months ago
- ☆33Updated 3 weeks ago
- [ICRA 2024] Language-Conditioned Affordance-Pose Detection in 3D Point Clouds☆22Updated last month
- This the official repository of OCL (ICCV 2023).☆18Updated 7 months ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆54Updated 9 months ago
- ☆17Updated 3 months ago
- [IROS 2023] Open-Vocabulary Affordance Detection in 3d Point Clouds☆53Updated 2 months ago
- CVPR 2024 "Instance Tracking in 3D Scenes from Egocentric Videos"☆16Updated 4 months ago
- ☆26Updated last month
- Code and data release for the paper "Learning Object State Changes in Videos: An Open-World Perspective" (CVPR 2024)☆30Updated 2 months ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆54Updated last year
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆59Updated last month
- An Examination of the Compositionality of Large Generative Vision-Language Models☆20Updated 7 months ago
- Official implementation of "Towards Generalizable Vision-Language Robotic Manipulation: A Benchmark and LLM-guided 3D Policy."☆25Updated last month
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆34Updated last year
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆36Updated last year
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆53Updated last month
- ☆32Updated last year
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆117Updated last year
- ☆13Updated last week
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24).☆28Updated 4 months ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videos☆41Updated last year
- ☆25Updated last year
- PyTorch implementation of Robot Latent Diffusion☆13Updated 3 months ago
- ☆45Updated last month
- ☆12Updated 5 months ago
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆95Updated last year