xulingjing88 / WSMA
[AAAI 2024]Weakly Supervised Multimodal Affordance Grounding for Egocentric Images
☆11Updated 2 months ago
Alternatives and similar repositories for WSMA:
Users that are interested in WSMA are comparing it to the libraries listed below
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆51Updated 2 months ago
- ☆25Updated last year
- Official Implementation of Frequency-enhanced Data Augmentation for Vision-and-Language Navigation (NeurIPS2023)☆13Updated last year
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)☆22Updated 6 months ago
- ☆15Updated last month
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆25Updated 8 months ago
- ☆21Updated 5 months ago
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆28Updated last year
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆42Updated 6 months ago
- ☆40Updated last year
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆64Updated 3 months ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆57Updated 2 years ago
- AnyBimanual: Transfering Unimanual Policy for General Bimanual Manipulation☆60Updated last week
- Accepted by CVPR 2024☆30Updated 8 months ago
- [ICCV2023] CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection☆11Updated 7 months ago
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆37Updated 3 weeks ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆24Updated last month
- official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"☆30Updated last year
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆81Updated 4 months ago
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding (AAAI'23).☆16Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆38Updated last year
- SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆31Updated 3 weeks ago
- Official REVERIE Grounding Model of REVERIE Challenge @ CSIG 2022☆19Updated 2 years ago
- ☆48Updated 3 months ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆16Updated last year
- Official Implementation of CAPEAM (ICCV'23)☆11Updated 2 months ago
- [NeurIPS 2024] Official code repository for MSR3D paper☆33Updated last week
- ☆42Updated last month
- [CVPR 2024] Visual Programming for Zero-shot Open-Vocabulary 3D Visual Grounding☆47Updated 5 months ago
- Latent Motion Token as the Bridging Language for Robot Manipulation☆67Updated last month