xulingjing88 / WSMA
[AAAI 2024]Weakly Supervised Multimodal Affordance Grounding for Egocentric Images
☆12Updated 5 months ago
Alternatives and similar repositories for WSMA:
Users that are interested in WSMA are comparing it to the libraries listed below
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆57Updated 5 months ago
- OpenScan: A Benchmark for Generalized Open-Vocabulary 3D Scene Understanding☆18Updated last week
- One-Shot Open Affordance Learning with Foundation Models (CVPR 2024)☆32Updated 8 months ago
- ☆29Updated 8 months ago
- (ECCV 2024) Official repository of paper "EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding"☆28Updated 2 weeks ago
- For Ego4D VQ3D Task☆19Updated last year
- [NeurIPS 2024] Official code repository for MSR3D paper☆50Updated last week
- ☆25Updated last year
- [ECCV 2024] OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models☆44Updated 3 months ago
- This is the official PyTorch implementation of the CVPR 2023 paper: "GeoVLN: Learning Geometry-Enhanced Visual Representation with Slot A…☆7Updated last year
- This the official repository of OCL (ICCV 2023).☆20Updated last year
- [ACL2023] Official code repository for VLN-Trans☆13Updated last year
- ☆15Updated 10 months ago
- [ECCV 2024] Empowering 3D Visual Grounding with Reasoning Capabilities☆71Updated 6 months ago
- Official implementation of Language Conditioned Spatial Relation Reasoning for 3D Object Grounding (NeurIPS'22).☆60Updated 2 years ago
- Accepted by CVPR 2024☆33Updated 11 months ago
- Code of the ICCV 2023 paper "March in Chat: Interactive Prompting for Remote Embodied Referring Expression"☆26Updated 11 months ago
- Official Pytorch implementation for NeurIPS 2022 paper "Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigati…☆30Updated 2 years ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)☆43Updated 9 months ago
- Official repository of DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models☆84Updated 7 months ago
- Official implementation of "Grounded Entity-Landmark Adaptive Pre-training for Vision-and-Language Navigation" (ICCV 2023 Oral)☆18Updated last year
- [CVPR 2024] Code for HiKER-SGG: Hierarchical Knowledge Enhanced Robust Scene Graph Generation☆70Updated 6 months ago
- Official PyTorch implementation of EgoChoir: Capturing 3D Human-Object Interaction Regions from Egocentric Views☆29Updated 7 months ago
- Official implementation of Human-Aware Vision-and-Language Navigation: Bridging Simulation to Reality with Dynamic Human Interactions (Ne…☆36Updated 4 months ago
- ☆21Updated 11 months ago
- ☆120Updated last year
- [ECCV2024, Oral, Best Paper Finalist]This is the official implementation of the paper "LEGO: Learning EGOcentric Action Frame Generation …☆37Updated 2 months ago
- [CVPR 2025] Official PyTorch Implementation of GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmenta…☆34Updated last week
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding (AAAI'23).☆17Updated 2 years ago
- [ECCV 2024] EgoCVR: An Egocentric Benchmark for Fine-Grained Composed Video Retrieval☆35Updated 2 weeks ago