MohitShridhar / ingress
Visual Grounding of Referring Expressions for Human-Robot Interaction
☆26 Updated 5 years ago
Related projects:
- SNARE Dataset with MATCH and LaGOR models ☆23 Updated 5 months ago
- 3D household task-based dataset created using customised AI2-THOR. ☆14 Updated 2 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll… ☆37 Updated 2 months ago
- This repository is the official implementation of *Silver-Bullet-3D* Solution for SAPIEN ManiSkill Challenge 2021 ☆20 Updated 2 years ago
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation" ☆66 Updated 2 months ago
- Learning about objects and their properties by interacting with them ☆12 Updated 3 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos ☆51 Updated 7 months ago
- Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation" ☆62 Updated 4 years ago
- Code for "Learning Affordance Landscapes for Interaction Exploration in 3D Environments" (NeurIPS 20) ☆34 Updated last year
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation" ☆36 Updated 3 years ago
- PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation" ☆19 Updated 3 years ago
- A repo for processing the raw hand object detections to produce releasable pickles + library for using these ☆33 Updated 2 years ago
- public video dqn code ☆24 Updated last year
- Code Repository for Regression Planning Networks ☆59 Updated last month
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra… ☆83 Updated last year
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation ☆118 Updated 11 months ago
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation` ☆43 Updated 2 years ago
- large scale pretrain for navigation task ☆85 Updated last year
- Official code for the ACL 2021 Findings paper "Yichi Zhang and Joyce Chai. Hierarchical Task Learning from Language Instructions with Uni… ☆24 Updated 3 years ago
- Code for ECCV 2020 paper - LEMMA: A Multi-view Dataset for LEarning Multi-agent Multi-task Activities ☆27 Updated 3 years ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation" ☆79 Updated last year
- Evaluating pre-trained navigation agents under corruptions ☆27 Updated 3 years ago
- Codebase for the Airbert paper ☆41 Updated last year
- Official Repository of NeurIPS2021 paper: PTR ☆33 Updated 2 years ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020) ☆29 Updated 2 years ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆34 Updated 10 months ago
- Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation" ☆20 Updated last year