MohitShridhar / ingress
Visual Grounding of Referring Expressions for Human-Robot Interaction
☆26 Updated 6 years ago
Related projects
Alternatives and complementary repositories for ingress
- SNARE Dataset with MATCH and LaGOR models ☆23 Updated 7 months ago
- ☆11 Updated 6 months ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll… ☆37 Updated 5 months ago
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation" ☆68 Updated 4 months ago
- This repository is the official implementation of *Silver-Bullet-3D* Solution for SAPIEN ManiSkill Challenge 2021 ☆20 Updated 2 years ago
- PyTorch code for the paper: "Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation" ☆19 Updated 3 years ago
- Large-scale pretraining for navigation tasks ☆86 Updated last year
- Code for "Tactical Rewind: Self-Correction via Backtracking in Vision-and-Language Navigation" ☆62 Updated 5 years ago
- PyTorch code for ICLR 2019 paper: Self-Monitoring Navigation Agent via Auxiliary Progress Estimation ☆118 Updated last year
- Learning about objects and their properties by interacting with them ☆12 Updated 4 years ago
- Codebase for the Airbert paper ☆42 Updated last year
- A repo for processing the raw hand object detections to produce releasable pickles + library for using these ☆35 Updated 3 weeks ago
- 3D household task-based dataset created using customised AI2-THOR. ☆14 Updated 2 years ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation" ☆80 Updated last year
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation` ☆44 Updated 2 years ago
- Official code for the ACL 2021 Findings paper "Yichi Zhang and Joyce Chai. Hierarchical Task Learning from Language Instructions with Uni… ☆24 Updated 3 years ago
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation" ☆36 Updated 3 years ago
- Code for RSS 2020 paper: Robot Object Retrieval with Contextual Natural Language Queries ☆14 Updated 2 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra… ☆87 Updated last year
- Code for "Learning Affordance Landscapes for Interaction Exploration in 3D Environments" (NeurIPS 20) ☆34 Updated last year
- Code of the NeurIPS 2021 paper: Language and Visual Entity Relationship Graph for Agent Navigation ☆45 Updated 3 years ago
- PyTorch implementation of the Hiveformer research paper ☆47 Updated last year
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos ☆54 Updated 9 months ago
- Evaluating pre-trained navigation agents under corruptions ☆28 Updated 3 years ago
- Code for training embodied agents using imitation learning at scale in Habitat-Lab ☆34 Updated 2 years ago
- Pushing it out of the Way: Interactive Visual Navigation ☆34 Updated 9 months ago
- Code accompanying EGO-TOPO: Environment Affordances from Egocentric Video (CVPR 2020) ☆29 Updated 2 years ago
- ☆59 Updated 2 years ago
- Code and data of the Fine-Grained R2R Dataset proposed in the EMNLP 2021 paper Sub-Instruction Aware Vision-and-Language Navigation ☆43 Updated 3 years ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆36 Updated last year