allenai / interactron
A Model for Embodied Adaptive Object Detection
☆42Updated 2 years ago
Related projects: ⓘ
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆115Updated 11 months ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON)☆17Updated 2 years ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentation☆70Updated 2 months ago
- ☆13Updated last year
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).☆32Updated last year
- Can 3D Vision-Language Models Truly Understand Natural Language?☆20Updated 5 months ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 5 months ago
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 2 years ago
- ☆34Updated 4 months ago
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"☆36Updated 3 years ago
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆50Updated last year
- Generalization Beyond Data Imbalance: A Controlled Study on CLIP for Transferable Insights☆16Updated 3 months ago
- ☆22Updated 2 years ago
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`☆43Updated 2 years ago
- Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling @ CVPR22☆42Updated last year
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆95Updated last year
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…☆28Updated 2 years ago
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)☆30Updated 2 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning☆33Updated 2 years ago
- ☆25Updated 11 months ago
- ☆67Updated last year
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"☆66Updated 2 months ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆37Updated 2 months ago
- Pytorch implementation of One-Shot Affordance Detection☆59Updated 2 weeks ago
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆19Updated 9 months ago
- ☆20Updated 3 months ago
- Official Implementation of Learning Navigational Visual Representations with Semantic Map Supervision (ICCV2023)☆24Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- Official PyTorch Implementation of Learning Affordance Grounding from Exocentric Images, CVPR 2022☆45Updated 3 months ago
- ☆32Updated 5 months ago