allenai / interactron
A Model for Embodied Adaptive Object Detection
☆45Updated 2 years ago
Alternatives and similar repositories for interactron
Users who are interested in interactron are comparing it to the repositories listed below.
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization☆50Updated 2 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks☆57Updated 9 months ago
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`☆44Updated 3 years ago
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning☆71Updated 2 years ago
- Official codebase for EmbCLIP☆126Updated 2 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning☆63Updated 2 years ago
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows☆22Updated last year
- ☆22Updated 2 years ago
- ☆42Updated last year
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆38Updated last year
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆135Updated last year
- Code and Data for our CVPR 2021 paper "Structured Scene Memory for Vision-Language Navigation"☆39Updated 3 years ago
- Repository of our accepted CVPR2022 paper "Counterfactual Cycle-Consistent Learning for Instruction Following and Generation in Vision-La…☆28Updated 3 years ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)☆32Updated 2 years ago
- A curated list of Awesome Embodied AI works, still under construction. It currently contains a list of Simulators, Tasks and Datasets.☆31Updated 4 years ago
- ☆73Updated 3 years ago
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)☆33Updated 3 years ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON)☆18Updated 3 years ago
- Official Repository of NeurIPS2021 paper: PTR☆33Updated 3 years ago
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.☆98Updated 2 years ago
- Code accompanying our ECCV-2020 paper on 3D Neural Listeners.☆129Updated 4 years ago
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"☆79Updated last year
- ☆60Updated 3 years ago
- ☆49Updated last year
- ☆13Updated last year
- 🔀 Visual Room Rearrangement☆118Updated last year
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning☆36Updated 2 years ago
- Official implementation of Layout-aware Dreamer for Embodied Referring Expression Grounding [AAAI 23].☆17Updated 2 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal tra…☆90Updated 2 years ago
- SNARE Dataset with MATCH and LaGOR models☆24Updated last year