allenai / interactron
A Model for Embodied Adaptive Object Detection
β45Updated 2 years ago
Alternatives and similar repositories for interactron:
Users that are interested in interactron are comparing it to the libraries listed below
- π A Python Package for Seamless Data Distribution in AI Workflowsβ22Updated last year
- Official Repository of NeurIPS2021 paper: PTRβ33Updated 3 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Follβ¦β37Updated 10 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoningβ131Updated last year
- β17Updated 2 years ago
- A curated list about Awesome Embodied AI works and is still in construct. Now it contains a list of Simulators, Tasks and Datasets.β31Updated 4 years ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal traβ¦β90Updated last year
- π Visual Room Rearrangementβ113Updated last year
- SNARE Dataset with MATCH and LaGOR modelsβ24Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimizationβ50Updated last year
- β53Updated 3 years ago
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation`β44Updated 3 years ago
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasksβ58Updated 7 months ago
- β25Updated last year
- Official codebase for EmbCLIPβ123Updated last year
- β42Updated last year
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoningβ63Updated 2 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioningβ35Updated 2 years ago
- Official implementation of Learning from Unlabeled 3D Environments for Vision-and-Language Navigation (ECCV'22).β41Updated 2 years ago
- [NeurIPS 2023] OV-PARTS: Towards Open-Vocabulary Part Segmentationβ81Updated 10 months ago
- β22Updated 3 years ago
- SAT: 2D Semantics Assisted Training for 3D Visual Grounding, ICCV 2021 (Oral)β33Updated 3 years ago
- β60Updated 3 years ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022)β31Updated 2 years ago
- Can 3D Vision-Language Models Truly Understand Natural Language?β21Updated last year
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets.β97Updated 2 years ago
- [AAAI 2023 Oral] Language-Assisted 3D Feature Learning for Semantic Scene Understandingβ12Updated last year
- Codebase for the Airbert paperβ45Updated 2 years ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)β44Updated 9 months ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"β91Updated 2 years ago