allenai / interactron
A Model for Embodied Adaptive Object Detection
☆46 Updated 3 years ago
Alternatives and similar repositories for interactron
Users who are interested in interactron are comparing it to the repositories listed below.
- IMProv: Inpainting-based Multimodal Prompting for Computer Vision Tasks ☆58 Updated last year
- This repository is the official implementation of Improving Object-centric Learning With Query Optimization ☆51 Updated 2 years ago
- General-purpose Visual Understanding Evaluation ☆20 Updated last year
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll… ☆38 Updated last year
- Official Repository of NeurIPS2021 paper: PTR ☆32 Updated 3 years ago
- [ICLR 2022] RelViT: Concept-guided Vision Transformer for Visual Relational Reasoning ☆63 Updated 3 years ago
- Official codebase for EmbCLIP ☆131 Updated 2 years ago
- ☆45 Updated last year
- 🔀 Visual Room Rearrangement ☆122 Updated 2 years ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning ☆148 Updated last year
- 🐍 A Python Package for Seamless Data Distribution in AI Workflows ☆26 Updated last year
- EgoTV Egocentric Task Verification from Natural Language Task Descriptions ☆27 Updated last year
- ☆23 Updated 2 years ago
- ☆83 Updated last month
- [CVPR 2022 (oral)] Bongard-HOI for benchmarking few-shot visual reasoning ☆72 Updated 2 years ago
- ☆172 Updated 2 years ago
- SNARE Dataset with MATCH and LaGOR models ☆24 Updated last year
- The repository of ECCV 2020 paper `Active Visual Information Gathering for Vision-Language Navigation` ☆43 Updated 3 years ago
- ☆54 Updated last year
- Code for NeurIPS 2022 Datasets and Benchmarks paper - EgoTaskQA: Understanding Human Tasks in Egocentric Videos. ☆35 Updated 2 years ago
- A curated list of Awesome Embodied AI works, still under construction. It currently contains a list of Simulators, Tasks and Datasets. ☆30 Updated 5 years ago
- ☆127 Updated last year
- A collection of 3D vision and language (e.g., 3D Visual Grounding, 3D Question Answering and 3D Dense Caption) papers and datasets. ☆100 Updated 2 years ago
- (NeurIPS 2022) Self-Supervised Visual Representation Learning with Semantic Grouping ☆97 Updated 6 months ago
- InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation ☆43 Updated 3 weeks ago
- ☆11 Updated last year
- ☆73 Updated 3 years ago
- [CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning ☆36 Updated 3 years ago
- Pytorch Code and Data for EnvEdit: Environment Editing for Vision-and-Language Navigation (CVPR 2022) ☆30 Updated 3 years ago
- Dataset and baseline for Scenario Oriented Object Navigation (SOON) ☆18 Updated 3 years ago