fkenghagho / RobotVQALinks
RobotVQA is a project that develops a Deep Learning-based Cognitive Vision System to support household robots' perception while they perfom human-scale daily manipulation tasks like cooking in a normal kitchen. The system relies on dense description of objects in the scene and their relationships
☆18Updated last year
Alternatives and similar repositories for RobotVQA
Users that are interested in RobotVQA are comparing it to the libraries listed below
Sorting:
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"☆98Updated 8 months ago
- code for TIDEE: Novel Room Reorganization using Visuo-Semantic Common Sense Priors☆40Updated 2 years ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆45Updated last year
- Codebase for HiP☆90Updated 2 years ago
- Instruction Following Agents with Multimodal Transforemrs☆53Updated 3 years ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data☆45Updated 2 years ago
- Voltron Evaluation: Diverse Evaluation Tasks for Robotic Representation Learning☆37Updated 2 years ago
- General-purpose Visual Understanding Evaluation☆20Updated 2 years ago
- 🔀 Visual Room Rearrangement☆124Updated 2 years ago
- ☆33Updated last year
- Implementation of Language-Conditioned Path Planning (Amber Xie, Youngwoon Lee, Pieter Abbeel, Stephen James)☆25Updated 2 years ago
- Code for the paper Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration☆105Updated 3 years ago
- ☆46Updated 2 years ago
- Official codebase for EmbCLIP☆131Updated 2 years ago
- ☆56Updated last year
- PyTorch implementation of the Hiveformer research paper☆49Updated 2 years ago
- Chain-of-Thought Predictive Control☆57Updated 2 years ago
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"☆87Updated last year
- ☆44Updated 3 years ago
- Code to evaluate a solution in the BEHAVIOR benchmark: starter code, baselines, submodules to iGibson and BDDL repos☆69Updated last year
- This repository is the official implementation of *Silver-Bullet-3D* Solution for SAPIEN ManiSkill Challenge 2021☆20Updated 4 years ago
- Code and models of MOCA (Modular Object-Centric Approach) proposed in "Factorizing Perception and Policy for Interactive Instruction Foll…☆40Updated last year
- ☆33Updated last year
- ☆45Updated 2 years ago
- Repository of our accepted NeurIPS-2022 paper "Towards Versatile Embodied Navigation"☆21Updated 3 years ago
- Code for "Learning Affordance Landscapes for Interaction Exploration in 3D Environments" (NeurIPS 20)☆38Updated 2 years ago
- Code for the RSS 2023 paper "Energy-based Models are Zero-Shot Planners for Compositional Scene Rearrangement"☆21Updated 2 years ago
- ☆38Updated 2 years ago
- [CVPR 2022] Joint hand motion and interaction hotspots prediction from egocentric videos☆71Updated 2 years ago
- Visual Grounding of Referring Expressions for Human-Robot Interaction☆26Updated 7 years ago