chengaopro / Awesome-EmbodiedAI
A curated list about Awesome Embodied AI works and is still in construct. Now it contains a list of Simulators, Tasks and Datasets.
β31Updated 4 years ago
Alternatives and similar repositories for Awesome-EmbodiedAI:
Users that are interested in Awesome-EmbodiedAI are comparing it to the libraries listed below
- Official codebase for EmbCLIPβ117Updated last year
- A Model for Embodied Adaptive Object Detectionβ45Updated 2 years ago
- π Visual Room Rearrangementβ108Updated last year
- β43Updated 2 years ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation"β88Updated 2 years ago
- β44Updated 2 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Dataβ40Updated last year
- β45Updated 11 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoningβ127Updated last year
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"β43Updated 10 months ago
- Episodic Transformer (E.T.) is a novel attention-based architecture for vision-and-language navigation. E.T. is based on a multimodal traβ¦β90Updated last year
- Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"β77Updated 8 months ago
- β12Updated last year
- Code for training embodied agents using imitation learning at scale in Habitat-Labβ37Updated 2 years ago
- [ICCV 2023] Official code repository for ARNOLD benchmarkβ152Updated 11 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoningβ47Updated last month
- This repository is the official implementation of *Silver-Bullet-3D* Solution for SAPIEN ManiSkill Challenge 2021β20Updated 3 years ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D Worldβ126Updated 4 months ago
- [ICCV'23] Learning Vision-and-Language Navigation from YouTube Videosβ51Updated 2 months ago
- β66Updated last year
- Official implementation of the NRNS paperβ36Updated 2 years ago
- Code of the CVPR 2021 Oral paper: A Recurrent Vision-and-Language BERT for Navigationβ162Updated 2 years ago
- Code to evaluate a solution in the BEHAVIOR benchmark: starter code, baselines, submodules to iGibson and BDDL reposβ60Updated 11 months ago
- β61Updated 4 months ago
- Affordance Grounding from Demonstration Video to Target Image (CVPR 2023)β43Updated 7 months ago
- β43Updated 2 years ago
- β29Updated 5 months ago
- Implantation of CtrlFormerβ28Updated 2 years ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videosβ162Updated last month
- Official Repository of NeurIPS2021 paper: PTRβ33Updated 3 years ago