EmbodiedGPT / EgoCOT_Dataset
☆41 Updated 7 months ago
Related projects
Alternatives and complementary repositories for EgoCOT_Dataset
- ☆61 Updated last month
- ☆25 Updated last month
- ☆101 Updated 2 weeks ago
- NeurIPS 2022 Paper "VLMbench: A Compositional Benchmark for Vision-and-Language Manipulation" ☆80 Updated last year
- Official Implementation of ReALFRED (ECCV'24) ☆24 Updated last month
- ☆42 Updated 2 years ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation" ☆42 Updated 7 months ago
- ☆30 Updated 3 weeks ago
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World ☆122 Updated 3 weeks ago
- ☆80 Updated last week
- Prompter for Embodied Instruction Following ☆17 Updated 11 months ago
- RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning ☆16 Updated last month
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆122 Updated 5 months ago
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation` ☆61 Updated 5 months ago
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering" ☆35 Updated 4 months ago
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆22 Updated last week
- [CVPR2024] This is the official implementation of MP5 ☆84 Updated 4 months ago
- [CVPR'24 Highlight] The official code and data for paper "EgoThink: Evaluating First-Person Perspective Thinking Capability of Vision-Lan… ☆48 Updated 2 weeks ago
- ☆19 Updated 4 months ago
- Official code for the paper: Embodied Multi-Modal Agent trained by an LLM from a Parallel TextWorld ☆47 Updated last month
- ☆46 Updated 2 months ago
- Codebase for HiP ☆87 Updated 11 months ago
- Source codes for the paper "COMBO: Compositional World Models for Embodied Multi-Agent Cooperation" ☆25 Updated 7 months ago
- Official code release of AAAI 2024 paper SayCanPay. ☆36 Updated 7 months ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning ☆117 Updated last year
- Official implementation of KERM: Knowledge Enhanced Reasoning for Vision-and-Language Navigation (CVPR'23) ☆35 Updated 3 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆36 Updated last year
- [ICML 2024] The official implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning" ☆68 Updated last month
- ☆33 Updated last year
- ☆114 Updated 4 months ago