EmbodiedGPT / EmbodiedGPT_Pytorch
☆339Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for EmbodiedGPT_Pytorch
- RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins☆392Updated last week
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆164Updated last month
- Chain-of-Spot: Interactive Reasoning Improves Large Vision-language Models☆85Updated 8 months ago
- Codebase for the 'BestMan' Mobile Manipulator☆164Updated this week
- This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…☆123Updated 3 months ago
- WorldGPT: Empowering LLM as Multimodal World Model☆123Updated 3 months ago
- [arXiv 2023] Embodied Task Planning with Large Language Models☆156Updated last year
- ☆101Updated 2 weeks ago
- Universal Visual Decomposer: Long-Horizon Manipulation Made Easy☆46Updated last month
- Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model☆334Updated 4 months ago
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆93Updated 2 months ago
- ☆41Updated 7 months ago
- Align 3D Point Cloud with Multi-modalities for Large Language Models☆417Updated 11 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆85Updated 4 months ago
- [ICCV23] Bird’s-Eye-View Scene Graph for Vision-Language Navigation☆117Updated 7 months ago
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆61Updated 5 months ago
- ☆77Updated last year
- [ICML 2024] The offical Implementation of "DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning"☆68Updated last month
- [CVPR24] Volumetric Environment Representation for Vision-Language Navigation☆77Updated 2 months ago
- [ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization☆563Updated 5 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆173Updated 6 months ago
- [CVPR2024] This is the official implement of MP5☆84Updated 4 months ago
- [ECCV 2024] Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?☆149Updated last month
- Code for RoboFlamingo☆312Updated 6 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆184Updated 7 months ago
- Repository of our CVPR2023 paper "Lana: A Language-Capable Navigator for Instruction Following and Generation"☆97Updated last year
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆122Updated 3 weeks ago
- Heterogeneous Pre-trained Transformer (HPT) as Scalable Policy Learner.☆390Updated this week
- A curated list of awesome papers on Embodied AI and related research/industry-driven resources.☆289Updated 3 months ago
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆124Updated 5 months ago