voxposer / voxposer.github.io
☆34Updated 10 months ago
Related projects: ⓘ
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆124Updated this week
- Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World☆115Updated 6 months ago
- ☆32Updated last week
- Embodied Chain of Thought: A robotic policy that reason to solve the task.☆53Updated 3 weeks ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆43Updated 5 months ago
- ☆34Updated last year
- ☆42Updated 2 years ago
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆43Updated 3 months ago
- [IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆58Updated 3 weeks ago
- SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World☆74Updated 2 weeks ago
- Official implementation for paper "EquiBot: SIM(3)-Equivariant Diffusion Policy for Generalizable and Data Efficient Learning".☆98Updated 2 months ago
- [arXiv 2023] Embodied Task Planning with Large Language Models☆148Updated last year
- ☆133Updated 2 months ago
- ☆27Updated 3 months ago
- [ICRA 2024] Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models☆51Updated 7 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆63Updated this week
- A Survey of Embodied Learning for Object-Centric Robotic Manipulation☆87Updated 2 weeks ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆69Updated 2 months ago
- [RSS 2024] Learning Manipulation by Predicting Interaction☆78Updated last month
- ☆29Updated this week
- [RSS 2024] "DexCap: Scalable and Portable Mocap Data Collection System for Dexterous Manipulation" code repository☆214Updated last month
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆102Updated 3 months ago
- A Simulation Platform for Embodied AI in Urban Spaces☆71Updated 2 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆314Updated 2 months ago
- Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", …☆61Updated last week
- Code for RoboFlamingo☆286Updated 4 months ago
- This repo contains a curative list of robot learning (mainly for manipulation) resources.☆140Updated 3 weeks ago
- Language instructions to mycobot using GPT-4V☆16Updated 9 months ago
- LLM3: Large Language Model-based Task and Motion Planning with Motion Failure Reasoning☆53Updated 3 months ago
- ☆75Updated last year