UMass-Foundation-Model / MultiPLY
Code for MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World
☆121Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for MultiPLY
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆57Updated 5 months ago
- ☆29Updated 2 weeks ago
- Code&Data for Grounded 3D-LLM with Referent Tokens☆89Updated last month
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆87Updated last month
- ☆73Updated this week
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆158Updated 3 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆52Updated 2 weeks ago
- [ICLR 2023] SQA3D for embodied scene understanding and reasoning☆117Updated last year
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆160Updated 3 weeks ago
- ☆35Updated this week
- ☆44Updated last month
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)☆84Updated 4 months ago
- Official repository of Learning to Act from Actionless Videos through Dense Correspondences.☆168Updated 6 months ago
- ☆76Updated 2 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆42Updated 6 months ago
- [RSS 2024] Learning Manipulation by Predicting Interaction☆89Updated 2 months ago
- A repository accompanying the PARTNR benchmark for using Large Planning Models (LPMs) to solve Human-Robot Collaboration or Robot Instruc…☆48Updated this week
- ☆63Updated 2 weeks ago
- LLaRA: Large Language and Robotics Assistant☆153Updated last month
- This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and fol…☆121Updated 2 months ago
- [CoRL 2024] RoboEXP: Action-Conditioned Scene Graph via Interactive Exploration for Robotic Manipulation☆80Updated last month
- [ICCV 2023] Official code repository for ARNOLD benchmark☆138Updated 7 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆174Updated this week
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation'☆121Updated 4 months ago
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"☆34Updated 4 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆179Updated 6 months ago
- Official implementation of GR-MG☆40Updated 2 weeks ago
- SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World☆90Updated this week
- Official implementation of "Data Scaling Laws in Imitation Learning for Robotic Manipulation"☆68Updated 2 weeks ago
- [IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆72Updated 2 months ago