WayneMao / RoboMatrix
The Official Implementation of RoboMatrix
☆90 Updated 4 months ago
Alternatives and similar repositories for RoboMatrix
Users who are interested in RoboMatrix are comparing it to the repositories listed below
- ☆59 Updated 2 weeks ago
- ☆52 Updated 2 months ago
- Official GitHub Repository for Paper "Bridging Zero-shot Object Navigation and Foundation Models through Pixel-Guided Navigation Skill", … ☆99 Updated 6 months ago
- ☆137 Updated last month
- ☆135 Updated last month
- HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model ☆203 Updated 2 weeks ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024) ☆131 Updated 10 months ago
- Find What You Want: Learning Demand-conditioned Object Attribute Space for Demand-driven Navigation ☆59 Updated 4 months ago
- [IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models ☆90 Updated 8 months ago
- [RSS 2024] Learning Manipulation by Predicting Interaction ☆107 Updated 8 months ago
- Code of the paper "NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning" (TPAMI 2025) ☆68 Updated last month
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions ☆71 Updated this week
- [TMLR 2024] repository for VLN with foundation models ☆106 Updated last month
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution" ☆91 Updated 3 months ago
- SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation ☆149 Updated last week
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation ☆113 Updated 5 months ago
- ☆55 Updated 2 months ago
- A comprehensive list of excellent research papers, models, datasets, and other resources on Vision-Language-Action (VLA) models in roboti… ☆107 Updated this week
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation ☆173 Updated 3 weeks ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model` ☆110 Updated 7 months ago
- Code for LGX (Language Guided Exploration). We use LLMs to perform embodied robot navigation in a zero-shot manner. ☆60 Updated last year
- Towards Long-Horizon Vision-Language Navigation: Platform, Benchmark and Method (CVPR-25) ☆74 Updated last week
- Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024] ☆28 Updated last month
- Vision-Language Navigation Benchmark in Isaac Lab ☆163 Updated last month
- [CVPR 2024] The code for paper 'Towards Learning a Generalist Model for Embodied Navigation' ☆41 Updated last year
- Official implementation of GR-MG ☆79 Updated 4 months ago
- Official implementation of OpenFMNav: Towards Open-Set Zero-Shot Object Navigation via Vision-Language Foundation Models ☆39 Updated 7 months ago
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering" ☆55 Updated 10 months ago
- PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators ☆76 Updated 5 months ago
- ☆75 Updated 3 weeks ago