GeWu-Lab / MS-BotLinks
The offical repo for "Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation", CoRL 2024 (ORAL)
☆14Updated 3 weeks ago
Alternatives and similar repositories for MS-Bot
Users that are interested in MS-Bot are comparing it to the libraries listed below
Sorting:
- [ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction☆88Updated 3 months ago
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization☆133Updated 3 months ago
- [IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models☆97Updated 10 months ago
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning☆68Updated 2 months ago
- ☆56Updated this week
- ☆42Updated 9 months ago
- [CoRL2024] Official repo of `A3VLM: Actionable Articulation-Aware Vision Language Model`☆114Updated 9 months ago
- Code for Reinforcement Learning from Vision Language Foundation Model Feedback☆114Updated last year
- 🦾 A Dual-System VLA with System2 Thinking☆66Updated last week
- ☆49Updated 7 months ago
- ☆33Updated 9 months ago
- [IROS 2025] Human Demo Videos to Robot Action Plans☆56Updated last month
- [CVPR 2024] Binding Touch to Everything: Learning Unified Multimodal Tactile Representations☆54Updated 5 months ago
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation☆28Updated 8 months ago
- ☆26Updated last year
- [CoRL 2023] XSkill: cross embodiment skill discovery☆64Updated last year
- ☆68Updated 8 months ago
- [ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning"☆84Updated 2 months ago
- An official implementation of Touch100k: A Large-Scale Touch-Language-Vision Dataset for Touch-Centric Multimodal Representation☆26Updated last year
- Manipulate-Anything: Automating Real-World Robots using Vision-Language Models [CoRL 2024]☆36Updated 3 months ago
- [CoRL 2023] REFLECT: Summarizing Robot Experiences for Failure Explanation and Correction☆95Updated last year
- ☆64Updated 5 months ago
- The repo of paper `RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation`☆128Updated 6 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation"☆44Updated last year
- Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"☆99Updated 5 months ago
- Official Repository of SAM2Act☆102Updated 2 weeks ago
- ☆18Updated last year
- [ICML 2025] Rethinking Latent Redundancy in Behavior Cloning: An Information Bottleneck Approach for Robot Manipulation☆27Updated 2 months ago
- [ICRA 2025] In-Context Imitation Learning via Next-Token Prediction☆84Updated 4 months ago
- ☆75Updated 10 months ago