2644521362 / SC-MLLM
☆18 Updated 9 months ago
Alternatives and similar repositories for SC-MLLM:
Users who are interested in SC-MLLM are comparing it to the repositories listed below.
- ☆41 Updated 5 months ago
- ☆44 Updated 2 months ago
- ☆50 Updated 3 weeks ago
- [IROS24 Oral] ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models ☆85 Updated 6 months ago
- Code for "Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation" ☆43 Updated 10 months ago
- [NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation ☆99 Updated 3 months ago
- [RSS 2024] Learning Manipulation by Predicting Interaction ☆101 Updated 6 months ago
- The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation (CVPR 2024) ☆116 Updated 8 months ago
- Official implementation of GR-MG ☆75 Updated 2 months ago
- [ICRA2023] Grounding Language with Visual Affordances over Unstructured Data ☆41 Updated last year
- ☆28 Updated 5 months ago
- ☆31 Updated last year
- MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024) ☆71 Updated 7 months ago
- ☆62 Updated 3 weeks ago
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning ☆26 Updated 5 months ago
- Latent Motion Token as the Bridging Language for Robot Manipulation ☆74 Updated last month
- GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization ☆90 Updated last month
- Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering" ☆43 Updated 8 months ago
- Reimplementation of GR-1, a generalized policy for robotics manipulation. ☆121 Updated 6 months ago
- Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation". Paper: arxiv.org/abs/2310.07968 … ☆27 Updated 8 months ago
- This is the official repo for [CoRL 2024] Contrastive Imitation Learning for Language-guided Multi-Task Robotic Manipulation ☆23 Updated 4 months ago
- ☆45 Updated 11 months ago
- ☆66 Updated last week
- Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning ☆48 Updated last month
- [CVPR'2024] "SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution" ☆57 Updated 5 months ago
- [ICLR 25] Code for "Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning" ☆45 Updated 3 weeks ago
- Official codebase for EmbCLIP ☆118 Updated last year
- ☆35 Updated 3 months ago