2644521362 / SC-MLLMLinks

☆18

Alternatives and similar repositories for SC-MLLM

Users that are interested in SC-MLLM are comparing it to the libraries listed below

Sorting:

liufanfanlff / RoboUniview
☆57Updated 7 months ago
Hoyyyaard / 3DFlowAction
☆36Updated 3 months ago
aiming-lab / GRAPE
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆143Updated 6 months ago
declare-lab / Emma-X
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial Reasoning
☆74Updated 5 months ago
Dantong88 / LLARVA
☆57Updated 10 months ago
vlc-robot / hiveformer
☆33Updated last year
pickxiguapi / Embodied-R1
Official code for "Embodied-R1: Reinforced Embodied Reasoning for General Robotic Manipulation"
☆86Updated last month
RoboDita / Dita
ICCV2025
☆135Updated last month
pipixiaqishi1 / SAM-E
☆44Updated last year
refkxh / C-Instructor
[ECCV 2024] Official implementation of C-Instructor: Controllable Navigation Instruction Generation with Chain of Thought Prompting
☆25Updated 10 months ago
bytedance / GR-MG
Official implementation of GR-MG
☆89Updated 9 months ago
OpenDriveLab / CLOVER
[NeurIPS 2024] CLOVER: Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation
☆129Updated last month
Stanford-ILIAD / explore-eqa
Public release for "Explore until Confident: Efficient Exploration for Embodied Question Answering"
☆67Updated last year
clorislili / ManipLLM
The official codebase for ManipLLM: Embodied Multimodal Large Language Model for Object-Centric Robotic Manipulation(cvpr 2024)
☆141Updated last year
Max-Fu / otter
[ICML 2025] OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction
☆106Updated 6 months ago
OpenDriveLab / MPI
[RSS 2024] Learning Manipulation by Predicting Interaction
☆115Updated 3 months ago
Hoyyyaard / NavGPT
☆10Updated last year
TencentARC / Moto
[ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos
☆135Updated 2 weeks ago
hume-vla / hume
🦾 A Dual-System VLA with System2 Thinking
☆112Updated last month
SiyuanHuang95 / ManipVQA
[IROS24 Oral]ManipVQA: Injecting Robotic Affordance and Physically Grounded Information into Multi-Modal Large Language Models
☆97Updated last year
sled-group / RACER
[ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning
☆36Updated last year
moka-manipulation / moka
MOKA: Open-World Robotic Manipulation through Mark-based Visual Prompting (RSS 2024)
☆86Updated last year
XinyuSun / FGPrompt
official implementation of NeurIPS 2023 paper "FGPrompt: Fine-grained Goal Prompting for Image-goal Navigation"
☆36Updated last year
InternRobotics / InternVLA-A1
InternVLA-A1: Unifying Understanding, Generation, and Action for Robotic Manipulation
☆45Updated last month
EmbodiedCity / Embodied-R.code
☆83Updated 5 months ago
Koorye / Inspire
Official implemetation of the paper "InSpire: Vision-Language-Action Models with Intrinsic Spatial Reasoning"
☆44Updated 2 weeks ago
mees / hulc2
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
☆46Updated last year
sled-group / navchat
Code for ICRA24 paper "Think, Act, and Ask: Open-World Interactive Personalized Robot Navigation" Paper//arxiv.org/abs/2310.07968 …
☆31Updated last year
yueyang130 / DeeR-VLA
Official code of paper "DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution"
☆111Updated 8 months ago
Li-ChangHao / CoNav
☆11Updated last year