Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
☆87Jun 6, 2025Updated 11 months ago
Alternatives and similar repositories for VeBrain
Users that are interested in VeBrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of SPGrasp: A framework for dynamic grasp synthesis from sparse spatiotemporal prompts.☆20Jan 6, 2026Updated 4 months ago
- Code&Data for Grounded 3D-LLM with Referent Tokens☆134Jan 5, 2025Updated last year
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆44Dec 9, 2024Updated last year
- [CVPR 2025] Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training☆109Jul 18, 2025Updated 10 months ago
- LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians☆25Jan 10, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs…☆54Apr 10, 2026Updated last month
- KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation☆22Apr 23, 2025Updated last year
- MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation☆70Nov 10, 2025Updated 6 months ago
- Official repo and evaluation implementation of VSI-Bench☆708Aug 5, 2025Updated 9 months ago
- ☆22Jul 22, 2025Updated 9 months ago
- [CVPR 2025] The offical Implementation of "Universal Actions for Enhanced Embodied Foundation Models"☆240Nov 6, 2025Updated 6 months ago
- Annotated Tutorial for PerAct