DeepTimber-Robot-Lab / Tutorial-on-Embodied-AILinks
☆15Updated last year
Alternatives and similar repositories for Tutorial-on-Embodied-AI
Users that are interested in Tutorial-on-Embodied-AI are comparing it to the libraries listed below
Sorting:
- Codebase for Automated Creation of Digital Cousins for Robust Policy Learning☆238Updated 10 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆369Updated 3 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆426Updated 2 weeks ago
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…☆138Updated last month
- Ctrl-World: A Controllable Generative World Model for Robot Manipualtion☆260Updated 2 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆334Updated 4 months ago
- VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos☆285Updated 2 weeks ago
- Towards a Generative 3D World Engine for Embodied Intelligence☆387Updated last week
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆175Updated 7 months ago
- ☆379Updated last week
- [ICML 2024] LEO: An Embodied Generalist Agent in 3D World☆475Updated 9 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆264Updated 10 months ago
- Causal video-action world model for generalist robot control☆289Updated last week
- ☆169Updated 11 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆617Updated last year
- ☆183Updated 6 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆334Updated 5 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆221Updated 3 months ago
- RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation☆281Updated 2 weeks ago
- A Pragmatic VLA Foundation Model☆683Updated last week
- PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation☆342Updated 3 weeks ago
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆355Updated last month
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆203Updated 9 months ago
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu…☆311Updated last year
- [ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction☆225Updated last year
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆173Updated 7 months ago
- The official implementation of "DynamicVLA: A Vision-Language-Action Model for Dynamic Object Manipulation". (arXiv 2601.22153)☆69Updated last week
- Code&Data for Grounded 3D-LLM with Referent Tokens☆131Updated last year
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆167Updated 3 months ago
- Official implementation of "RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics"☆57Updated 2 weeks ago