DeepTimber-Robot-Lab / Tutorial-on-Embodied-AILinks
☆15Updated last year
Alternatives and similar repositories for Tutorial-on-Embodied-AI
Users that are interested in Tutorial-on-Embodied-AI are comparing it to the libraries listed below
Sorting:
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆173Updated 6 months ago
- ☆346Updated this week
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆361Updated 2 months ago
- ☆170Updated 10 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆326Updated 3 months ago
- Ctrl-World: A Controllable Generative World Model for Robot Manipualtion☆244Updated last month
- Towards a Generative 3D World Engine for Embodied Intelligence☆376Updated last week
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆419Updated last month
- VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos☆239Updated 2 weeks ago
- Codebase for Automated Creation of Digital Cousins for Robust Policy Learning☆237Updated 9 months ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆473Updated 8 months ago
- RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation☆275Updated last month
- WoW (World-Omniscient World Model) is a generative world model trained on 2 million robotic interaction trajectories, designed to imagine…☆127Updated last month
- Official Code for EnerVerse-AC: Envisioning EmbodiedEnvironments with Action Condition☆142Updated 5 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆198Updated 8 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆206Updated 2 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆260Updated 9 months ago
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆218Updated 3 weeks ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆171Updated 6 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆319Updated 4 months ago
- Thinking in 360°: Humanoid Visual Search in the Wild☆105Updated last month
- Unified Vision-Language-Action Model☆257Updated 2 months ago
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆121Updated 2 months ago
- [NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge☆265Updated 3 months ago
- A Comprehensive Survey on World Models for Embodied AI☆184Updated 2 months ago
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆409Updated 2 weeks ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model☆152Updated last month
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆125Updated 7 months ago
- [NeurIPS 2025 Spotlight] SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation☆218Updated 6 months ago
- [3DV 2026] SpatialGen: Layout-guided 3D Indoor Scene Generation☆335Updated 2 months ago