DeepTimber-Robot-Lab / Tutorial-on-Embodied-AILinks
☆15Updated last year
Alternatives and similar repositories for Tutorial-on-Embodied-AI
Users that are interested in Tutorial-on-Embodied-AI are comparing it to the libraries listed below
Sorting:
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆465Updated 5 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆326Updated 3 months ago
- Codebase for Automated Creation of Digital Cousins for Robust Policy Learning☆230Updated 6 months ago
- ☆167Updated 7 months ago
- ☆287Updated last week
- Towards a Generative 3D World Engine for Embodied Intelligence☆317Updated 2 weeks ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆578Updated 11 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆242Updated 6 months ago
- Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective☆283Updated 3 months ago
- The official repo for "SpatialBot: Precise Spatial Understanding with Vision Language Models.☆309Updated last month
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆363Updated 3 months ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆162Updated 3 months ago
- WorldVLA: Towards Autoregressive Action World Model☆445Updated last week
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆190Updated 5 months ago
- [NeurIPS 2025] Official implementation of "RoboRefer: Towards Spatial Referring with Reasoning in Vision-Language Models for Robotics"☆173Updated 2 weeks ago
- [ICCV 2025] Aether: Geometric-Aware Unified World Modeling☆498Updated 3 months ago
- RynnVLA-001: Using Human Demonstrations to Improve Robot Manipulation☆238Updated 3 weeks ago
- ☆176Updated 2 weeks ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆411Updated 2 months ago
- A curated list of large VLM-based VLA models for robotic manipulation.☆203Updated 2 weeks ago
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆374Updated last week
- InternVLA-M1: A Spatially Guided Vision-Language-Action Framework for Generalist Robot Policy☆129Updated this week
- Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations https://video-prediction-policy.github.io☆276Updated 5 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆278Updated last month
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆167Updated 4 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆181Updated this week
- ☆172Updated 2 months ago
- Virtual Community: An Open World for Humans, Robots, and Society☆175Updated last week
- Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks☆174Updated 3 weeks ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆276Updated 2 months ago