DeepTimber-Robot-Lab / Tutorial-on-Embodied-AILinks
☆14Updated last year
Alternatives and similar repositories for Tutorial-on-Embodied-AI
Users that are interested in Tutorial-on-Embodied-AI are comparing it to the libraries listed below
Sorting:
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆302Updated last month
- Towards a Generative 3D World Engine for Embodied Intelligence☆274Updated last week
- [ICCV 2025] Aether: Geometric-Aware Unified World Modeling☆429Updated last month
- [ECCV 2024] ShapeLLM: Universal 3D Object Understanding for Embodied Interaction☆202Updated 10 months ago
- Codebase for Automated Creation of Digital Cousins for Robust Policy Learning☆219Updated 4 months ago
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Langu …☆303Updated last year
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆233Updated 4 months ago
- Paper list in the survey: A Survey on Vision-Language-Action Models: An Action Tokenization Perspective☆178Updated last month
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆252Updated 4 months ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence☆315Updated last month
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆235Updated last week
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆153Updated last month
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 4 months ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆160Updated 2 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆163Updated last month
- ☆167Updated 2 weeks ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆451Updated 3 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆338Updated last week
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆143Updated 2 months ago
- ☆163Updated 5 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆553Updated 9 months ago
- [arXiv 2025] MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence☆47Updated this week
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆184Updated 3 months ago
- Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment…☆580Updated this week
- [ICLR 2024 spotlight] Official implementation of "InstructScene: Instruction-Driven 3D Indoor Scene Synthesis with Semantic Graph Prior".☆129Updated last week
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆39Updated 8 months ago
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆22Updated 2 months ago
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆142Updated 2 months ago
- Orient Anything, ICML 2025☆304Updated 2 months ago
- List of papers on 4D Generation.☆289Updated 10 months ago