GuHuangAI / LaDiWM
Code for "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"
☆21 Updated 2 weeks ago
Alternatives and similar repositories for LaDiWM
Users who are interested in LaDiWM are comparing it to the repositories listed below.
- Official Implementation for “CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World” (RSS 2025). ☆33 Updated 4 months ago
- X-Sim: Cross-Embodiment Learning via Real-to-Sim-to-Real ☆33 Updated last week
- ☆74 Updated 8 months ago
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation ☆26 Updated 2 months ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting ☆40 Updated 11 months ago
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde… ☆46 Updated 3 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning ☆86 Updated 11 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation ☆83 Updated 3 months ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024) ☆22 Updated 8 months ago
- Open-source implementations on real robots ☆34 Updated 9 months ago
- ☆60 Updated last month
- ZeroGrasp: Zero-Shot Shape Reconstruction Enabled Robotic Grasping [CVPR 2025] ☆56 Updated last month
- [ICCV 2025 Spotlight] DexVLG: Dexterous Vision-Language-Grasp Model at Scale ☆34 Updated last month
- [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects ☆33 Updated 3 months ago
- [RA-L 2025] VR-Robo: A Real-to-Sim-to-Real Framework for Visual Robot Navigation and Locomotion ☆106 Updated 2 months ago
- Official implementation of "SUGAR: Pre-training 3D Visual Representations for Robotics" (CVPR'24). ☆42 Updated 2 months ago
- Official implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. (RSS 2025) ☆31 Updated last month
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation ☆102 Updated 5 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream… ☆27 Updated 5 months ago
- ☆84 Updated last week
- [SIGGRAPH Asia 2024 Conference] PC-Planner: Physics-Constrained Self-Supervised Learning for Robust Neural Motion Planning with Shape-Awa… ☆17 Updated 11 months ago
- ☆24 Updated 4 months ago
- Code for ICCV 2023 paper "Multi-Object Navigation with dynamically learned neural implicit representations" ☆12 Updated last year
- [CVPR 2025] RoomTour3D - Geometry-aware, cheap and automatic data from web videos for embodied navigation ☆56 Updated 6 months ago
- PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators ☆96 Updated 9 months ago
- ☆36 Updated 2 months ago
- ✨✨Official implementation of BridgeVLA ☆130 Updated 2 months ago
- ☆36 Updated last month
- ☆57 Updated 8 months ago
- ☆14 Updated 3 months ago