MINT-SJTU / Evo-VLALinks
Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.
☆30Updated 2 months ago
Alternatives and similar repositories for Evo-VLA
Users that are interested in Evo-VLA are comparing it to the libraries listed below
Sorting:
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde…☆46Updated 3 months ago
- ☆26Updated 4 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆102Updated 5 months ago
- code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"☆24Updated 3 weeks ago
- Geometry-aware 4D Video Generation for Robot Manipulation☆59Updated last month
- ☆75Updated 8 months ago
- Official Implementation for “CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World” (RSS 2025).☆32Updated 5 months ago
- A curated list of awesome exploration policy papers.☆11Updated 2 months ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆130Updated 3 weeks ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆41Updated last month
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆27Updated 5 months ago
- [SIGGRAPH Asia 2024 Conference] PC-Planner: Physics-Constrained Self-Supervised Learning for Robust Neural Motion Planning with Shape-Awa…☆17Updated 11 months ago
- ☆49Updated last week
- ☆63Updated last month
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model☆19Updated 5 months ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆82Updated 3 months ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆70Updated 6 months ago
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction☆71Updated 2 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆86Updated 11 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆165Updated 3 months ago
- ☆76Updated last month
- Code for "RL-GSBridge: 3D Gaussian Splatting Based Real2Sim2Real Method for Robotic Manipulation Learning"☆44Updated 4 months ago
- ☆26Updated 9 months ago
- [RSS 2025] PIN-WM : Learning Physics-INformed World Models for Non-Prehensile Manipulation☆34Updated last month
- [IROS 2024] Incrementally Building Room-Scale Language-Embedded Gaussian Splats (LEGS) with a Mobile Robot☆35Updated 4 months ago
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆28Updated 7 months ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆22Updated 8 months ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting☆40Updated 11 months ago
- Dynamic 3D Gaussian Scene Graphs for Environment Adaptation☆48Updated 3 months ago
- [IROS 2025] DynamicPose: Real-time and Robust 6D Object Pose Tracking for Fast-Moving Cameras and Objects☆33Updated 3 months ago