MINT-SJTU / Evo-VLALinks
Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.
☆37Updated 3 months ago
Alternatives and similar repositories for Evo-VLA
Users that are interested in Evo-VLA are comparing it to the libraries listed below
Sorting:
- [RA-L] Lost & Found dynamically tracks object poses from egocentric videos while updating a scene graph, enabling richer semantic 3D unde…☆49Updated 2 weeks ago
- A curated list of awesome exploration policy papers.☆11Updated 3 months ago
- [ICCV 2025] GLEAM: Learning Generalizable Exploration Policy for Active Mapping in Complex 3D Indoor Scene☆135Updated 3 weeks ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆43Updated 2 months ago
- Geometry-aware 4D Video Generation for Robot Manipulation☆60Updated last month
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction☆71Updated 3 months ago
- ☆84Updated 9 months ago
- code for CoRL2025 "LaDiWM: A Latent Diffusion-based World Model for Predictive Manipulation"☆30Updated last week
- ☆32Updated 5 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆167Updated 3 months ago
- [RA-L 2025] Dynamic Open-Vocabulary 3D Scene Graphs for Long-term Language-Guided Mobile Manipulation☆107Updated 6 months ago
- AIR-Embodied: An Efficient Active 3DGS-based Interaction and Reconstruction Framework with Embodied Large Language Model☆19Updated 5 months ago
- [IROS 2024] Incrementally Building Room-Scale Language-Embedded Gaussian Splats (LEGS) with a Mobile Robot☆35Updated 5 months ago
- [SIGGRAPH Asia 2024 Conference] PC-Planner: Physics-Constrained Self-Supervised Learning for Robust Neural Motion Planning with Shape-Awa…☆17Updated last year
- [RAL 2024] OpenObj: Open-Vocabulary Object-Level Neural Radiance Fields with Fine-Grained Understanding☆28Updated 8 months ago
- Dynamic 3D Gaussian Scene Graphs for Environment Adaptation☆49Updated this week
- Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior"☆97Updated 6 months ago
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆28Updated 6 months ago
- Official Implementation for “CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World” (RSS 2025).☆35Updated 5 months ago
- [NeurIPS2024] Multiview Scene Graph (topologically representing a scene from unposed images by interconnected place and object nodes)☆120Updated 3 weeks ago
- [ICRA 2024 Oral] Open-Fusion: Real-time Open-Vocabulary 3D Mapping and Queryable Scene Representation☆133Updated last year
- ☆41Updated 7 months ago
- Official implementation of Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation (CoRL'24).☆72Updated 7 months ago
- ☆38Updated 5 months ago
- ☆68Updated 2 months ago
- [ICCV2025] Extrapolated Urban View Synthesis Benchmark☆44Updated 2 weeks ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆18Updated 2 weeks ago
- Splat-MOVER: Multi-Stage, Open-Vocabulary Robotic Manipulation via Editable Gaussian Splatting☆40Updated last year
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.☆55Updated 2 weeks ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆114Updated 4 months ago