HaoyiZhu / SPA
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆172Jun 19, 2025Updated 7 months ago
Alternatives and similar repositories for SPA
Users who are interested in SPA are comparing it to the repositories listed below
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆90Oct 14, 2024Updated last year
- Open-source implementations on real robots☆34Nov 25, 2024Updated last year
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆572Oct 26, 2025Updated 3 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆509Dec 4, 2024Updated last year
- [ICLR 2025🎉] This is the official implementation of paper "Robots Pre-Train Robots: Manipulation-Centric Robotic Representation from Lar…☆89Jan 22, 2025Updated last year
- ICCV 2025 | TesserAct: Learning 4D Embodied World Models☆379Aug 4, 2025Updated 6 months ago
- The official repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)☆110Nov 15, 2025Updated 2 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆101Oct 6, 2025Updated 4 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆43Aug 9, 2025Updated 6 months ago
- Official implementation of IROS 2025 paper Pseudo Depth Meets Gaussian: A Feed-forward RGB SLAM Baseline☆50Aug 11, 2025Updated 6 months ago
- [CVPR 2025] Tra-MoE: Learning Trajectory Prediction Model from Multiple Domains for Adaptive Policy Conditioning☆55Apr 1, 2025Updated 10 months ago
- [T-PAMI 2025] PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigm☆370Sep 30, 2025Updated 4 months ago
- ☆432Nov 29, 2025Updated 2 months ago
- [CVPR 2025] Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆176Jun 20, 2025Updated 7 months ago
- Official implementation of Continuous 3D Perception Model with Persistent State☆1,329Aug 27, 2025Updated 5 months ago
- [NeurIPS'24] NeuRodin: A Two-stage Framework for High-Fidelity Neural Surface Reconstruction☆125Sep 26, 2024Updated last year
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆87Nov 16, 2025Updated 2 months ago
- [ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning☆1,630Jan 28, 2026Updated 2 weeks ago
- [CoRL 2023 Oral] GNFactor: Multi-Task Real Robot Learning with Generalizable Neural Feature Fields☆138Dec 28, 2023Updated 2 years ago
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆93Jun 6, 2025Updated 8 months ago
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface☆150Oct 17, 2024Updated last year
- [CoRL 24] GenDP: 3D Semantic Fields for Category-Level Generalizable Diffusion Policy☆106Oct 24, 2024Updated last year
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆427Jan 7, 2026Updated last month
- DeepVerse: 4D Autoregressive Video Generation as a World Model☆213Aug 11, 2025Updated 6 months ago
- [ICLR 2025 Oral] Seer: Predictive Inverse Dynamics Models are Scalable Learners for Robotic Manipulation☆277Jul 8, 2025Updated 7 months ago
- [RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations☆1,243Oct 17, 2025Updated 3 months ago
- [ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time☆613May 7, 2025Updated 9 months ago
- [CoRL 24 Oral] D^3Fields: Dynamic 3D Descriptor Fields for Zero-Shot Generalizable Rearrangement☆180Nov 2, 2024Updated last year
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆227Oct 28, 2025Updated 3 months ago
- Code of WinT3R: Window-Based Streaming Reconstruction With Camera Token Pool☆218Sep 21, 2025Updated 4 months ago
- Official PyTorch Implementation of Unified Video Action Model (RSS 2025)☆331Jul 23, 2025Updated 6 months ago
- [NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats☆519Oct 14, 2025Updated 4 months ago
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆134Apr 4, 2025Updated 10 months ago
- Official repo and evaluation implementation of VSI-Bench☆670Aug 5, 2025Updated 6 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆162May 30, 2025Updated 8 months ago
- FieldGen is a semi-automatic data generation framework that enables scalable collection of diverse, high-quality real-world manipulation …☆25Oct 28, 2025Updated 3 months ago
- [CVPR 2025] "DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion" official implementation.☆181Jul 7, 2025Updated 7 months ago
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes. Accepted at RSS 2025.☆645Jun 23, 2025Updated 7 months ago
- Code for NeurIPS 2022 paper "Learning Viewpoint-Agnostic Visual Representations by Recovering Tokens in 3D Space"☆20Apr 20, 2023Updated 2 years ago