Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]
☆190Mar 12, 2026Updated last week
Alternatives and similar repositories for Spatial-Forcing
Users that are interested in Spatial-Forcing are comparing it to the libraries listed below
Sorting:
- Memory-Dependent Manipulation Benchmark based on RoboTwin☆71Mar 12, 2026Updated last week
- Official repository for the project "TraceGen: World Modeling in 3D Trace-Space Enables Learning from Cross-Embodiment Videos"☆45Feb 20, 2026Updated last month
- https://arxiv.org/pdf/2506.06677☆48Nov 10, 2025Updated 4 months ago
- [arXiv 2024] Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking☆18Apr 4, 2025Updated 11 months ago
- ☆14Jul 28, 2025Updated 7 months ago
- ICCV2025☆163Dec 10, 2025Updated 3 months ago
- ☆56Mar 5, 2026Updated 2 weeks ago
- Official Release of "Mixture of Horizons in Action Chunking"☆45Dec 3, 2025Updated 3 months ago
- [CVPR'2024 Highlight] Gaussian-Flow: 4D Reconstruction with Dynamic 3D Gaussian Particle☆79Mar 22, 2025Updated last year
- ☆28Jan 28, 2026Updated last month
- LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]☆185Mar 12, 2026Updated last week
- Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.☆49Sep 15, 2025Updated 6 months ago
- 🔥 The first open-sourced diffusion vision-langauge-action model. [ICLR 2026]☆164Mar 12, 2026Updated last week
- The official implementation of "Template-free Articulated Gaussian Splatting for Real-time Reposable Dynamic View Synthesis"☆27Jul 27, 2025Updated 7 months ago
- MLA: A Multisensory Language-Action Model for Multimodal Understanding and Forecasting in Robotic Manipulation☆65Nov 10, 2025Updated 4 months ago
- [CoRL25] GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data☆349Dec 29, 2025Updated 2 months ago
- [AAAI 2025] The official implementation for the "Motion Decoupled 3D Gaussian Splatting for Dynamic Object Representation"☆18Jul 18, 2025Updated 8 months ago
- [ICLR 2025🎉] Official implementation for paper "ET-SEED: Efficient Trajectory-Level SE(3) Equivariant Diffusion Policy".☆63Nov 3, 2025Updated 4 months ago
- ☆90Sep 23, 2025Updated 5 months ago
- F3RM: Feature Fields for Robotic Manipulation. Official repo for the paper "Distilled Feature Fields Enable Few-Shot Language-Guided Mani…☆218Apr 26, 2024Updated last year
- Official implementation of "OneTwoVLA: A Unified Vision-Language-Action Model with Adaptive Reasoning"☆217May 30, 2025Updated 9 months ago
- Code for Point Policy: Unifying Observations and Actions with Key Points for Robot Manipulation☆89Jul 21, 2025Updated 8 months ago
- [IROS 2025] ManiGaussian++: General Robotic Bimanual Manipulation with Hierarchical Gaussian World Model☆43Jun 26, 2025Updated 8 months ago
- Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment☆35Feb 24, 2026Updated 3 weeks ago
- [CVPR 2026] Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction☆54Updated this week
- EfficientFlow: Efficient Equivariant Flow Policy Learning for Embodied AI☆24Jan 17, 2026Updated 2 months ago
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆108Jan 27, 2026Updated last month
- ☆27Oct 31, 2025Updated 4 months ago
- Official repository of paper "CoMoVi: Co-Generation of 3D Human Motions and Realistic Videos"☆76Jan 16, 2026Updated 2 months ago
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆270Mar 24, 2025Updated 11 months ago
- ☆22May 30, 2025Updated 9 months ago
- ☆11May 24, 2023Updated 2 years ago
- Dexterous Grasping via Temporal Parametric Optimization and Contact Diffusion☆52Jan 25, 2025Updated last year
- [CVPR 2026] MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE☆132Mar 13, 2026Updated last week
- [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning☆1,508Jan 6, 2026Updated 2 months ago
- Reinforcing Action Policies by Prophesying☆40Nov 26, 2025Updated 3 months ago
- The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)☆112Nov 15, 2025Updated 4 months ago
- [ECCV 2024] Flash-Splat: 3D Reflection Removal with Flash Cues and Gaussian Splats☆27Dec 2, 2025Updated 3 months ago
- ☆69Jan 8, 2025Updated last year