4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025.
☆48Jan 10, 2026Updated last month
Alternatives and similar repositories for 4D-VLA
Users that are interested in 4D-VLA are comparing it to the libraries listed below
Sorting:
- From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…☆78Jan 5, 2026Updated last month
- ☆43Jun 3, 2025Updated 8 months ago
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆56Mar 18, 2025Updated 11 months ago
- [ICLR 2025] GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting☆148Mar 18, 2025Updated 11 months ago
- [ICCV 2025] Driving Scene Synthesis on Free-form Trajectories with Generative Prior☆39Jun 28, 2025Updated 8 months ago
- [ICCV 2025] Official repo of "EC-Flow: Enabling Versatile Robotic Manipulation from Action-Unlabeled Videos via Embodiment-Centric Flow"☆27Oct 16, 2025Updated 4 months ago
- [CVPR 2025] Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning☆93Apr 7, 2025Updated 10 months ago
- Code accompanying the paper entitled LEVIO: Lightweight Embedded Visual Inertial Odometry for Resource-Constrained Devices☆33Updated this week
- [CVPR 2025] TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering☆15Sep 20, 2025Updated 5 months ago
- FieldGen is a semi-automatic data generation framework that enables scalable collection of diverse, high-quality real-world manipulation …☆25Oct 28, 2025Updated 4 months ago
- [CVPR'2025] "DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation"☆20Jul 3, 2025Updated 7 months ago
- This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Updated this week
- [ICRA 2026] 🌠 DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation☆29Jan 14, 2026Updated last month
- Galaxea's first diffusion policy release☆38Aug 18, 2025Updated 6 months ago
- NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards☆93Jan 11, 2026Updated last month
- Pi0-VLA Repository of "MotionTrans: Human VR Data Enable Motion-Level Learning for Robotic Manipulation Policies"☆26Sep 25, 2025Updated 5 months ago
- ☆14Jul 6, 2025Updated 7 months ago
- ☆43Updated this week
- Extended implementation of RoboDexVLM (IROS 2025)☆31Nov 13, 2025Updated 3 months ago
- learn TensorRT from scratch🥰☆18Sep 29, 2024Updated last year
- AnyPos: Automated Task-Agnostic Actions for Bimanual Manipulation☆35Jul 25, 2025Updated 7 months ago
- SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving☆42Jan 16, 2026Updated last month
- [WACV 2025 Oral] Transferring Foundation Models for Generalizable Robotic Manipulation☆26Mar 28, 2025Updated 11 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆60Aug 19, 2025Updated 6 months ago
- [ICCV2025] BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting☆127Sep 3, 2025Updated 5 months ago
- N2M: Bridging Navigation and Manipulation by Learning Initial Pose Preference from Rollout☆28Nov 21, 2025Updated 3 months ago
- [CoRL 2025] GC-VLN: Instruction as Graph Constraints for Training-free Vision-and-Language Navigation☆63Sep 16, 2025Updated 5 months ago
- [AAAI 2026] Official code for MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Man…☆65Jul 31, 2025Updated 7 months ago
- [ICCV 2025] DyWA:Dynamics-adaptive World Action Model for Generalizable Non-prehensile Manipulation☆74Sep 23, 2025Updated 5 months ago
- SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation☆25Jan 18, 2026Updated last month
- VLA-JEPA: Enhancing Vision-Language-Action Model with Latent World Model☆58Feb 15, 2026Updated 2 weeks ago
- REALM: A Real-to-Sim Validated Benchmark for Generalization in Robotic Manipulation☆46Updated this week
- AC-DiT: Adaptive Coordination Diffusion Transformer for Mobile Manipulation☆31Updated this week
- [CoRL 2025] Robot Learning from Any Images☆34Nov 11, 2025Updated 3 months ago
- Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis (ICCV 2025, Oral)☆39Oct 31, 2025Updated 4 months ago
- Code for "ACG: Action Coherence Guidance for Flow-based Vision-Language-Action Models" (ICRA 2026)☆61Feb 21, 2026Updated last week
- Official code for AAAI 2026 paper (One-Step Generative Policies with Q-Learning: A Reformulation of MeanFlow)☆23Dec 15, 2025Updated 2 months ago
- Code & data for "RoboGround: Robotic Manipulation with Grounded Vision-Language Priors" (CVPR 2025)☆38May 25, 2025Updated 9 months ago
- [ICLR 2026] From Seeing to Experiencing: Scaling Navigation Foundation Models with Reinforcement Learning☆50Feb 13, 2026Updated 2 weeks ago