LogosRoboticsGroup / 4D-VLALinks
4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025.
☆38Updated 4 months ago
Alternatives and similar repositories for 4D-VLA
Users that are interested in 4D-VLA are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆168Updated 4 months ago
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆119Updated 5 months ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆100Updated last month
- [NeurIPS 2025]Genesis: Multimodal Driving Scene Generation with Spatio-Temporal and Cross-Modal Consistency☆64Updated last month
- [ICCV 2025] Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding☆62Updated 10 months ago
- Official PyTorch implementation for ICML 2025 paper: UP-VLA.☆49Updated 4 months ago
- ☆70Updated 3 months ago
- ☆56Updated 5 months ago
- Geometry-aware 4D Video Generation for Robot Manipulation☆62Updated 2 months ago
- Open-source implementations on real robots☆34Updated 11 months ago
- Project Page for GaussianFormer☆24Updated last year
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆84Updated 11 months ago
- [RA-L 2024] DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction☆82Updated last year
- ICCV 2025☆15Updated 3 weeks ago
- DSPv2: Improved Dense Policy for Effective and Generalizable Whole-body Mobile Manipulation☆22Updated last month
- ☆36Updated 3 weeks ago
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models☆73Updated 4 months ago
- [ICCV 2025] IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation☆49Updated 3 months ago
- ☆126Updated 9 months ago
- Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver.☆58Updated last month
- [ICCV 2025] Detect Anything 3D in the Wild☆219Updated 4 months ago
- official code of *DOME: Taming Diffusion Model into High-Fidelity Controllable Occupancy World Model*☆51Updated 9 months ago
- [ICCV2025] BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting☆102Updated 2 months ago
- ☆46Updated 6 months ago
- [CVPR 2025]Lift3D Foundation Policy: Lifting 2D Large-Scale Pretrained Models for Robust 3D Robotic Manipulation☆166Updated 4 months ago
- [CVPR 2024] Memory-based Adapters for Online 3D Scene Perception☆121Updated 7 months ago
- ☆94Updated 10 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenes☆64Updated last year
- A Comprehensive Survey on World Models for Embodied AI☆100Updated this week
- [ECCV 2024] Occupancy as Set of Points☆90Updated last year