LogosRoboticsGroup/4D-VLA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LogosRoboticsGroup/4D-VLA)

LogosRoboticsGroup / 4D-VLA

4D-VLA: Spatiotemporal Vision-Language-Action Pretraining with Cross-Scene Calibration. Accepted to NeurIPS 2025.

☆57

Alternatives and similar repositories for 4D-VLA

Users that are interested in 4D-VLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fudan-zvg / UniUGG
View on GitHub
UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding. Accepted to ICLR 2026.
☆63Updated this week
LogosRoboticsGroup / SPAR
View on GitHub
From Flatland to Space (SPAR). Accepted to NeurIPS 2025 Datasets & Benchmarks. A large-scale dataset & benchmark for 3D spatial perceptio…
☆90Jan 5, 2026Updated 6 months ago
LogosRoboticsGroup / Polaris
View on GitHub
[ICRA 2026] Relative Position Matters: Trajectory Prediction and Planning with Polar Representation
☆15Feb 5, 2026Updated 5 months ago
LogosRoboticsGroup / DeFi
View on GitHub
[ICLR 2026] Disentangled Robot Learning via Separate Forward and Inverse Dynamics Pretraining
☆30Apr 26, 2026Updated 2 months ago
LogosRoboticsGroup / ProphRL
View on GitHub
Reinforcing Action Policies by Prophesying
☆42Nov 26, 2025Updated 7 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
fudan-zvg / RealEngine
View on GitHub
☆47Jun 3, 2025Updated last year
fudan-zvg / tensoflow
View on GitHub
[CVPR 2025] TensoFlow: Tensorial Flow-based Sampler for Inverse Rendering
☆15Sep 20, 2025Updated 10 months ago
Zhangwenyao1 / DreamVLA
View on GitHub
[NeurIPS 2025] DreamVLA: A Vision-Language-Action Model Dreamed with Comprehensive World Knowledge
☆362Jan 6, 2026Updated 6 months ago
fudan-zvg / BezierGS
View on GitHub
[ICCV2025] BézierGS: Dynamic Urban Scene Reconstruction with Bézier Curve Gaussian Splatting
☆137Sep 3, 2025Updated 10 months ago
xinyuguo1566 / PriorVLA
View on GitHub
Official implementation of PriorVLA.
☆17May 11, 2026Updated 2 months ago
baaivision / UniVLA
View on GitHub
[ICLR 2026] Unified Vision-Language-Action Model
☆314Oct 15, 2025Updated 9 months ago
fudan-zvg / diffusion-square
View on GitHub
[ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models
☆60Mar 18, 2025Updated last year
ethz-mrl / VidBot
View on GitHub
[CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation
☆52Apr 10, 2026Updated 3 months ago
OpenHelix-Team / CEED-VLA
View on GitHub
[ECCV 2026] Official implementation of CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding.
☆51Sep 15, 2025Updated 10 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
fudan-zvg / GS-LiDAR
View on GitHub
[ICLR 2025] GS-LiDAR: Generating Realistic LiDAR Point Clouds with Panoramic Gaussian Splatting
☆155Mar 18, 2025Updated last year
OpenHelix-Team / Spatial-Forcing
View on GitHub
Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model [ICLR2026]
☆263Jul 7, 2026Updated 2 weeks ago
RoboDita / Dita
View on GitHub
ICCV2025
☆171Dec 10, 2025Updated 7 months ago
fudan-zvg / BridgeAD
View on GitHub
[CVPR 2025] Bridging Past and Future: End-to-End Autonomous Driving with Historical Prediction and Planning
☆108Apr 7, 2025Updated last year
fudan-zvg / DriveX
View on GitHub
[ICCV 2025] Driving Scene Synthesis on Free-form Trajectories with Generative Prior
☆41Jun 28, 2025Updated last year
LogosRoboticsGroup / SGDrive
View on GitHub
[CVPR2026] SGDrive: Scene-to-Goal Hierarchical World Cognition for Autonomous Driving
☆71Jul 2, 2026Updated 2 weeks ago
fudan-zvg / ImagiDrive
View on GitHub
[ICRA2026] ImagiDrive: A Unified Imagination-and-Planning Framework for Autonomous Driving
☆23Mar 17, 2026Updated 4 months ago
Hoyyyaard / 3DFlowAction
View on GitHub
☆59Jul 6, 2025Updated last year
alibaba-damo-academy / RynnVLA-002
View on GitHub
RynnVLA-002: A Unified Vision-Language-Action and World Model
☆1,098Dec 2, 2025Updated 7 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
declare-lab / nora-1.5
View on GitHub
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
☆109Jan 11, 2026Updated 6 months ago
xiaoxiao0406 / VQ-VLA
View on GitHub
The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆134Nov 15, 2025Updated 8 months ago
intuitive-robots / flower_vla_pret
View on GitHub
[CoRL 2025] Pretraining code for FLOWER VLA on OXE
☆41Sep 22, 2025Updated 10 months ago
CladernyJorn / UP-VLA
View on GitHub
Official PyTorch implementation for ICML 2025 paper: UP-VLA.
☆61Jan 20, 2026Updated 6 months ago
ControlVLA / ControlVLA
View on GitHub
Code Repository for ControlVLA, CoRL2025.
☆98Oct 26, 2025Updated 8 months ago
LogosRoboticsGroup / SeerDrive
View on GitHub
[NeurIPS 2025] Future-Aware End-to-End Driving: Bidirectional Modeling of Trajectory Planning and Scene Evolution
☆70Feb 4, 2026Updated 5 months ago
JackHck / FVP
View on GitHub
[ICCV 2025] FVP: 4D Visual Pre-training for Robot Learning
☆17Sep 5, 2025Updated 10 months ago
HHYHRHY / MM-ACT
View on GitHub
[CVPR'2026] "MM-ACT: Learn from Multimodal Parallel Generation to Act"
☆117Mar 13, 2026Updated 4 months ago
nicehiro / Awesome-Vision-Language-Action-Models
View on GitHub
☆14Jul 6, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kuai-lab / cvpr26_Dynamic-eDiTor
View on GitHub
Official code of "Dynamic-eDiTor: Training-Free Text-Driven 4D Scene Editing with Multimodal Diffusion Transformer"
☆18May 28, 2026Updated last month
mlzxy / rla-wm
View on GitHub
Learning Visual Feature-Based World Models via Residual Latent Action
☆42May 11, 2026Updated 2 months ago
OpenHelix-Team / HiF-VLA
View on GitHub
[CVPR 2026] HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model
☆74Mar 11, 2026Updated 4 months ago
zhoubohan0 / NOLO
View on GitHub
[IROS 2025 oral] Official implementation of NOLO: Navigate Only Look Once
☆21Nov 13, 2025Updated 8 months ago
umd-huang-lab / tracevla
View on GitHub
☆75Jan 8, 2025Updated last year
ucd-dare / VITA
View on GitHub
Flowing from Vision to Action: Noise-Free Flow Matching Policy Learning 🎉[ICLR 2026]
☆135May 14, 2026Updated 2 months ago
tsinghua-fib-lab / RoboScape
View on GitHub
☆26Jun 29, 2025Updated last year