SenseTime-FVG / UniMLVG
☆8Updated last month
Alternatives and similar repositories for UniMLVG:
Users that are interested in UniMLVG are comparing it to the libraries listed below
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆75Updated 2 months ago
- [CVPR 2025 Almost Oral ; )] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆83Updated this week
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆33Updated this week
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆62Updated 2 months ago
- ☆101Updated 2 months ago
- Doe-1: Closed-Loop Autonomous Driving with Large World Model☆83Updated last month
- A collection of vision foundation models unifying understanding and generation.☆42Updated 2 months ago
- ☆31Updated 3 months ago
- ☆145Updated last month
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆101Updated 3 weeks ago
- PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models☆38Updated last month
- This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.☆10Updated 3 months ago
- CoRL2024 | Hint-AD: Holistically Aligned Interpretability for End-to-End Autonomous Driving☆53Updated 4 months ago
- Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives☆50Updated last week
- [ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving☆22Updated 3 months ago
- [AAAI 2025] DriveDreamer-2: LLM-Enhanced World Models for Diverse Driving Video Generation☆147Updated 2 months ago
- ☆77Updated 2 months ago
- Code for "DrivingWorld: Constructing World Model for Autonomous Driving via Video GPT"☆158Updated last month
- ☆77Updated last month
- Official Implementation of Driv3R☆80Updated 2 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆111Updated this week
- ☆28Updated 5 months ago
- Official Code Release of Delphi☆54Updated 8 months ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆213Updated 6 months ago
- Simulator-conditioned Driving Scene Generation☆94Updated 3 weeks ago
- FreeVS: Generative View Synthesis on Free Driving Trajectory☆102Updated last week
- [CVPR 2025] StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models☆87Updated 2 months ago
- Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)☆77Updated last month
- [CVPR 2025] The code for paper ''Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding''.☆48Updated this week