LabShuHangGU / PerLDiff
PerLDiff: Controllable Street View Synthesis Using Perspective-Layout Diffusion Models
☆43Updated 3 months ago
Alternatives and similar repositories for PerLDiff:
Users that are interested in PerLDiff are comparing it to the libraries listed below
- ☆29Updated 7 months ago
- A curated list of resources focused on Visual AutoRegressive Modeling, makes GPT-style AR models surpass diffusion transformers in image …☆29Updated last month
- Official Code Release of Delphi☆55Updated 10 months ago
- ☆88Updated 3 months ago
- Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆79Updated 4 months ago
- [ICLR 2025] Semi-Supervised Vision-Centric 3D Occupancy World Model for Autonomous Driving☆26Updated 2 months ago
- [ICLR 2025] Official code implementation for the paper "X-Drive: Cross-modality Consistent Multi-Sensor Data Synthesis for Driving Scenar…☆46Updated last month
- ☆26Updated 4 months ago
- [CVPR2024] Official Repository of Paper "Panacea: Panoramic and Controllable Video Generation for Autonomous Driving"☆225Updated 8 months ago
- Official Github Repo for GEM☆41Updated last week
- Street-View Image Generation from a Bird’s-Eye View Layout: Official Codebase☆74Updated last year
- [CVPR 2025] ReconDreamer☆131Updated 4 months ago
- [ECCV 2024] DriveDreamer: Towards Real-world-driven World Models for Autonomous Driving☆23Updated 4 months ago
- ☆20Updated 2 weeks ago
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation☆103Updated 2 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆21Updated last week
- [ECCV 2024] Official implementation of "RangeLDM: Fast Realistic LiDAR Point Cloud Generation"☆32Updated 4 months ago
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆123Updated 2 weeks ago
- (ECCV'24) Official Implementation of SCP-Diff: Photo-Realistic Semantic Image Synthesis with Spatial-Categorical Joint Prior.☆12Updated 6 months ago
- ☆81Updated 3 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆129Updated 3 weeks ago
- official code of "MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction"☆57Updated 3 weeks ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆66Updated 4 months ago
- ☆45Updated 3 months ago
- ☆46Updated 4 months ago
- This repository is dedicated to Track 2 of the W-CODA 2024 Workshop, "Multimodal Perception and Comprehension of Corner Cases in Autonomo…☆10Updated 10 months ago
- FreeVS: Generative View Synthesis on Free Driving Trajectory☆117Updated last month
- ☆42Updated 3 weeks ago
- ☆38Updated 9 months ago
- [CVPR 2025] DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation☆44Updated last month