fudan-zvg / WoVoGen
WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation
☆82Updated 2 months ago
Related projects ⓘ
Alternatives and complementary repositories for WoVoGen
- ☆25Updated 3 weeks ago
- [WACV'25] Calib3D: Calibrating Model Preferences for Reliable 3D Scene Understanding☆44Updated 7 months ago
- MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning☆62Updated 8 months ago
- DynamicCity: Large-Scale LiDAR Generation from Dynamic Scenes☆47Updated 3 weeks ago
- [ECCV 2024] Occupancy as Set of Points☆81Updated 4 months ago
- ☆52Updated last year
- [ECCV 2024] 4D Contrastive Superflows are Dense 3D Representation Learners☆41Updated 2 months ago
- [CVPR 2024] Symphonies (Scene-from-Insts): Symphonize 3D Semantic Scene Completion with Contextual Instance Queries☆168Updated 4 months ago
- ☆26Updated 2 months ago
- [NeurIPS 2024] DrivingDojo Dataset: Advancing Interactive and Knowledge-Enriched Driving World Model☆29Updated last week
- [CVPR 2024] The official implementation for "SemCity: Semantic Scene Generation with Triplane Diffusion"☆161Updated 6 months ago
- Official Code Release of Delphi☆52Updated 5 months ago
- [CVPR 2024] Official PyTorch Code of SeaBird: Segmentation in Bird's View with Dice Loss Improves Monocular 3D Detection of Large Objects☆85Updated 3 weeks ago
- ☆21Updated 7 months ago
- [ECCV 2024] TOD3Cap: Towards 3D Dense Captioning in Outdoor Scenes☆101Updated 3 months ago
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understanding☆48Updated 3 weeks ago
- [CVPR 2023] MV-JAR: Masked Voxel Jigsaw and Reconstruction for LiDAR-Based Self-Supervised Pre-Training☆47Updated last year
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)☆106Updated last month
- FreeVS: Generative View Synthesis on Free Driving Trajectory☆57Updated 3 weeks ago
- [ECCV 2024] Official implementation for "RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception"☆21Updated 4 months ago
- [ECCV 2024] Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions☆74Updated last month
- (ICCV2023) MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection☆80Updated 11 months ago
- Official code of "Segment any 3D Object with Language"☆37Updated 6 months ago
- ☆15Updated last year
- CVPR 2023 Use NeRF-generated images to train your model.☆74Updated last year
- Official PyTorch Implementation of HTCL (ECCV 2024): Hierarchical Temporal Context Learning for Camera-based Semantic Scene Completion☆34Updated 4 months ago
- ☆85Updated 7 months ago
- Multi-Space Alignments Towards Universal LiDAR Segmentation☆39Updated 3 weeks ago
- ☆45Updated 11 months ago
- OccSora: 4D Occupancy Generation Models as World Simulators for Autonomous Driving☆148Updated 5 months ago