yuhengliu02 / pyramid-discrete-diffusionLinks
Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)
☆125Updated 4 months ago
Alternatives and similar repositories for pyramid-discrete-diffusion
Users that are interested in pyramid-discrete-diffusion are comparing it to the libraries listed below
Sorting:
- [CVPR 2024] 🏡Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning☆79Updated last year
- ☆89Updated 7 months ago
- [CVPR 2024] The official implementation for "SemCity: Semantic Scene Generation with Triplane Diffusion"☆192Updated 8 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding☆156Updated 3 weeks ago
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models☆56Updated last month
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆235Updated this week
- [ICCV 2025] DiST-4D: Disentangled Spatiotemporal Diffusion with Metric Depth for 4D Driving Scene Generation☆90Updated 3 weeks ago
- [ICLR 2025 Spotlight] Official implementation for "DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes"☆210Updated last month
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model☆83Updated 7 months ago
- Official Implementation of Driv3R☆91Updated 7 months ago
- Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detecti…☆147Updated 8 months ago
- official code of "MuDG: Taming Multi-modal Diffusion with Gaussian Splatting for Urban Scene Reconstruction"☆73Updated 4 months ago
- [ICLR 2024] This is the official implementation of our paper "Semantic Flow: Learning Semantic Fields of Dynamic Scenes from Monocular Vi…☆11Updated 10 months ago
- [ICLR 2025] Official code of "Segment any 3D Object with Language"☆50Updated last month
- Official PyTorch implementation of the paper ‘CLIP-GS: CLIP-Informed Gaussian Splatting for Real-time and View-consistent 3D Semantic Und…☆50Updated last year
- [ICLR2024] the official pytorch implementation of UC-NeRF☆126Updated last year
- Official PyTorch implementation of 3D Gaussian Mapping (3DGM)☆84Updated 10 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆87Updated 4 months ago
- [CVPR 2023] Semantic Ray: Learning a Generalizable Semantic Field with Cross-Reprojection Attention☆82Updated last year
- A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045)☆228Updated this week
- Project Page for GaussianFormer☆25Updated last year
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆31Updated 5 months ago
- ☆47Updated last year
- "VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames"☆81Updated 3 weeks ago
- official code of CVPR2025 Evolsplat☆49Updated last month
- Seeing World Dynamics in a Nutshell☆109Updated 4 months ago
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'☆94Updated this week
- [ICCV2023] 🧊FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models☆128Updated 11 months ago
- The official repository of our paper: "Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior"☆114Updated last year
- Implementation of the project: SceneSplat - Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining☆55Updated 3 weeks ago