yangcaoai / 3DGS-DET
Official codes for paper: 3DGS-DET: Empower 3D Gaussian Splatting with Boundary Guidance and Box-Focused Sampling for 3D Object Detection
β135Updated 4 months ago
Alternatives and similar repositories for 3DGS-DET:
Users that are interested in 3DGS-DET are comparing it to the libraries listed below
- Official implementation of paper "Pyramid Diffusion for Fine 3D Large Scene Generation" (ECCV 2024 Oral)β118Updated 5 months ago
- Pytorch Code for "LEGaussians: Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding"β134Updated 4 months ago
- [CVPR 2024] π‘Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoningβ72Updated last year
- Aether: Geometric-Aware Unified World Modelingβ198Updated this week
- β110Updated 7 months ago
- Seeing World Dynamics in a Nutshellβ99Updated 2 weeks ago
- β32Updated 8 months ago
- The official implementation of SAGS (Segment Anything in 3D Gaussians)β81Updated 10 months ago
- [CVPR2024] SANeRF-HQ: Segment Anything for NeRF in High Quality.β48Updated 9 months ago
- SAMPro3D: Locating SAM Prompts in 3D for Zero-Shot Scene Segmentationβ123Updated last year
- [ECCV'24] OpenIns3D: Snap and Lookup for 3D Open-vocabulary Instance Segmentationβ183Updated 5 months ago
- [3DV 2025 Oral]: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretrainingβ179Updated last week
- [NeurIPS 2024] Lexicon3D: Probing Visual Foundation Models for Complex 3D Scene Understandingβ86Updated 2 months ago
- [CVPR 2024] The official implementation for "SemCity: Semantic Scene Generation with Triplane Diffusion"β179Updated 4 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videosβ222Updated 6 months ago
- Official code of DMA: Dense Multimodal Alignment for Open-Vocabulary 3D Scene Understanding, ECCV 2024β29Updated 8 months ago
- SceneFun3D ToolKitβ128Updated 2 weeks ago
- β55Updated last month
- [ECCV 2024] Efficient Large-Baseline Radiance Fields, a feed-forward 2DGS modelβ307Updated 8 months ago
- ποΈ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Viewsβ238Updated 4 months ago
- [ECCV 2024] Monocular Occupancy Prediction for Scalable Indoor Scenesβ55Updated 6 months ago
- [CVPR 2025] GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understandingβ123Updated last week
- Official implementation of β4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Modelsβ (CVPR 2025)β80Updated 3 weeks ago
- β47Updated last year
- [CVPR 2024] "LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning"; an interactive Large Languβ¦β275Updated 8 months ago
- [NeurIPS 2024] OpenGaussian: Towards Point-Level 3D Gaussian-based Open Vocabulary Understandingβ106Updated 3 months ago
- [ECCV 2024] Implementation of latentSplat: Autoencoding Variational Gaussians for Fast Generalizable 3D Reconstructionβ213Updated 8 months ago
- Official implemetation of the paper "Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting".β176Updated 7 months ago
- PonderV2: Pave the Way for 3D Foundation Model with A Universal Pre-training Paradigmβ337Updated 11 months ago
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Modelsβ173Updated 2 months ago