ZhaoYujie2002 / LangSplatV2Links
LangSplatV2: High-dimensional 3D Language Gaussian Splatting with 450+ FPS
☆91Updated 3 weeks ago
Alternatives and similar repositories for LangSplatV2
Users that are interested in LangSplatV2 are comparing it to the libraries listed below
Sorting:
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆120Updated 3 months ago
- ☆89Updated last month
- Multi-SpatialMLLM Multi-Frame Spatial Understanding with Multi-Modal Large Language Models☆140Updated 2 months ago
- Official implementation of EPiC: Efficient Video Camera Control Learning with Precise Anchor-Video Guidance☆40Updated 2 months ago
- Self-reimplemented version of 4D-LRM.☆48Updated 2 months ago
- Official implementation of "Seurat: From Moving Points to Depth", CVPR 2025 Highlight☆63Updated 3 months ago
- ☆29Updated 2 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆45Updated 4 months ago
- [arXiv 2024] Novel View Extrapolation with Video Diffusion Priors☆119Updated 5 months ago
- [CVPR 2025] Official code for the paper "SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis"☆96Updated 4 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆95Updated 4 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆37Updated 5 months ago
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency☆200Updated 3 months ago
- Official code for "JAFAR: Jack up Any Feature at Any Resolution"☆147Updated 2 weeks ago
- Seeing World Dynamics in a Nutshell☆109Updated 4 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆182Updated 2 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆124Updated 2 weeks ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆87Updated 4 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion☆268Updated 3 weeks ago
- The official implementation of "Compositional Generative Model of Unbounded 4D Cities". (arXiv 2501.08983)☆102Updated 6 months ago
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆111Updated last month
- [ECCV 2024] Pytorch code for our ECCV'24 paper NeRF-MAE: Masked AutoEncoders for Self-Supervised 3D Representation Learning for Neural Ra…☆103Updated 4 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction☆235Updated this week
- ☆72Updated 2 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding☆52Updated this week
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory☆304Updated last week
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆168Updated 3 months ago
- Official repository for "Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation" (ICLR2025)☆71Updated 3 months ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆208Updated last week
- [CVPR2025] Official Implementation of AuraFusion360☆73Updated 3 weeks ago