eldar / vdpmLinks
Official implementation of Video-DPM
☆164Updated 2 weeks ago
Alternatives and similar repositories for vdpm
Users that are interested in vdpm are comparing it to the libraries listed below
Sorting:
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆108Updated 4 months ago
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆158Updated 4 months ago
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆92Updated 3 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆221Updated 8 months ago
- ☆123Updated 7 months ago
- The official implementation of InfiniteVGGT☆287Updated 3 weeks ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆179Updated 4 months ago
- "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.☆247Updated last month
- Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction☆172Updated 3 weeks ago
- "VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames"☆92Updated 6 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆135Updated 7 months ago
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆117Updated 6 months ago
- Any4D: Unified Feed-Forward Metric 4D Reconstruction☆296Updated last month
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆73Updated 7 months ago
- Orient Anything V2, NeurIPS 2025 Spotlight☆194Updated 3 weeks ago
- [ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields☆489Updated 3 months ago
- DAS3R: Dynamics-Aware Gaussian Splatting for Static Scene Reconstruction☆155Updated 11 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆43Updated 6 months ago
- MEt3R: Measuring Multi-View Consistency in Generated Images☆158Updated 6 months ago
- Seeing World Dynamics in a Nutshell☆111Updated 10 months ago
- [CVPR2025] Feat2GS: Probing Visual Foundation Models with Gaussian Splatting☆230Updated 6 months ago
- Official code for paper: "RayRoPE: Projective Ray Positional Encoding for Multi-view Attention"☆91Updated this week
- [CVPR 2025] GenFusion: Closing the Loop between Reconstruction and Generation via Videos☆159Updated 9 months ago
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆40Updated 5 months ago
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆78Updated last month
- Unified 3D Reconstruction and Semantic Understanding via Generalizable Gaussian Splatting from Unposed Multi-View Images☆119Updated 5 months ago
- [NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"☆93Updated last month
- ☆66Updated last year
- Official implementation of EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting☆54Updated 7 months ago
- ☆278Updated this week