eldar / vdpmLinks
Official implementation of Video-DPM
☆56Updated this week
Alternatives and similar repositories for vdpm
Users that are interested in vdpm are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆155Updated 3 months ago
- ☆123Updated 7 months ago
- The official implementation of InfiniteVGGT☆219Updated last week
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆117Updated 5 months ago
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆105Updated 4 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆134Updated 7 months ago
- [NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"☆91Updated last month
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆175Updated 3 months ago
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆72Updated 7 months ago
- Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction☆125Updated last week
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆40Updated 5 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆219Updated 7 months ago
- ☆67Updated last year
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆87Updated 2 months ago
- Code for Faster VGGT with Block-Sparse Global Attention☆88Updated 2 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆42Updated 5 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆100Updated 3 months ago
- Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”☆73Updated last month
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆58Updated 5 months ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆456Updated 2 months ago
- Any4D: Unified Feed-Forward Metric 4D Reconstruction☆234Updated last month
- "E-RayZer: Self-supervised 3D Reconstruction as Spatial Visual Pre-training" official implementation.☆226Updated 3 weeks ago
- Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior"☆105Updated 9 months ago
- Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"☆104Updated 7 months ago
- "VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames"☆90Updated 6 months ago
- Official implementation of EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting☆52Updated 7 months ago
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image☆60Updated 3 weeks ago
- ☆104Updated 4 months ago
- Unifying 2D and 3D Vision-Language Understanding☆119Updated 5 months ago
- Code repository for "DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers"☆74Updated 2 months ago