Official implementation of Video-DPM
☆173Jan 19, 2026Updated last month
Alternatives and similar repositories for vdpm
Users that are interested in vdpm are comparing it to the libraries listed below
Sorting:
- The official implementation of InfiniteVGGT☆302Jan 19, 2026Updated last month
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆30Feb 5, 2026Updated 3 weeks ago
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆35Jan 27, 2026Updated last month
- [ICLR 2026] Code for QuantVGGT: Quantized Visual Geometry Grounded Transformer☆92Feb 17, 2026Updated last week
- Accelerate VGGT with efficient desciptor-based global attention☆58Dec 3, 2025Updated 2 months ago
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image☆64Dec 23, 2025Updated 2 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆58Dec 26, 2025Updated 2 months ago
- ☆33Jan 23, 2026Updated last month
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆179Sep 26, 2025Updated 5 months ago
- Project Page for GaussianFormer☆24May 30, 2024Updated last year
- AnyDepth: Depth Estimation Made Easy☆252Feb 23, 2026Updated last week
- [CVPR25] SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs☆19Aug 27, 2025Updated 6 months ago
- Code for "InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields"☆581Jan 20, 2026Updated last month
- [ICLR2026] Official Implementation of "Dens3R: A Foundation Model for 3D Geometry Prediction"☆370Sep 29, 2025Updated 5 months ago
- Generalizable Perception Stack for all things 3D, 4D & Scene Understanding☆75Dec 15, 2025Updated 2 months ago
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆913Sep 26, 2025Updated 5 months ago
- Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]☆316Feb 2, 2026Updated 3 weeks ago
- MuM's a pretty good feature extractor for 3D tasks, probably the best.☆71Nov 24, 2025Updated 3 months ago
- [NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"☆96Dec 21, 2025Updated 2 months ago
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆832Oct 27, 2025Updated 4 months ago
- Unraveling the Effects of Synthetic Data on End-to-End Autonomous Driving☆32Nov 20, 2025Updated 3 months ago
- [CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".☆405Updated this week
- [ICLR 2025] GI-GS: Global Illumination Decomposition on Gaussian Splatting for Inverse Rendering☆108Apr 8, 2025Updated 10 months ago
- A simple state update rule to enhance length generalization for CUT3R☆586Oct 1, 2025Updated 5 months ago
- [NeurIPS 2025] Official code for Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos☆83Dec 30, 2025Updated 2 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆433Updated this week
- [CVPR 2026] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos☆358Feb 21, 2026Updated last week
- Code for the ShapeR research paper☆674Feb 5, 2026Updated 3 weeks ago
- Code of WinT3R: Window-Based Streaming Rrconstruction With Camera Token Pool☆221Feb 13, 2026Updated 2 weeks ago
- Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction☆186Jan 14, 2026Updated last month
- [CVPR 2025] "DiffusionSfM: Predicting Structure and Motion via Ray Origin and Endpoint Diffusion" official implementation.☆182Jul 7, 2025Updated 7 months ago
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"☆411Nov 24, 2025Updated 3 months ago
- [ICCV 2025] Official implementation of SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts