Official implementation of Video-DPM
☆212Jan 19, 2026Updated 3 months ago
Alternatives and similar repositories for vdpm
Users that are interested in vdpm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The official implementation of InfiniteVGGT☆352Apr 19, 2026Updated 2 weeks ago
- The official implementation of the paper “VGGT4D: Mining Motion Cues in Visual Geometry Transformers for 4D Scene Reconstruction.”☆248Dec 2, 2025Updated 5 months ago
- [CVPR 2026 (Oral)] MAGICIAN: Efficient Long-Term Planning with Imagined Gaussians for Active Mapping☆78Apr 13, 2026Updated 2 weeks ago
- [ICLR 2026] RefAny3D: 3D Asset-Referenced Diffusion Models for Image Generation☆34Mar 10, 2026Updated last month
- [ICLR 2026] Code for QuantVGGT: Quantized Visual Geometry Grounded Transformer☆104Mar 20, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2026] StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆112Mar 11, 2026Updated last month
- Accelerate VGGT with efficient desciptor-based global attention☆70Updated this week
- Generalizable Perception Stack for all things 3D, 4D & Scene Understanding☆84Mar 22, 2026Updated last month
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆948Feb 27, 2026Updated 2 months ago
- Video Depth Propagation [3DV 2026]☆35Jan 23, 2026Updated 3 months ago
- [CVPR 2026] Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second".☆436Mar 19, 2026Updated last month
- ShotStream: Streaming Multi-Shot Video Generation for Interactive Storytelling☆130Mar 31, 2026Updated last month
- [CVPR25] SPC-GS: Gaussian Splatting with Semantic-Prompt Consistency for Indoor Open-World Free-view Synthesis from Sparse Inputs☆18Aug 27, 2025Updated 8 months ago
- AnyDepth: Depth Estimation Made Easy☆283Feb 23, 2026Updated 2 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Project Page for GaussianFormer☆24May 30, 2024Updated last year
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆183Mar 10, 2026Updated last month
- [NeurIPS 2025] Official code for Reconstruct, Inpaint, Test-Time Finetune: Dynamic Novel-view Synthesis from Monocular Videos☆88Dec 30, 2025Updated 4 months ago
- [ICLR2026] Official Implementation of "Dens3R: A Foundation Model for 3D Geometry Prediction"☆380Sep 29, 2025Updated 7 months ago
- A simple state update rule to enhance length generalization for CUT3R☆646Oct 1, 2025Updated 7 months ago
- Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]☆355Mar 26, 2026Updated last month
- [ICLR 2026] Implementation of the paper "Learning Unified Representation of 3D Gaussian Splatting". Rethinking 3DGS representation in neu…☆48Mar 17, 2026Updated last month
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆898Oct 27, 2025Updated 6 months ago
- Official code release for the PVSM paper: "From Rays to Projections: Better Inputs for Feed-Forward View Synthesis"☆44Jan 9, 2026Updated 3 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [CVPR26] MuM's a pretty good feature extractor for 3D tasks, probably the best.☆83Apr 6, 2026Updated 3 weeks ago
- [CVPR'2026]: MoRe: Motion-aware Feed-forward 4D Reconstruction Transformer☆60Apr 21, 2026Updated last week
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image☆67Dec 23, 2025Updated 4 months ago
- [CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields☆949Apr 3, 2026Updated last month
- [ICLR'26] YoNoSplat: You Only Need One Model for Feedforward 3D Gaussian Splatting☆181Feb 25, 2026Updated 2 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆63Mar 19, 2026Updated last month
- Drivable 3D Gaussian Avatars - A 3D controllable model for human bodies rendered with Gaussian primitives embedded in tetrahedral cages.☆81Feb 20, 2025Updated last year
- Code of WinT3R: Window-Based Streaming Rrconstruction With Camera Token Pool☆225Mar 4, 2026Updated last month
- [CVPR 2026] Gen3R: 3D Scene Generation Meets Feed-Forward Reconstruction☆265Mar 20, 2026Updated last month
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆462Apr 16, 2026Updated 2 weeks ago
- ☆17Apr 17, 2025Updated last year
- [CVPR 2026 Highlight] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos☆545Apr 13, 2026Updated 2 weeks ago
- [T-PAMI 2025] The official repo for “GPS-Gaussian+: Generalizable Pixel-Wise 3D Gaussian Splatting for Real-Time Human-Scene Rendering fr…☆67Jan 19, 2026Updated 3 months ago
- 🚀 Official code for “XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression”, …☆45Jan 27, 2026Updated 3 months ago
- ACE-SLAM: Scene Coordinate Regression for Real-Time SLAM☆92Dec 17, 2025Updated 4 months ago
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"☆429Nov 24, 2025Updated 5 months ago