[ICLR26] Official implementation of Geometry Forcing: Marrying Video Diffusion and 3D Representation for Consistent World Modeling
☆158Jan 26, 2026Updated last month
Alternatives and similar repositories for GeometryForcing
Users that are interested in GeometryForcing are comparing it to the libraries listed below
Sorting:
- Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos☆40Sep 30, 2025Updated 5 months ago
- [ICLR 2026] PyTorch implementation of "The Less You Depend, The More You Learn: Synthesizing Novel Views from Sparse, Unposed Images with…☆52Jan 26, 2026Updated last month
- [CVPR 2026] Official repo for "EVATok: Adaptive Length Video Tokenization for Efficient Visual Autoregressive Generation"☆37Mar 13, 2026Updated last week
- One4D: Unified 4D Generation and Reconstruction☆92Dec 2, 2025Updated 3 months ago
- [ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction☆419Jun 6, 2025Updated 9 months ago
- [ICCV 2025 ⭐highlight⭐] Implementation of VMem: Consistent Interactive Video Scene Generation with Surfel-Indexed View Memory☆419Jul 25, 2025Updated 7 months ago
- Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]☆318Mar 9, 2026Updated last week
- [ICLR 2026] FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction☆243Feb 25, 2026Updated 3 weeks ago
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking☆56Jan 5, 2025Updated last year
- the official code of DriveMonkey☆45Updated this week
- SyncNoise: Geometrically Consistent Noise Prediction for Text-based 3D Scene Editing☆19Dec 28, 2024Updated last year
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated last month
- ☆71Nov 5, 2025Updated 4 months ago
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency☆240Oct 29, 2025Updated 4 months ago
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models☆847Dec 17, 2025Updated 3 months ago
- ☆41Mar 19, 2025Updated last year
- Wasserstein Gaussian Splatting☆17Dec 10, 2024Updated last year
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆516Aug 4, 2025Updated 7 months ago
- ☆23Mar 13, 2026Updated last week
- TAPIP3D: Tracking Any Point in Persistent 3D Geometry☆382Dec 28, 2025Updated 2 months ago
- ☆124Jun 17, 2025Updated 9 months ago
- ☆33Dec 17, 2025Updated 3 months ago
- [ICLR 2026] Astra : General Interactive World Model with Autoregressive Denoising"☆228Mar 13, 2026Updated last week
- [CVPR2026] Long-horizon, spatially consistent video generation enabled by persistent 3D scene point clouds and dynamic-static disentangle…☆178Mar 15, 2026Updated last week
- [ACMMM 2025] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Jun 20, 2025Updated 9 months ago
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control☆1,303Sep 24, 2025Updated 5 months ago
- 🏠 [ICCV 2025] MEGA: Memory-Efficient 4D Gaussian Splatting for Dynamic Scenes☆57Nov 29, 2024Updated last year
- UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation☆46Aug 26, 2025Updated 6 months ago
- [CVPR 2025] A Unified Image-Dense Annotation Generation Model for Underwater Scenes☆55Apr 9, 2025Updated 11 months ago
- A simple state update rule to enhance length generalization for CUT3R☆609Oct 1, 2025Updated 5 months ago
- [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time☆344Oct 31, 2025Updated 4 months ago
- 🕹️ Explore cutting-edge techniques in game generation☆65Mar 13, 2026Updated last week
- [ICLR 2026] InstructVLA: Vision-Language-Action Instruction Tuning from Understanding to Manipulation☆108Jan 27, 2026Updated last month
- [ICLR 2026] Streaming 4D Visual Geometry Transformer☆849Oct 27, 2025Updated 4 months ago
- [IV 2025, Oral] Official code of "6Img-to-3D: Few-Image Large-Scale Outdoor Novel View Synthesis"☆81Sep 3, 2025Updated 6 months ago
- [ICLR 2026] Many-for-Many: Unify the Training of Multiple Video and Image Generation and Manipulation Tasks☆30Feb 5, 2026Updated last month
- code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"☆1,193Nov 9, 2025Updated 4 months ago
- [NeurIPS 2025] Official code for paper: Beyond Attention or Similarity: Maximizing Conditional Diversity for Token Pruning in MLLMs.☆92Sep 20, 2025Updated 6 months ago
- [AAAI 2026] Turbo-VAED: Fast and Stable Transfer of Video-VAEs to Mobile Devices☆95Nov 30, 2025Updated 3 months ago