[CVPR 2026] WorldStereo: Bridging Camera-Guided Video Generation and Scene Reconstruction via 3D Geometric Memories (WorldExpand of HY-World 2.0)
☆164Apr 24, 2026Updated last month
Alternatives and similar repositories for WorldStereo
Users who are interested in WorldStereo are comparing it to the repositories listed below.
- [ICLR 2026] NOVA3R: Non-pixel-aligned Visual Transformer for Amodal 3D Reconstruction☆136Updated this week
- Consistent Autoregressive Video Generation with Long Context☆88Feb 6, 2026Updated 4 months ago
- [CVPR 2026] ZipMap: Linear-Time Stateful 3D Reconstruction via Test-Time Training☆448Updated this week
- ☆79Mar 30, 2026Updated 2 months ago
- ☆41Mar 19, 2025Updated last year
- Code for MegaSynth: Scaling Up 3D Scene Reconstruction with Synthesized Data (CVPR 2025)☆203May 20, 2025Updated last year
- ☆48Apr 15, 2026Updated 2 months ago
- [3DV 2026] FastMesh: Efficient Artistic Mesh Generation via Component Decoupling☆132Nov 11, 2025Updated 7 months ago
- [CVPR 2025] MaskGaussian: Adaptive 3D Gaussian Representation from Probabilistic Masks☆99May 12, 2026Updated last month
- [ICLR 2026] FantasyWorld: Geometry-Consistent World Modeling via Unified Video and 3D Prediction☆295Feb 25, 2026Updated 3 months ago
- ☆23Dec 11, 2024Updated last year
- Joint 3D Geometry Reconstruction and Motion Generation for 4D Synthesis from a Single Image☆68May 8, 2026Updated last month
- Implementation of Gamma-World: Generative Multi-Agent World Modeling Beyond Two Players☆607May 28, 2026Updated 2 weeks ago
- [ICCV 2025] MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network☆34Dec 16, 2025Updated 6 months ago
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆58Mar 18, 2025Updated last year
- The first open-domain closed-loop revisited benchmark for evaluating memory consistency and action control in world models.☆66May 25, 2026Updated 3 weeks ago
- [Official Repo] SpatialEdit: Benchmarking Fine-Grained Image Spatial Editing☆211Apr 13, 2026Updated 2 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆136Jun 10, 2025Updated last year
- [ICML 2026] World-R1: Reinforcing 3D Constraints for Text-to-Video Generation☆385Jun 3, 2026Updated last week
- [CVPR 2026 Highlight & Best Paper of VideoWorldModel Workshop] NeoVerse: Enhancing 4D World Model with in-the-wild Monocular Videos☆609May 12, 2026Updated last month
- PSDR-Room: Single Photo to Scene using Differentiable Rendering (Siggraph Asia 2023)☆34Dec 2, 2023Updated 2 years ago
- Official implementation of Bifrost-1: Bridging Multimodal LLMs and Diffusion Models with Patch-level CLIP Latents (NeurIPS 2025)☆47Nov 24, 2025Updated 6 months ago
- ☆62Mar 24, 2025Updated last year
- Official implementation of `AToM: Amortized Text-to-Mesh using 2D Diffusion`☆84Dec 10, 2025Updated 6 months ago
- [NeurIPS 2024 Spotlight] Tetrahedron Splatting for 3D Generation☆178Mar 18, 2025Updated last year
- Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.☆741Nov 25, 2025Updated 6 months ago
- [CVPR2025] MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model☆162Jan 2, 2026Updated 5 months ago
- [ICCV 2025] GameFactory: Creating New Games with Generative Interactive Videos☆489Mar 22, 2025Updated last year
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer"☆320Mar 30, 2025Updated last year
- [ICLR 2024] Official Implementation of Consistent4D: Consistent 360° Dynamic Object Generation from Monocular Video☆283Nov 14, 2024Updated last year
- Official code for NeurIPS 2024 paper LRM-Zero: Training Large Reconstruction Models with Synthesized Data☆154Oct 7, 2024Updated last year
- [CVPR 2024] BerfScene: Bev-conditioned Equivariant Radiance Fields for Infinite 3D Scene Generation☆45May 7, 2024Updated 2 years ago
- Official implementation of NeurIPS 2025 paper "SceneWeaver: All-in-One 3D Scene Synthesis with an Extensible and Self-Reflective Agent"☆158Nov 13, 2025Updated 7 months ago
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D☆128Mar 12, 2025Updated last year
- [NeurIPS 2025]"DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling"☆100Dec 21, 2025Updated 5 months ago
- [NeurIPS2024] DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion☆39Sep 25, 2024Updated last year
- [NeurIPS 2024] Make-it-Real: Unleashing Large Multimodal Model for Painting 3D Objects with Realistic Materials☆187Jul 4, 2024Updated last year
- GenXD: Generating Any 3D and 4D Scenes. ICLR 2025☆223Mar 30, 2025Updated last year
- VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control☆398Feb 26, 2026Updated 3 months ago