InternRobotics / Aether
[ICCV 2025] Aether: Geometric-Aware Unified World Modeling
☆446 · Updated last month
Alternatives and similar repositories for Aether
Users who are interested in Aether compare it to the repositories listed below.
- A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045) ☆277 · Updated last week
- Code for Streaming 4D Visual Geometry Transformer ☆542 · Updated last week
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025) ☆122 · Updated 4 months ago
- Orient Anything, ICML 2025 ☆312 · Updated 3 months ago
- List of papers on 4D Generation. ☆292 · Updated 10 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos ☆229 · Updated 11 months ago
- [ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World ☆306 · Updated last month
- ☆307 · Updated 8 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction ☆247 · Updated 3 weeks ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video ☆187 · Updated 3 months ago
- Official implementation of "MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second". ☆318 · Updated last month
- The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors' ☆107 · Updated 3 weeks ago
- [ICCV 2025] Code for Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction ☆153 · Updated this week
- [ICCV 2025] Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency ☆208 · Updated 4 months ago
- [ICCV 2025] A simple training-free approach adapting DUSt3R for dynamic scenes. ☆438 · Updated 4 months ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion ☆276 · Updated last month
- Towards a Generative 3D World Engine for Embodied Intelligence ☆289 · Updated last week
- [CVPR 2025 Highlight] VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step ☆303 · Updated last month
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation ☆265 · Updated 9 months ago
- Cameras as Relative Positional Encoding ☆530 · Updated 2 weeks ago
- Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence ☆320 · Updated 2 months ago
- [ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction ☆350 · Updated 2 months ago
- A curated list of awesome 3D scene generation papers. (arXiv 2505.05474) ☆644 · Updated last week
- ☆360 · Updated 3 weeks ago
- Source code of paper "NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer" ☆303 · Updated 4 months ago
- [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors ☆374 · Updated 3 weeks ago
- PhysX: Physical-Grounded 3D Asset Generation ☆232 · Updated this week
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" ☆395 · Updated 3 weeks ago
- 🎞️ [NeurIPS'24] MVSplat360: Feed-Forward 360 Scene Synthesis from Sparse Views ☆278 · Updated 8 months ago
- "Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li… ☆317 · Updated 7 months ago