nvidia-cosmos / cosmos-transfer1
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.
☆379Updated this week
Alternatives and similar repositories for cosmos-transfer1:
Users that are interested in cosmos-transfer1 are comparing it to the libraries listed below
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆180Updated last week
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆305Updated last month
- Official PyTorch Implementation of "History-Guided Video Diffusion"☆296Updated 2 months ago
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆252Updated 5 months ago
- Aether: Geometric-Aware Unified World Modeling☆292Updated last month
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆248Updated 6 months ago
- ☆129Updated last month
- ☆159Updated 2 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆491Updated 6 months ago
- Generative World Explorer☆143Updated 5 months ago
- ☆265Updated 3 weeks ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆244Updated 3 months ago
- [NeurIPS 2024] A Generalizable World Model for Autonomous Driving☆724Updated 4 months ago
- Official repo and evaluation implementation of VSI-Bench☆475Updated 2 months ago
- PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos☆204Updated this week
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).☆343Updated last month
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control☆538Updated 2 months ago
- ☆380Updated last year
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆474Updated 5 months ago
- Benchmarking physical understanding in generative video models☆158Updated last week
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆186Updated 4 months ago
- Code for PhysDreamer☆557Updated 2 months ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆155Updated last week
- [ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".☆64Updated last month
- Official implementation of Continuous 3D Perception Model with Persistent State☆772Updated 3 weeks ago
- GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors☆270Updated last week
- ☆126Updated 4 months ago
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"☆1,157Updated this week
- [ICLR 2025 Spotlight] MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility☆162Updated 2 months ago
- [ICML 2024] Official code repository for 3D embodied generalist agent LEO☆436Updated 2 weeks ago