nvidia-cosmos / cosmos-transfer1Links
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.
☆444Updated this week
Alternatives and similar repositories for cosmos-transfer1
Users that are interested in cosmos-transfer1 are comparing it to the libraries listed below
Sorting:
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆227Updated this week
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆435Updated this week
- [ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"☆334Updated this week
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆262Updated 6 months ago
- Aether: Geometric-Aware Unified World Modeling☆326Updated this week
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆482Updated 6 months ago
- Official implementation of Continuous 3D Perception Model with Persistent State☆816Updated last month
- Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset☆277Updated last week
- ☆143Updated 3 weeks ago
- [RSS 2025] Learning to Act Anywhere with Task-centric Latent Actions☆351Updated this week
- A curated list of awesome 3D scene generation papers. (arXiv 2505.05474)☆368Updated this week
- Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"☆1,174Updated 3 weeks ago
- [NeurIPS 2024] A Generalizable World Model for Autonomous Driving☆742Updated 5 months ago
- Orient Anything, ICML 2025☆276Updated 2 weeks ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆293Updated 4 months ago
- Generative World Explorer☆143Updated 6 months ago
- ☆162Updated 3 months ago
- Theia: Distilling Diverse Vision Foundation Models for Robot Learning☆231Updated 2 months ago
- Benchmarking physical understanding in generative video models☆168Updated last week
- PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation (ECCV 2024)☆299Updated 7 months ago
- [WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think☆431Updated 5 months ago
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control☆559Updated 2 months ago
- [CVPR 2025] Video Depth without Video Models☆538Updated 2 months ago
- Code for PhysDreamer☆561Updated 3 months ago
- Code release for https://kovenyu.com/WonderWorld/☆555Updated last month
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆253Updated 7 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆515Updated 7 months ago
- [CVPR 2025] Prompt Depth Anything☆807Updated 2 months ago
- world modeling challenge for humanoid robots☆485Updated 6 months ago
- PE3R: Perception-Efficient 3D Reconstruction. Take 2 - 3 photos with your phone, upload them, wait a few minutes, and then start explorin…☆358Updated 2 months ago