nvidia-cosmos / cosmos-transfer1
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environments.
☆240Updated last week
Alternatives and similar repositories for cosmos-transfer1:
Users that are interested in cosmos-transfer1 are comparing it to the libraries listed below
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆78Updated last week
- Official PyTorch Implementation of "History-Guided Video Diffusion"☆233Updated 3 weeks ago
- Generative World Explorer☆138Updated 4 months ago
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆229Updated 3 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation☆237Updated 4 months ago
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆127Updated last week
- (CVPR 2025) The Scene Language: Representing Scenes with Programs, Words, and Embeddings☆174Updated 3 weeks ago
- ☆254Updated 2 months ago
- ☆157Updated last month
- [CVPR 2024] Probing the 3D Awareness of Visual Foundation Models☆297Updated 8 months ago
- Official implementation of Continuous 3D Perception Model with Persistent State☆684Updated last week
- Aether: Geometric-Aware Unified World Modeling☆120Updated this week
- ☆122Updated 2 months ago
- [ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning☆269Updated last month
- Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).☆336Updated 2 weeks ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆234Updated last week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆215Updated this week
- [CVPR2024 Oral] EscherNet: A Generative Model for Scalable View Synthesis☆335Updated 6 months ago
- ☆85Updated 3 weeks ago
- Gaga: Group Any Gaussians via 3D-aware Memory Bank☆352Updated 3 weeks ago
- "Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Li…☆280Updated 2 months ago
- [ICML 2024] 3D-VLA: A 3D Vision-Language-Action Generative World Model☆455Updated 5 months ago
- We have released official implementation in https://github.com/VAST-AI-Research/MIDI-3D☆129Updated 2 weeks ago
- Benchmarking physical understanding in generative video models☆137Updated last month
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆163Updated 2 weeks ago
- Official Implementation of paper "Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence"☆116Updated 4 months ago
- Depth Any Video with Scalable Synthetic Data (ICLR 2025)☆459Updated 3 months ago
- [ICLR 2025 Spotlight] Official implementation for "DynamicCity: Large-Scale 4D Occupancy Generation from Dynamic Scenes"☆161Updated 2 weeks ago
- [ICLR 2025] LAPA: Latent Action Pretraining from Videos☆199Updated 2 months ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆88Updated last week