Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.
☆452 · Jun 7, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for cosmos-predict1
Users who are interested in cosmos-predict1 are comparing it to the repositories listed below.
- Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment… ☆807 · Jun 7, 2026 · Updated 2 weeks ago
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c… ☆952 · Jun 7, 2026 · Updated 2 weeks ago
- Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m… ☆783 · Oct 29, 2025 · Updated 7 months ago
- A suite of image and video neural tokenizers ☆1,725 · Feb 11, 2025 · Updated last year
- NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomou… ☆10,218 · Updated this week
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control ☆1,367 · Updated this week
- Official Github Repo for GEM ☆113 · Oct 16, 2025 · Updated 8 months ago
- Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications. ☆449 · Updated this week
- Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the … ☆1,284 · Jun 8, 2026 · Updated last week
- DriveArena: A Closed-loop Generative Simulation Platform for Autonomous Driving ☆458 · Sep 17, 2025 · Updated 9 months ago
- A Video Tokenizer Evaluation Dataset ☆158 · Jan 13, 2025 · Updated last year
- [ICCV 2025] Stag-1: Towards Realistic 4D Driving Simulation with Video Generation Model ☆97 · Dec 10, 2024 · Updated last year
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models ☆856 · Dec 17, 2025 · Updated 6 months ago
- [CVPR 2024] A world model for autonomous driving. ☆434 · Dec 7, 2023 · Updated 2 years ago
- [NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats ☆527 · Oct 14, 2025 · Updated 8 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆595 · Oct 26, 2025 · Updated 7 months ago
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation ☆285 · Dec 9, 2025 · Updated 6 months ago
- [NeurIPS 2024] Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models ☆343 · Jan 21, 2025 · Updated last year
- Official implementation of Continuous 3D Perception Model with Persistent State ☆1,436 · Aug 27, 2025 · Updated 9 months ago
- Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth? ☆147 · Feb 11, 2025 · Updated last year
- [ECCV 2024] WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation ☆114 · Feb 6, 2025 · Updated last year
- ☆38 · Dec 25, 2025 · Updated 5 months ago
- OASim: an Open and Adaptive Simulator based on Neural Rendering for Autonomous Driving ☆246 · Aug 27, 2025 · Updated 9 months ago
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias" ☆538 · Aug 4, 2025 · Updated 10 months ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models ☆1,615 · Mar 3, 2026 · Updated 3 months ago
- Code for FLIP: Flow-Centric Generative Planning for General-Purpose Manipulation Tasks ☆83 · Dec 12, 2024 · Updated last year
- [ICCV 2025] Repo for Objaverse++, Curated 3D Object Dataset with Quality Annotations ☆109 · Dec 4, 2025 · Updated 6 months ago
- Code release for https://kovenyu.com/WonderWorld/ ☆734 · Apr 14, 2025 · Updated last year
- Project Lyra: Open Generative 3D World Models ☆2,091 · Jun 11, 2026 · Updated last week
- [CVPR'25 Oral] Official implementation for "DiffusionRenderer: Neural Inverse and Forward Rendering with Video Diffusion Models" ☆396 · Jun 13, 2025 · Updated last year
- Cosmos-Transfer1-7B-Sample-AV Toolkits ☆47 · Jun 11, 2025 · Updated last year
- Stereo4D dataset and processing code ☆309 · Nov 4, 2025 · Updated 7 months ago
- [ICCV 2025] TesserAct: Learning 4D Embodied World Models ☆400 · Aug 4, 2025 · Updated 10 months ago
- Cosmos-Reason2 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c… ☆418 · Jun 7, 2026 · Updated 2 weeks ago
- [ICLR 2025] EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation ☆311 · Dec 22, 2024 · Updated last year
- Fillerbuster: Multi-View Scene Completion for Casual Captures ☆113 · Feb 13, 2025 · Updated last year
- ViPE: Video Pose Engine for Geometric 3D Perception ☆1,990 · Jun 9, 2026 · Updated last week
- Official Implementation of Driv3R ☆108 · Dec 12, 2024 · Updated last year
- [IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems ☆3,060 · May 29, 2026 · Updated 3 weeks ago