NVIDIA/cosmos

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NVIDIA/cosmos)

NVIDIA / cosmos

NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomous vehicles, smart infrastructure, and more.

☆11,309

Alternatives and similar repositories for cosmos

Users that are interested in cosmos are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Genesis-Embodied-AI / genesis-world
View on GitHub
Simulation platform for general-purpose robotics & embodied AI learning.
☆29,667Updated this week
NVIDIA / Cosmos-Tokenizer
View on GitHub
A suite of image and video neural tokenizers
☆1,732Feb 11, 2025Updated last year
Physical-Intelligence / openpi
View on GitHub
☆13,064Jun 16, 2026Updated last month
dreamzero0 / dreamzero
View on GitHub
Code to pretrain, fine-tune, and evaluate DreamZero and run sim & real-world evals
☆2,509Apr 19, 2026Updated 3 months ago
NVIDIA / Isaac-GR00T
View on GitHub
NVIDIA Isaac GR00T N1.7 - A Foundation Model for Generalist Robots.
☆7,703Updated this week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
NVIDIA / cosmos-framework
View on GitHub
Our inference and training framework to run on the Cosmos Models
☆424Updated this week
nvidia-cosmos / cosmos-transfer1
View on GitHub
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment…
☆813Jun 7, 2026Updated last month
NVlabs / Sana
View on GitHub
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
☆8,621Updated this week
facebookresearch / vggt
View on GitHub
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
☆14,053May 19, 2026Updated 2 months ago
facebookresearch / vjepa2
View on GitHub
PyTorch code and models for VJEPA2 self-supervised learning from video.
☆4,407Mar 23, 2026Updated 4 months ago
Robbyant / lingbot-va
View on GitHub
[RSS 2026] Causal video-action world model for generalist robot control
☆1,700Jul 9, 2026Updated 3 weeks ago
nvidia-cosmos / cosmos-predict2.5
View on GitHub
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …
☆1,338Jun 8, 2026Updated last month
ByteDance-Seed / Bagel
View on GitHub
Open-source unified multimodal model
☆6,131May 4, 2026Updated 2 months ago
openvla / openvla
View on GitHub
OpenVLA: An open-source vision-language-action model for robotic manipulation.
☆6,736Mar 23, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hao-ai-lab / FastVideo
View on GitHub
A unified inference and post-training framework for accelerated video generation.
☆3,896Updated this week
huggingface / lerobot
View on GitHub
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
☆26,255Updated this week
Robbyant / lingbot-world
View on GitHub
Advancing Open-source World Models
☆4,310Jul 9, 2026Updated 3 weeks ago
OpenDriveLab / AgiBot-World
View on GitHub
[IROS 2025 Best Paper Award Finalist & IEEE TRO 2026] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
☆3,112May 29, 2026Updated 2 months ago
facebookresearch / dinov3
View on GitHub
Reference PyTorch implementation and models for DINOv3
☆11,063Jul 15, 2026Updated 2 weeks ago
buoyancy99 / diffusion-forcing
View on GitHub
code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
☆1,279Jul 6, 2026Updated 3 weeks ago
Wan-Video / Wan2.1
View on GitHub
Wan: Open and Advanced Large-Scale Video Generative Models
☆16,696Mar 5, 2026Updated 4 months ago
facebookresearch / sam2
View on GitHub
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…
☆19,630May 30, 2026Updated 2 months ago
nv-tlabs / GEN3C
View on GitHub
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
☆1,390Jun 15, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nv-tlabs / vipe
View on GitHub
ViPE: Video Pose Engine for Geometric 3D Perception
☆2,056Jun 9, 2026Updated last month
yyfz / Pi3
View on GitHub
[ICLR 2026] π^3: Permutation-Equivariant Visual Geometry Learning
☆2,096Jul 3, 2026Updated 3 weeks ago
nvidia-cosmos / cosmos-predict2
View on GitHub
Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…
☆794Oct 29, 2025Updated 9 months ago
CUT3R / CUT3R
View on GitHub
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,471Aug 27, 2025Updated 11 months ago
QwenLM / Qwen3-VL
View on GitHub
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
☆19,694Jan 30, 2026Updated 6 months ago
nvidia-cosmos / cosmos-reason1
View on GitHub
Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…
☆952Jun 7, 2026Updated last month
NVlabs / cosmos-policy
View on GitHub
Cosmos Policy
☆842Jan 23, 2026Updated 6 months ago
guandeh17 / Self-Forcing
View on GitHub
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
☆3,467Sep 12, 2025Updated 10 months ago
microsoft / MoGe
View on GitHub
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
☆2,719Jul 21, 2026Updated last week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
baaivision / Emu3.5
View on GitHub
Native Multimodal Models are World Learners
☆1,542Dec 30, 2025Updated 7 months ago
ByteDance-Seed / Depth-Anything-3
View on GitHub
Depth Anything 3
☆6,008Updated this week
yuantianyuan01 / FastWAM
View on GitHub
Official codebase for Fast-WAM: Do World Action Models Need Test-time Future Imagination?
☆1,225Apr 3, 2026Updated 3 months ago
RoboVerseOrg / RoboVerse
View on GitHub
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
☆1,792Updated this week
Junyi42 / monst3r
View on GitHub
Official Implementation of paper "MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion"
☆1,383Jun 16, 2025Updated last year
isaac-sim / IsaacLab
View on GitHub
Unified framework for robot learning built on NVIDIA Isaac Sim
☆7,799Updated this week
bytetriper / RAE
View on GitHub
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,982Feb 25, 2026Updated 5 months ago