nvidia-cosmos / cosmos-predict1Links

Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world models for downstream applications.

☆378

Alternatives and similar repositories for cosmos-predict1

Users that are interested in cosmos-predict1 are comparing it to the libraries listed below

Sorting:

nvidia-cosmos / cosmos-transfer1
Cosmos-Transfer1 is a world-to-world transfer model designed to bridge the perceptual divide between simulated and real-world environment…
☆729Updated 3 weeks ago
nvidia-cosmos / cosmos-predict2
Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…
☆670Updated 3 weeks ago
nvidia-cosmos / cosmos-transfer2.5
Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in…
☆205Updated last week
nvidia-cosmos / cosmos-predict2.5
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the …
☆410Updated last week
nvidia-cosmos / cosmos-reason1
Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…
☆799Updated last week
facebookresearch / locate-3d
Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset
☆387Updated 5 months ago
facebookresearch / nwm
Official code for the CVPR 2025 paper "Navigation World Models".
☆440Updated 3 months ago
kwsong0113 / diffusion-forcing-transformer
[ICML 2025] Official PyTorch Implementation of "History-Guided Video Diffusion"
☆554Updated 4 months ago
ZCMax / LLaVA-3D
[ICCV 2025] A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World
☆348Updated last month
GenEx-world / genex
Generative World Explorer
☆159Updated 5 months ago
yangzhou24 / OmniWorld
OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆389Updated last week
Little-Podi / AdaWorld
[ICML'25] The PyTorch implementation of paper: "AdaWorld: Learning Adaptable World Models with Latent Actions".
☆175Updated 5 months ago
InternRobotics / Aether
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
☆534Updated 3 weeks ago
NVIDIA / GR00T-Dreams
Nvidia GEAR Lab's initiative to solve the robotics data problem using world models
☆380Updated 3 weeks ago
knightnemo / Awesome-World-Models
A Curated List of Awesome Works in World Modeling, Aiming to Serve as a One-stop Resource for Researchers, Practitioners, and Enthusiasts…
☆782Updated last week
google-deepmind / physics-IQ-benchmark
Benchmarking physical understanding in generative video models
☆219Updated 3 weeks ago
InternRobotics / Infinite-Mobility
☆177Updated 3 months ago
phyworld / phyworld
☆150Updated 10 months ago
HorizonRobotics / EmbodiedGen
Towards a Generative 3D World Engine for Embodied Intelligence
☆341Updated last week
diankun-wu / Spatial-MLLM
Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence
☆386Updated 5 months ago
SpatialVision / Orient-Anything
Orient Anything, ICML 2025
☆348Updated last month
behavior-vision-suite / behavior-vision-suite.github.io
☆170Updated 9 months ago
xizaoqu / WorldMem
[NeurIPS 2025] WorldMem: Long-term Consistent World Simulation with Memory
☆268Updated 3 weeks ago
VITA-Group / VLM-3R
VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
☆295Updated 2 months ago
myscience / open-genie
Pytorch implementation of "Genie: Generative Interactive Environments", Bruce et al. (2024).
☆226Updated last year
MaureenZOU / m3-spatial
[ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory
☆190Updated 6 months ago
ShuangLI59 / unified_video_action
Official PyTorch Implementation of Unified Video Action Model (RSS 2025)
☆291Updated 3 months ago
LatentActionPretraining / LAPA
[ICLR 2025] LAPA: Latent Action Pretraining from Videos
☆403Updated 10 months ago
ByteDance-Seed / TraceAnything
Trace Anything: Representing Any Video in 4D via Trajectory Fields
☆406Updated 3 weeks ago
nvidia-cosmos / cosmos-curate
Cosmos-Curate is a powerful video curation system that processes, analyzes, and organizes video content using advanced AI models and dist…
☆107Updated this week