nvidia-cosmos / cosmos-predict2.5

Cosmos-Predict2.5 is the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.

☆787

Alternatives and similar repositories for cosmos-predict2.5

Users who are interested in cosmos-predict2.5 are comparing it to the repositories listed below.

nvidia-cosmos / cosmos-transfer2.5
Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in…
☆452 · Updated this week
nvidia-cosmos / cosmos-predict2
Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…
☆736 · Oct 29, 2025 · Updated 3 months ago
XDimLab / Prometheus
[CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation
☆143 · Jul 5, 2025 · Updated 7 months ago
Robbyant / lingbot-va
Causal video-action world model for generalist robot control
☆647 · Feb 6, 2026 · Updated last week
nvidia-cosmos / cosmos-predict1
Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…
☆404 · Jan 6, 2026 · Updated last month
NVlabs / GaussianSTORM
[ICLR2025] A PyTorch implementation for STORM: Spatiotemporal Reconstruction Model for Large-Scale Outdoor Scenes
☆331 · May 22, 2025 · Updated 8 months ago
InternRobotics / Aether
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
☆572 · Oct 26, 2025 · Updated 3 months ago
nv-tlabs / XCube
[CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies
☆525 · Jun 30, 2025 · Updated 7 months ago
UMass-Embodied-AGI / TesserAct
ICCV 2025 | TesserAct: Learning 4D Embodied World Models
☆379 · Aug 4, 2025 · Updated 6 months ago
dreamzero0 / dreamzero
Code to load DreamZero model checkpoints and run evaluation on DROID-sim and Genie Sim 3.0
☆664 · Updated this week
nv-tlabs / SCube
[NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats
☆519 · Oct 14, 2025 · Updated 4 months ago
nv-tlabs / GEN3C
[CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control
☆1,259 · Sep 24, 2025 · Updated 4 months ago
yangzhou24 / OmniWorld
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆427 · Jan 7, 2026 · Updated last month
DL3DV-10K / Dataset
News: the 10k dataset is ready for download.
☆571 · Updated this week
haoyi-duan / WorldScore
Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation
☆215 · Dec 9, 2025 · Updated 2 months ago
NVlabs / EdgeRunner
[ICLR 2025] EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation
☆299 · Dec 22, 2024 · Updated last year
yuhengliu02 / control-3d-scene
Official implementation of paper "Controllable 3D Outdoor Scene Generation via Scene Graphs" (ICCV 2025)
☆62 · Jul 19, 2025 · Updated 6 months ago
basilevh / gcd
Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation
☆282 · Nov 18, 2025 · Updated 2 months ago
VITA-Group / 4DGen
"4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei
☆249 · Jun 24, 2024 · Updated last year
heheyas / V3D
[T-PAMI 2025] V3D: Video Diffusion Models are Effective 3D Generators
☆514 · Mar 26, 2024 · Updated last year
arthurhero / Long-LRM
Self-reimplemented version of Long-LRM.
☆215 · Dec 17, 2025 · Updated last month
facebookresearch / nwm
Official code for the CVPR 2025 paper "Navigation World Models".
☆533 · Nov 24, 2025 · Updated 2 months ago
facebookresearch / mvdust3r
Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page …
☆579 · Updated this week
wd-ustc-cs / REArtGS
Official implementation of REArtGS (NeurIPS 2025)
☆19 · Oct 24, 2025 · Updated 3 months ago
microsoft / VITRA
[ICRA 2026] VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos
☆297 · Updated this week
henry123-boy / SpaTrackerV2
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆909 · Sep 26, 2025 · Updated 4 months ago
nv-tlabs / InfiniCube
[ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models
☆113 · Jan 23, 2026 · Updated 3 weeks ago
JiahuiLei / MoSca
☆340 · Nov 29, 2024 · Updated last year
Tencent-Hunyuan / Hunyuan-4B
☆17 · Aug 5, 2025 · Updated 6 months ago
CUT3R / CUT3R
Official implementation of Continuous 3D Perception Model with Persistent State
☆1,329 · Aug 27, 2025 · Updated 5 months ago
NIRVANALAN / STream3R
Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026]
☆307 · Feb 2, 2026 · Updated last week
InternRobotics / AnySplat
[SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views
☆729 · Dec 22, 2025 · Updated last month
hwjiang1510 / RayZer
Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model"
☆408 · Nov 24, 2025 · Updated 2 months ago
KlingAIResearch / RoboMaster
[ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control
☆94 · Updated this week
kylesargent / ZeroNVS
☆524 · Nov 29, 2023 · Updated 2 years ago
microsoft / MoGe
[CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision
☆2,287 · Nov 2, 2025 · Updated 3 months ago
NVlabs / DiffusionNFT
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆626 · Feb 6, 2026 · Updated last week
dcharatan / pixelsplat
[CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruc…
☆1,208 · Jan 13, 2025 · Updated last year
FrozenBurning / SceneDreamer
[TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections
☆657 · Aug 14, 2024 · Updated last year