nvidia-cosmos / cosmos-predict2.5
Cosmos-Predict2.5, the latest version of the Cosmos World Foundation Models (WFMs) family, specialized for simulating and predicting the future state of the world in the form of video.
☆787 · Jan 28, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for cosmos-predict2.5
Users who are interested in cosmos-predict2.5 are comparing it to the repositories listed below.
- Cosmos-Transfer2.5, built on top of Cosmos-Predict2.5, produces high-quality world simulations conditioned on multiple spatial control in… ☆452 · Updated this week
- Cosmos-Predict2 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m… ☆736 · Oct 29, 2025 · Updated 3 months ago
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation ☆143 · Jul 5, 2025 · Updated 7 months ago
- Causal video-action world model for generalist robot control ☆647 · Feb 6, 2026 · Updated last week
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m… ☆404 · Jan 6, 2026 · Updated last month
- [ICLR2025] A PyTorch implementation for STORM: Spatiotemporal Reconstruction Model for Large-Scale Outdoor Scenes ☆331 · May 22, 2025 · Updated 8 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling ☆572 · Oct 26, 2025 · Updated 3 months ago
- [CVPR 2024 Highlight] XCube: Large-Scale 3D Generative Modeling using Sparse Voxel Hierarchies ☆525 · Jun 30, 2025 · Updated 7 months ago
- ICCV 2025 | TesserAct: Learning 4D Embodied World Models ☆379 · Aug 4, 2025 · Updated 6 months ago
- Code to load DreamZero model checkpoints and run evaluation on DROID-sim and Genie Sim 3.0 ☆664 · Updated this week
- [NeurIPS 2024] SCube: Instant Large-Scale Scene Reconstruction using VoxSplats ☆519 · Oct 14, 2025 · Updated 4 months ago
- [CVPR 2025 Highlight] GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control ☆1,259 · Sep 24, 2025 · Updated 4 months ago
- [ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling ☆427 · Jan 7, 2026 · Updated last month
- News: the 10k dataset is ready for download. ☆571 · Updated this week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation ☆215 · Dec 9, 2025 · Updated 2 months ago
- [ICLR 2025] EdgeRunner: Auto-regressive Auto-encoder for Efficient Mesh Generation ☆299 · Dec 22, 2024 · Updated last year
- Official implementation of paper "Controllable 3D Outdoor Scene Generation via Scene Graphs" (ICCV 2025) ☆62 · Jul 19, 2025 · Updated 6 months ago
- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis (ECCV 2024 Oral) - Official Implementation ☆282 · Nov 18, 2025 · Updated 2 months ago
- "4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei ☆249 · Jun 24, 2024 · Updated last year
- [T-PAMI 2025] V3D: Video Diffusion Models are Effective 3D Generators ☆514 · Mar 26, 2024 · Updated last year
- Self-reimplemented version of Long-LRM. ☆215 · Dec 17, 2025 · Updated last month
- Official code for the CVPR 2025 paper "Navigation World Models". ☆533 · Nov 24, 2025 · Updated 2 months ago
- Open source impl of **MV-DUSt3R+ Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page … ☆579 · Updated this week
- Official implementation of REArtGS (NeurIPS 2025) ☆19 · Oct 24, 2025 · Updated 3 months ago
- [ICRA 2026] VITRA: Scalable Vision-Language-Action Model Pretraining for Robotic Manipulation with Real-Life Human Activity Videos ☆297 · Updated this week
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy ☆909 · Sep 26, 2025 · Updated 4 months ago
- [ICCV 2025] InfiniCube: Unbounded and Controllable Dynamic 3D Driving Scene Generation with World-Guided Video Models ☆113 · Jan 23, 2026 · Updated 3 weeks ago
- ☆340 · Nov 29, 2024 · Updated last year
- ☆17 · Aug 5, 2025 · Updated 6 months ago
- Official implementation of Continuous 3D Perception Model with Persistent State ☆1,329 · Aug 27, 2025 · Updated 5 months ago
- Dynamic 3D Foundation Model using Causal Transformer. [ICLR 2026] ☆307 · Feb 2, 2026 · Updated last week
- [SIGGRAPH Asia 2025 (ACM TOG)] AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views ☆729 · Dec 22, 2025 · Updated last month
- Code for ICCV'2025 (Best student paper honorable mention) "RayZer: A Self-supervised Large View Synthesis Model" ☆408 · Nov 24, 2025 · Updated 2 months ago
- [ICLR’26] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control ☆94 · Updated this week
- ☆524 · Nov 29, 2023 · Updated 2 years ago
- [CVPR'25 Oral] MoGe: Unlocking Accurate Monocular Geometry Estimation for Open-Domain Images with Optimal Training Supervision ☆2,287 · Nov 2, 2025 · Updated 3 months ago
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process ☆626 · Feb 6, 2026 · Updated last week
- [CVPR 2024 Oral, Best Paper Runner-Up] Code for "pixelSplat: 3D Gaussian Splats from Image Pairs for Scalable Generalizable 3D Reconstruc… ☆1,208 · Jan 13, 2025 · Updated last year
- [TPAMI 2023] SceneDreamer: Unbounded 3D Scene Generation from 2D Image Collections ☆657 · Aug 14, 2024 · Updated last year