NVIDIA / CosmosLinks
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
☆8,057Updated 6 months ago
Alternatives and similar repositories for Cosmos
Users that are interested in Cosmos are comparing it to the libraries listed below
Sorting:
- A suite of image and video neural tokenizers☆1,694Updated 10 months ago
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,586Updated 3 months ago
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling☆4,142Updated 2 months ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,701Updated 3 weeks ago
- A generative world for general-purpose robotics & embodied AI learning.☆27,822Updated this week
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,878Updated 2 months ago
- NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.☆5,670Updated last week
- ☆3,140Updated 9 months ago
- High-resolution models for human tasks.☆5,255Updated last year
- Solve Visual Understanding with Reinforced VLMs☆5,771Updated 2 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,254Updated 5 months ago
- ☆9,477Updated last week
- Unified framework for robot learning built on NVIDIA Isaac Sim☆5,830Updated this week
- Witness the aha moment of VLM with less than $3.☆4,011Updated 7 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆4,776Updated 9 months ago
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆17,293Updated 3 weeks ago
- Janus-Series: Unified Multimodal Understanding and Generation Models☆17,643Updated 10 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,831Updated this week
- SAM 3D Objects☆4,963Updated last week
- [IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems☆2,659Updated last week
- The repository provides code for running inference and finetuning with the Meta Segment Anything Model 3 (SAM 3), links for downloading t…☆6,138Updated 2 weeks ago
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,948Updated last week
- s1: Simple test-time scaling☆6,620Updated 6 months ago
- MAGI-1: Autoregressive Video Generation at Scale☆3,613Updated 6 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆11,486Updated last month
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆18,095Updated last year
- Cosmos-Reason1 models understand the physical common sense and generate appropriate embodied decisions in natural language through long c…☆860Updated this week
- Wan: Open and Advanced Large-Scale Video Generative Models☆14,974Updated last week
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆20,312Updated last week
- NVIDIA Isaac Sim™ is an open-source application on NVIDIA Omniverse for developing, simulating, and testing AI-driven robots in realistic…☆2,145Updated this week