NVIDIA / Cosmos
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
☆8,022 Updated 2 weeks ago
Alternatives and similar repositories for Cosmos
Users who are interested in Cosmos are comparing it to the libraries listed below.
- SpatialLM: Training Large Language Models for Structured Indoor Modeling ☆3,352 Updated last week
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆11,094 Updated last month
- Wan: Open and Advanced Large-Scale Video Generative Models ☆12,263 Updated last week
- ☆3,660 Updated this week
- A generative world for general-purpose robotics & embodied AI learning. ☆25,315 Updated this week
- A suite of image and video neural tokenizers ☆1,638 Updated 4 months ago
- ☆3,363 Updated 3 months ago
- NVIDIA Isaac GR00T N1.5 is the world's first open foundation model for generalized humanoid robot reasoning and skills. ☆4,174 Updated last week
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ☆3,077 Updated 2 months ago
- The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems ☆2,115 Updated this week
- Official repository for LTX-Video ☆6,745 Updated 3 weeks ago
- Open-source unified multimodal model ☆4,204 Updated this week
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning ☆15,086 Updated this week
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents ☆1,712 Updated 3 weeks ago
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory" ☆6,846 Updated 3 months ago
- PyTorch code and models for VJEPA2 self-supervised learning from video. ☆1,331 Updated this week
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,279 Updated 2 weeks ago
- Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight). ☆9,828 Updated 3 weeks ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou… ☆3,344 Updated this week
- The simplest, fastest repository for training/finetuning small-sized VLMs. ☆3,418 Updated this week
- [CVPR2025 Highlight] Video Generation Foundation Models: https://saiyan-world.github.io/goku/ ☆2,853 Updated 4 months ago
- Janus-Series: Unified Multimodal Understanding and Generation Models ☆17,380 Updated 4 months ago
- s1: Simple test-time scaling ☆6,447 Updated last month
- Easily fine-tune, evaluate and deploy Qwen3, DeepSeek-R1, Llama 4 or any open source LLM / VLM! ☆8,187 Updated this week
- Minimal reproduction of DeepSeek R1-Zero ☆11,909 Updated last month
- Official PyTorch implementation of One-Minute Video Generation with Test-Time Training ☆1,620 Updated 2 weeks ago
- ☆3,025 Updated 3 months ago
- Unified framework for robot learning built on NVIDIA Isaac Sim ☆3,929 Updated this week
- Witness the aha moment of VLM with less than $3. ☆3,768 Updated last month
- The python library for real-time communication ☆4,037 Updated last week