NVIDIA / CosmosLinks
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
☆8,057Updated 2 months ago
Alternatives and similar repositories for Cosmos
Users that are interested in Cosmos are comparing it to the libraries listed below
Sorting:
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆3,574Updated 4 months ago
- A suite of image and video neural tokenizers☆1,666Updated 6 months ago
- ☆4,305Updated last week
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,482Updated 2 weeks ago
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,039Updated this week
- High-resolution models for human tasks.☆5,118Updated 9 months ago
- A generative world for general-purpose robotics & embodied AI learning.☆27,051Updated this week
- SpatialLM: Training Large Language Models for Structured Indoor Modeling☆3,750Updated 3 weeks ago
- Reference PyTorch implementation and models for DINOv3☆2,877Updated this week
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,778Updated 2 months ago
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention☆3,123Updated last month
- MAGI-1: Autoregressive Video Generation at Scale☆3,452Updated 2 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,442Updated last month
- Sky-T1: Train your own O1 preview model within $450☆3,319Updated last month
- ☆3,459Updated 5 months ago
- NVIDIA Isaac GR00T N1.5 is the world's first open foundation model for generalized humanoid robot reasoning and skills.☆4,682Updated this week
- Open-source unified multimodal model☆4,829Updated last month
- Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.☆4,745Updated 3 months ago
- s1: Simple test-time scaling☆6,533Updated last month
- Easily fine-tune, evaluate and deploy gpt-oss, Qwen3, DeepSeek-R1, or any open source LLM / VLM!☆8,400Updated this week
- Minimal reproduction of DeepSeek R1-Zero☆12,121Updated 3 months ago
- Witness the aha moment of VLM with less than $3.☆3,902Updated 3 months ago
- [IROS 2025] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems☆2,290Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆3,907Updated this week
- This package contains the original 2012 AlexNet code.☆2,692Updated 5 months ago
- Wan: Open and Advanced Large-Scale Video Generative Models☆3,475Updated 2 weeks ago
- Unified framework for robot learning built on NVIDIA Isaac Sim☆4,617Updated this week
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆16,580Updated last week
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 S…☆1,852Updated 8 months ago
- Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your resea…☆4,664Updated 4 months ago