NVIDIA / CosmosLinks
New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos
☆8,057Updated 3 months ago
Alternatives and similar repositories for Cosmos
Users that are interested in Cosmos are comparing it to the libraries listed below
Sorting:
- A suite of image and video neural tokenizers☆1,668Updated 7 months ago
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,539Updated last month
- High-resolution models for human tasks.☆5,143Updated 9 months ago
- ☆4,971Updated last week
- PyTorch code and models for VJEPA2 self-supervised learning from video.☆2,177Updated 2 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,491Updated this week
- A generative world for general-purpose robotics & embodied AI learning.☆27,236Updated this week
- SpatialLM: Training Large Language Models for Structured Indoor Modeling☆3,942Updated 2 weeks ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents☆1,802Updated 3 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆3,819Updated 5 months ago
- NVIDIA Isaac GR00T N1.5 is the world's first open foundation model for generalized humanoid robot reasoning and skills.☆4,881Updated last week
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆6,947Updated 5 months ago
- Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆12,483Updated 4 months ago
- [IROS 2025] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems☆2,353Updated this week
- PyTorch code and models for V-JEPA self-supervised learning from video.☆3,200Updated 6 months ago
- MAGI-1: Autoregressive Video Generation at Scale☆3,481Updated 3 months ago
- DIAMOND (DIffusion As a Model Of eNvironment Dreams) is a reinforcement learning agent trained in a diffusion world model. NeurIPS 2024 S…☆1,864Updated 9 months ago
- Unified framework for robot learning built on NVIDIA Isaac Sim☆4,832Updated this week
- CoTracker is a model for tracking any point (pixel) on a video.☆4,553Updated 7 months ago
- ☆3,462Updated 6 months ago
- The best OSS video generation models, created by Genmo☆3,394Updated last week
- ☆3,511Updated last year
- Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.☆4,710Updated last month
- State-of-the-art Image & Video CLIP, Multimodal Large Language Models, and More!☆1,607Updated last week
- Open-source unified multimodal model☆4,993Updated 3 weeks ago
- Reference PyTorch implementation and models for DINOv3☆6,749Updated last week
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆17,289Updated this week
- [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior☆2,968Updated 4 months ago
- ☆3,108Updated 6 months ago
- The simplest, fastest repository for training/finetuning small-sized VLMs.☆4,026Updated last week