Genesis-Embodied-AI / GenesisLinks
A generative world for general-purpose robotics & embodied AI learning.
☆27,822Updated last week
Alternatives and similar repositories for Genesis
Users that are interested in Genesis are comparing it to the libraries listed below
Sorting:
- New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos☆8,057Updated 6 months ago
- Open-Sora: Democratizing Efficient Video Production for All☆28,151Updated 7 months ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation.☆4,829Updated 9 months ago
- ☆9,477Updated last week
- Janus-Series: Unified Multimodal Understanding and Generation Models☆17,647Updated 10 months ago
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling☆4,142Updated 3 months ago
- Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).☆11,372Updated last month
- Unified framework for robot learning built on NVIDIA Isaac Sim☆5,830Updated last week
- DeepSeek Coder: Let the Code Write Itself☆22,522Updated last month
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning☆20,312Updated last week
- The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained mode…☆18,095Updated last year
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,188Updated last year
- [IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems☆2,659Updated last week
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆11,486Updated last month
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory"☆7,029Updated 9 months ago
- Qwen3-Coder is the code version of Qwen3, the large language model series developed by Qwen team, Alibaba Cloud.☆14,736Updated 3 weeks ago
- A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites☆4,162Updated 3 weeks ago
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.☆17,293Updated 3 weeks ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.☆25,863Updated 2 months ago
- High-resolution models for human tasks.☆5,255Updated last year
- Official repository for LTX-Video☆8,917Updated 2 months ago
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.☆32,152Updated last week
- gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI☆19,441Updated last month
- NVIDIA Isaac GR00T N1.6 - A Foundation Model for Generalist Robots.☆5,670Updated last week
- aider is AI pair programming in your terminal☆39,167Updated last week
- VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and clou…☆3,701Updated 3 weeks ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,831Updated last week
- Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.☆5,115Updated 8 months ago
- [CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型☆9,643Updated 3 months ago
- Official inference repo for FLUX.1 models☆24,932Updated 4 months ago