Genesis-Embodied-AI / Genesis
A generative world for general-purpose robotics & embodied AI learning.
☆27,568 Updated this week
Alternatives and similar repositories for Genesis
Users who are interested in Genesis are comparing it to the libraries listed below.
- New repo collection for NVIDIA Cosmos: https://github.com/nvidia-cosmos ☆8,067 Updated 5 months ago
- [NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling ☆4,080 Updated last month
- Unified framework for robot learning built on NVIDIA Isaac Sim ☆5,422 Updated this week
- [IROS 2025 Award Finalist] The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems ☆2,596 Updated 2 weeks ago
- NVIDIA Isaac GR00T N1.5 - A Foundation Model for Generalist Robots. ☆5,322 Updated last week
- ☆8,700 Updated 3 weeks ago
- OpenVLA: An open-source vision-language-action model for robotic manipulation. ☆4,344 Updated 7 months ago
- 🐫 CAMEL: The first and the best multi-agent framework. Finding the Scaling Law of Agents. https://www.camel-ai.org ☆14,740 Updated this week
- Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud. ☆16,124 Updated 2 weeks ago
- A Python framework for accelerated simulation, data generation and spatial computing. ☆5,754 Updated last week
- The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoi… ☆52,429 Updated last year
- CoTracker is a model for tracking any point (pixel) on a video. ☆4,663 Updated 9 months ago
- [CVPR 2025] Magma: A Foundation Model for Multimodal AI Agents ☆1,844 Updated last month
- Official repository of "SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory" ☆6,994 Updated 7 months ago
- Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight). ☆10,939 Updated last week
- A simple screen parsing tool towards pure vision based GUI agent ☆23,813 Updated 2 months ago
- Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud. ☆25,335 Updated last month
- LLM training in simple, raw C/CUDA ☆28,139 Updated 4 months ago
- ☆100,209 Updated 2 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM. ☆48,036 Updated this week
- ☆2,516 Updated 3 months ago
- Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi… ☆9,073 Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond. ☆23,915 Updated last year
- [CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation ☆7,830 Updated last year
- Wan: Open and Advanced Large-Scale Video Generative Models ☆14,667 Updated 3 months ago
- Infinite Photorealistic Worlds using Procedural Generation ☆6,696 Updated 3 weeks ago
- This package contains the original 2012 AlexNet code. ☆2,766 Updated 8 months ago
- PyTorch code and models for VJEPA2 self-supervised learning from video. ☆2,399 Updated 2 months ago
- 🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning ☆19,188 Updated this week
- ☆14,100 Updated 3 weeks ago