open-gigaai / giga-world-0Links
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
☆1,328Updated last month
Alternatives and similar repositories for giga-world-0
Users that are interested in giga-world-0 are comparing it to the libraries listed below
Sorting:
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆2,064Updated 2 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.☆489Updated 3 weeks ago
- Official code of Motus: A Unified Latent Action World Model☆597Updated 3 weeks ago
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Models☆480Updated last week
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"☆268Updated 3 months ago
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆159Updated 3 weeks ago
- ☆545Updated 3 months ago
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environm…☆378Updated last month
- GigaTrain: An Efficient and Scalable Training Framework for AI Models☆915Updated last month
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆181Updated last month
- The accepted paper for cvpr2025.☆50Updated last month
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆177Updated last week
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easy☆818Updated last month
- ☆128Updated 2 months ago
- ☆316Updated 3 months ago
- GigaDatasets: A Unified and Lightweight Framework for Data Processing, Curation, and Visualization☆487Updated 3 weeks ago
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modeling☆65Updated last year
- GigaModels: A Comprehensive Repository and Platform for Multi-modal, Generative, and Perceptual Models☆860Updated last month
- ☆246Updated last year
- This repository contains the code of the paper "IC-World: In-Context Generation for Shared World Modeling".☆123Updated 2 weeks ago
- 🌐 Forging Spatial Intelligence: A Roadmap of Multi-Modal Data Pre-Training for Autonomous Systems☆133Updated 3 weeks ago
- [AAAI 2026 Oral] Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution☆356Updated last month
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025☆55Updated 5 months ago
- Match-Stereo-Videos via Bidirectional Alignment (An update of BiDAStereo)☆83Updated last month
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Models☆215Updated 3 months ago
- ☆30Updated last year
- Official implementation of UniCalli: A Unified Diffusion Framework for Column-Level Generation and Recognition of Chinese Calligraphy☆186Updated this week
- Efficient controlnet for DiTs☆382Updated 8 months ago
- [NeurIPS'2025] Official repository for "LiveStar: Live Streaming Assistant for Real-World Online Video Understanding"☆104Updated 2 months ago
- Wan2.1 with Controlnet☆180Updated 10 months ago