open-gigaai / giga-world-0Links
GigaWorld-0: World Models as Data Engine to Empower Embodied AI
β717Updated 2 weeks ago
Alternatives and similar repositories for giga-world-0
Users that are interested in giga-world-0 are comparing it to the libraries listed below
Sorting:
- GigaBrain-0: A World Model-Powered Vision-Language-Action Modelβ831Updated 3 weeks ago
- π₯ The first open-sourced diffusion vision-langauge-action model.β138Updated this week
- Official implementation for "HA-VLN 2.0: An Open Benchmark and Leaderboard for Human-Aware Navigation in Discrete and Continuous Environmβ¦β371Updated last month
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easyβ801Updated last week
- β545Updated last month
- β294Updated 2 months ago
- Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"β259Updated last month
- DeepThinkVLA: Enhancing Reasoning Capability of Vision-Language-Action Modelsβ450Updated last week
- GigaTrain: An Efficient and Scalable Training Framework for AI Modelsβ361Updated 2 weeks ago
- [NeurIPS2024] MVGamba: Unify 3D Content Generation as State Space Sequence Modelingβ65Updated last year
- [AAAI 2026 π₯] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"β174Updated 4 months ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025β54Updated 3 months ago
- β248Updated 11 months ago
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction.β209Updated 3 months ago
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Modelβ1,804Updated last month
- Wan2.1 with Controlnetβ178Updated 8 months ago
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.β102Updated 9 months ago
- [CVPR 2025] The code and model for our paper "Shadow Generation Using Diffusion Model with Geometry Prior", CVPR, 2025.β139Updated 3 weeks ago
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matchingβ377Updated 2 weeks ago
- β30Updated last year
- [NeurIPS 2025] More Than Generation: Unifying Generation and Depth Estimation via Text-to-Image Diffusion Modelsβ214Updated last month
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.β678Updated last week
- Efficient controlnet for DiTsβ382Updated 7 months ago
- Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Betterβ184Updated 6 months ago
- Efficient DiT architecture for text2any tasks, ICLR2025β449Updated 7 months ago
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusionβ135Updated last year
- [CVPR 2024 Highlight] DiVa360 datasetβ95Updated 5 months ago
- RealMirror, a comprehensive, open-source embodied AI VLA platform.β157Updated last week
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDEβ1,070Updated 2 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Viewsβ107Updated last week