alibaba-damo-academy / RynnECLinks
RynnEC: Bringing MLLMs into Embodied World
☆380Updated last month
Alternatives and similar repositories for RynnEC
Users that are interested in RynnEC are comparing it to the libraries listed below
Sorting:
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆103Updated this week
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easy☆789Updated last month
- [NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video☆248Updated this week
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆170Updated this week
- [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding☆510Updated last month
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆132Updated this week
- ☆140Updated 8 months ago
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.☆101Updated 8 months ago
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"☆174Updated 3 months ago
- Uncommon Objects in 3D dataset☆1,307Updated 2 weeks ago
- A Unified Driving World Model for Future Generation and Perception☆126Updated 4 months ago
- This is the repository that contains source code for the PhysGen3D.☆231Updated 2 months ago
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion☆134Updated last year
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆252Updated last month
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model☆1,723Updated last week
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆164Updated 3 weeks ago
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆86Updated this week
- [CORL 2025 Oral]One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.☆422Updated 3 months ago
- [ICCV2025 Highlight] Stereo Any Video: Temporally Consistent Stereo Matching☆372Updated 4 months ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆68Updated this week
- Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model☆893Updated last week
- [ICCV 2025] SAM2Long: Enhancing SAM 2 for Long Video Segmentation with a Training-Free Memory Tree☆533Updated 4 months ago
- Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"☆292Updated last month
- Unified Multimodal Model for image generation/editing/understanding☆811Updated 2 months ago
- Wan2.1 with Controlnet☆177Updated 8 months ago
- Official Implementation of Puzzles: Unbounded Video-Depth Augmentation for Scalable, End-to-End 3D Reconstruction.☆209Updated 2 months ago
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.☆635Updated last week
- Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation☆347Updated this week
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE☆1,060Updated last month
- [ICLR 2025] This is official implements of Swift4d: Adaptive divide-and-conquer Gaussian Splatting for compact and efficient reconstructi…☆138Updated 3 weeks ago