alibaba-damo-academy / RynnECLinks
RynnEC: Bringing MLLMs into Embodied World
☆382Updated last month
Alternatives and similar repositories for RynnEC
Users that are interested in RynnEC are comparing it to the libraries listed below
Sorting:
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easy☆801Updated last week
- [NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video☆250Updated 3 weeks ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆717Updated 2 weeks ago
- [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding☆515Updated 2 months ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆226Updated 3 weeks ago
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆831Updated 3 weeks ago
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆107Updated last week
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion☆135Updated last year
- Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"☆226Updated this week
- 🔥 OneThinker: All-in-one Reasoning Model for Image and Video☆319Updated last week
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆106Updated this week
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆133Updated this week
- ☆140Updated 8 months ago
- NEO Series: Native Vision-Language Models from First Principles☆502Updated 2 months ago
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"☆174Updated 4 months ago
- Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"☆302Updated 2 weeks ago
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.☆102Updated 9 months ago
- A Unified Driving World Model for Future Generation and Perception☆127Updated 4 months ago
- This is the repository that contains source code for the PhysGen3D.☆233Updated 3 months ago
- Lumina-DiMOO - An Open-Sourced Multi-Modal Large Diffusion Language Model☆910Updated 3 weeks ago
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.☆678Updated last week
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆263Updated last month
- Uncommon Objects in 3D dataset☆1,308Updated last month
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆193Updated last week
- Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation☆370Updated 3 weeks ago
- MixGRPO: Unlocking Flow-based GRPO Efficiency with Mixed ODE-SDE☆1,070Updated 2 months ago
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆138Updated this week
- Unified Multimodal Model for image generation/editing/understanding☆818Updated 3 months ago
- [Tech Report] Few-Step Distillation for Text-to-Image Generation: A Practical Guide☆132Updated this week
- VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model☆1,804Updated last month