alibaba-damo-academy / RynnECLinks
RynnEC: Bringing MLLMs into Embodied World
☆383Updated 3 months ago
Alternatives and similar repositories for RynnEC
Users that are interested in RynnEC are comparing it to the libraries listed below
Sorting:
- Official code of Motus: A Unified Latent Action World Model☆597Updated 3 weeks ago
- 4DNeX: Feed-Forward 4D Generative Modeling Made Easy☆818Updated last month
- [NeurIPS 2025 Spotlight] Towards Understanding Camera Motions in Any Video☆265Updated 2 months ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆179Updated last month
- [NeurIPS 2025 Spotlight] A Native Multimodal LLM for 3D Generation and Understanding☆538Updated 3 months ago
- ☆140Updated 10 months ago
- 🔥 The first open-sourced diffusion vision-langauge-action model.☆159Updated 3 weeks ago
- G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning☆257Updated 2 weeks ago
- GigaWorld-0: World Models as Data Engine to Empower Embodied AI☆1,328Updated last month
- [ICRA 2025] PUGS: Zero-shot Physical Understanding with Gaussian Splatting.☆104Updated 10 months ago
- NEO Series: Native Vision-Language Models from First Principles☆637Updated 3 weeks ago
- 🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World☆177Updated last week
- Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views☆181Updated last month
- This is the repository that contains source code for the PhysGen3D.☆240Updated 4 months ago
- [AAAI 2026 🔥] Official implementation of "NeuralGS: Bridging Neural Fields and 3D Gaussian Splatting for Compact 3D Representation"☆176Updated 5 months ago
- OmniNWM: Omniscient Navigation World Models for Autonomous Driving☆269Updated 3 months ago
- [CVPR2024] Instruct 4D-to-4D: Editing 4D Scenes as Pseudo-3D Scenes Using 2D Diffusion☆136Updated last year
- [CORL 2025 Oral]One View, Many Worlds: Single-Image to 3D Object Meets Generative Domain Randomization for One-Shot 6D Pose Estimation.☆446Updated 5 months ago
- [ICLR 2026] Thinking with Camera: A Unified Multimodal Model for Camera-Centric Understanding and Generation☆378Updated this week
- A Unified Driving World Model for Future Generation and Perception☆134Updated 6 months ago
- [NeurIPS 2025 DB Track] 3EED: Ground Everything Everywhere in 3D☆200Updated last month
- Uncommon Objects in 3D dataset☆1,310Updated 2 months ago
- 🔥 OneThinker: All-in-one Reasoning Model for Image and Video☆380Updated 3 weeks ago
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.☆706Updated last week
- 🌐 3D and 4D World Modeling: A Survey☆783Updated 2 weeks ago
- Official repo for "GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization"☆248Updated last week
- GigaBrain-0: A World Model-Powered Vision-Language-Action Model☆2,064Updated 2 months ago
- Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation"☆85Updated last month
- Official implementation of "JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization"☆313Updated 3 weeks ago
- Wan2.1 with Controlnet☆180Updated 10 months ago