HorizonRobotics / EmbodiedGen
Towards a Generative 3D World Engine for Embodied Intelligence
☆274 Updated last week
Alternatives and similar repositories for EmbodiedGen
Users who are interested in EmbodiedGen compare it to the repositories listed below.
- ☆166 Updated 2 weeks ago
- [ICCV 2025] Aether: Geometric-Aware Unified World Modeling ☆429 Updated last month
- PhysX: Physical-Grounded 3D Asset Generation ☆221 Updated this week
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs. ☆54 Updated last month
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025. ☆78 Updated last month
- InteriorGS: 3D Gaussian Splatting Dataset of Semantically Labeled Indoor Scenes ☆80 Updated 2 weeks ago
- [ICLR 2025] Official implementation of Articulate-Anything ☆132 Updated last month
- [ICCV 2025] PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos ☆260 Updated this week
- Code for Streaming 4D Visual Geometry Transformer ☆489 Updated 2 weeks ago
- [ICCV 2025] LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion ☆268 Updated 3 weeks ago
- Orient Anything, ICML 2025 ☆303 Updated 2 months ago
- A diffusion model-based stereo depth estimation framework that can predict and restore noisy depth maps for transparent and specular surf… ☆76 Updated 5 months ago
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI" ☆160 Updated 2 months ago
- 3D-R1: Enhancing Reasoning in 3D VLMs for Unified Scene Understanding ☆52 Updated this week
- TAPIP3D: Tracking Any Point in Persistent 3D Geometry ☆274 Updated 2 months ago
- Generative World Explorer ☆150 Updated last month
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D ☆204 Updated 3 months ago
- Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models ☆140 Updated 2 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation ☆163 Updated last month
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025) ☆120 Updated 3 months ago
- VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction ☆235 Updated last week
- ☆109 Updated 3 months ago
- A curated list of awesome papers for reconstructing 4D spatial intelligence from video. (arXiv 2507.21045) ☆228 Updated this week
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control ☆75 Updated last month
- ☆114 Updated 2 months ago
- ☆20 Updated 2 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and… ☆56 Updated 4 months ago
- Code for "Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling" (CoRL 2024) ☆109 Updated 7 months ago
- ☆107 Updated 5 months ago
- Official Repository of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos" ☆27 Updated 4 months ago