UnrealZoo / unrealzoo-gym
Large-scale photo-realistic virtual worlds for embodied AI
☆127Updated last month
Alternatives and similar repositories for unrealzoo-gym:
Users that are interested in unrealzoo-gym are comparing it to the libraries listed below
- Code implementation of CVPR 2024 highlight paper "PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI"☆141Updated 5 months ago
- Generative World Explorer☆141Updated 4 months ago
- Aether: Geometric-Aware Unified World Modeling☆267Updated 3 weeks ago
- [ICLR 2025] Official Implementation of M3: 3D-Spatial Multimodal Memory☆143Updated this week
- [ECCV 2024] ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation☆220Updated 3 weeks ago
- A Simple yet Effective Pathway to Empowering LLaVA to Understand and Interact with 3D World☆249Updated 4 months ago
- ☆53Updated 2 weeks ago
- [CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"☆103Updated this week
- Unifying 2D and 3D Vision-Language Understanding☆69Updated last week
- [CoRL 2024] VLM-Grounder: A VLM Agent for Zero-Shot 3D Visual Grounding☆97Updated 4 months ago
- MetaSpatial leverages reinforcement learning to enhance 3D spatial reasoning in vision-language models (VLMs), enabling more structured, …☆104Updated 3 weeks ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆146Updated last week
- [ICLR 2025 Spotlight] MetaUrban: An Embodied AI Simulation Platform for Urban Micromobility☆160Updated last month
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025)☆88Updated last week
- Cosmos-Predict1 is a collection of general-purpose world foundation models for Physical AI that can be fine-tuned into customized world m…☆155Updated this week
- 🔥 SpatialVLA: a spatial-enhanced vision-language-action model that is trained on 1.1 Million real robot episodes.☆226Updated last month
- [NeurIPS'24] Large Spatial Model: End-to-end Unposed Images to Semantic 3D☆172Updated last week
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆55Updated 3 weeks ago
- ☆58Updated 3 months ago
- Official code for the CVPR 2025 paper "Navigation World Models".☆50Updated last week
- [ICLR 2024 Spotlight] Unified Human-Scene Interaction via Prompted Chain-of-Contacts☆200Updated last week
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation☆90Updated this week
- SceneFun3D ToolKit☆131Updated this week
- This repository contains the implementation of the paper: "ChatCam: Empowering Camera Control through Conversational AI", NeurIPS 2024.☆15Updated 5 months ago
- Official implementation of ECCV24 paper "SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding"☆240Updated last month
- Code for "Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling" (CoRL 2024)☆100Updated 3 months ago
- ☆158Updated 2 months ago
- [CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding☆101Updated 2 weeks ago
- [NeurIPS'24] This repository is the implementation of "SpatialRGPT: Grounded Spatial Reasoning in Vision Language Models"☆170Updated 4 months ago
- PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos☆191Updated this week