lzylucy / 4dgenLinks
Codebase for paper "Geometry-aware 4D Video Generation for Robot Manipulation"
☆70Updated this week
Alternatives and similar repositories for 4dgen
Users that are interested in 4dgen are comparing it to the libraries listed below
Sorting:
- PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation☆157Updated this week
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆216Updated 2 months ago
- ☆45Updated 2 months ago
- Official implementation of Spatial-Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model☆152Updated last month
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆60Updated 9 months ago
- View-Invariant Policy Learning via Zero-Shot Novel View Synthesis (CoRL 2024)☆27Updated 3 months ago
- ☆139Updated 8 months ago
- Evo-0: Vision-Language-Action Model with Implicit Spatial Understanding.☆51Updated last month
- [CVPR 25] G3Flow: Generative 3D Semantic Flow for Pose-aware and Generalizable Object Manipulation☆93Updated 7 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆171Updated 6 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆100Updated 3 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆66Updated 6 months ago
- ☆67Updated 5 months ago
- Implementation of Prompting with the Future: Open-World Model Predictive Control with Interactive Digital Twins. [RSS 2025]☆46Updated 2 months ago
- [RSS 2025] Novel Demonstration Generation with Gaussian Splatting Enables Robust One-Shot Manipulation☆161Updated 7 months ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆40Updated 3 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆90Updated last year
- MAPLE infuses dexterous manipulation priors from egocentric videos into vision encoders, making their features well-suited for downstream…☆28Updated last month
- [CVPR 2024] GenNBV: Generalizable Next-Best-View Policy for Active 3D Reconstruction☆77Updated 5 months ago
- Sim-to-real and CDM inference code for ManipAsInSim project.☆135Updated last month
- [ICCV'25] Towards Scalable Gaussian World Models for Robotic Manipulation☆70Updated 3 months ago
- GraspSplats: Efficient Manipulation with 3D Feature Splatting☆141Updated last year
- [CVPR 2025] VidBot: Learning Generalizable 3D Actions from In-the-Wild 2D Human Videos for Zero-Shot Robotic Manipulation☆42Updated 6 months ago
- ☆86Updated 3 months ago
- Official Implementation of ARM4R ICML 2025☆53Updated 3 months ago
- A diffusion model-based stereo depth estimation framework that can predict and restore noisy depth maps for transparent and specular surf…☆87Updated 10 months ago
- [ICRA, 2025] SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting☆136Updated 4 months ago
- ☆21Updated 7 months ago
- ☆183Updated 5 months ago
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆43Updated 6 months ago