zju3dv / BoxDreamerLinks
Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.
☆82Updated 2 months ago
Alternatives and similar repositories for BoxDreamer
Users that are interested in BoxDreamer are comparing it to the libraries listed below
Sorting:
- InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆164Updated last week
- ☆84Updated last week
- ☆58Updated 2 months ago
- Official implementation of CVPR25 paper "Decompositional Neural Scene Reconstruction with Generative Diffusion Prior"☆94Updated 5 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 5 months ago
- [ICCV 2025] ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting☆47Updated this week
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆59Updated 2 months ago
- [ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects☆81Updated 5 months ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆16Updated this week
- [CVPR 2024] Official Implementation of the paper "CAGE: Controllable Articulation GEneration"☆81Updated 6 months ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆34Updated last month
- ☆121Updated 5 months ago
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆30Updated this week
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆123Updated 5 months ago
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆46Updated 9 months ago
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆80Updated last year
- ☆38Updated 6 months ago
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆25Updated 2 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆79Updated 2 months ago
- PhyRecon: Physically Plausible Neural Scene Reconstruction☆171Updated 6 months ago
- [CVPR 2025] Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video☆194Updated 3 months ago
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆25Updated 3 weeks ago
- ☆80Updated last year
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆63Updated 3 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆39Updated last month
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆127Updated 3 months ago
- ☆37Updated last year
- Open-world 3D part segmentation of point clouds☆85Updated last month
- [ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis☆86Updated last month
- OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling☆264Updated last week