Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.
☆103Oct 6, 2025Updated 5 months ago
Alternatives and similar repositories for BoxDreamer
Users that are interested in BoxDreamer are comparing it to the libraries listed below.
- ☆39Oct 30, 2025Updated 4 months ago
- ☆13Nov 26, 2023Updated 2 years ago
- ☆19Mar 9, 2025Updated last year
- Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".☆46Jun 30, 2025Updated 8 months ago
- The official repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)☆112Nov 15, 2025Updated 4 months ago
- [ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation☆173Jun 19, 2025Updated 9 months ago
- [NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning☆91Oct 14, 2024Updated last year
- Orient Anything, ICML 2025☆376Feb 6, 2026Updated last month
- [CVPR 2025] StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models☆305Jan 27, 2026Updated last month
- Official implementation of CVPR24 Highlight paper "Open-vocabulary object 6D pose estimation"☆59May 8, 2025Updated 10 months ago
- [CVPR 2025] Prompt Depth Anything☆1,084Jan 29, 2026Updated last month
- TAPIP3D: Tracking Any Point in Persistent 3D Geometry☆382Dec 28, 2025Updated 2 months ago
- [ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy☆929Feb 27, 2026Updated 3 weeks ago
- Code snippets for understanding common techniques for virtual humans.☆117Feb 6, 2026Updated last month
- [CVPR 2025] TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing☆185May 22, 2025Updated 10 months ago
- ☆709May 1, 2025Updated 10 months ago
- Open source repo for Locate 3D Model, 3D-JEPA and Locate 3D Dataset☆420Jun 3, 2025Updated 9 months ago
- Using message app/bot to notify you when doing time-consuming tasks. Bake your experiments!☆90Feb 16, 2026Updated last month
- CVPR2025 | TASTE-Rob: Advancing Video Generation of Task-Oriented Hand-Object Interaction for Generalizable Robotic Manipulation☆42Jan 29, 2026Updated last month
- Cameras as Relative Positional Encoding☆690Dec 18, 2025Updated 3 months ago
- [ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields☆513Oct 31, 2025Updated 4 months ago
- SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow, CVPR2025☆55Aug 21, 2025Updated 7 months ago
- Official implementation of "Reconstructing Close Human Interaction from Multiple Views"☆40Jan 29, 2024Updated 2 years ago
- [CoRL 2024] Im2Flow2Act: Flow as the Cross-domain Manipulation Interface☆153Oct 17, 2024Updated last year
- [ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"☆516Aug 4, 2025Updated 7 months ago
- ☆25Oct 19, 2024Updated last year
- One-shot 3D Object Canonicalization based on Geometric and Semantic Consistency (CVPR highlight 2025)☆73Dec 15, 2025Updated 3 months ago
- Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024☆127Dec 31, 2024Updated last year
- Improved 3DGS rasterizer.☆128Feb 26, 2025Updated last year
- [CVPR 2026] ZipMap: Linear-Time Stateful 3D Reconstruction via Test-Time Training☆312Mar 6, 2026Updated 2 weeks ago
- [CVPR 2025] LiDAR-RT: Gaussian-based Ray Tracing for Dynamic LiDAR Re-simulation☆188Jul 18, 2025Updated 8 months ago
- FoundPose: Unseen Object Pose Estimation with Foundation Features, ECCV 2024☆118Sep 1, 2025Updated 6 months ago
- [ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling☆581Oct 26, 2025Updated 4 months ago
- Stereo4D dataset and processing code☆298Nov 4, 2025Updated 4 months ago
- [CVPR 2025] EnvGS: Modeling View-Dependent Appearance with Environment Gaussian☆236Mar 17, 2026Updated last week
- [ICLR 2025] Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation☆45Mar 13, 2025Updated last year
- [ICCV 2025] Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models☆581Feb 12, 2026Updated last month
- Official Implementation of the paper RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation☆39Jan 13, 2026Updated 2 months ago
- Code for "Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation", Arxiv 2024☆104Dec 1, 2025Updated 3 months ago