zju3dv/BoxDreamer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zju3dv/BoxDreamer)

zju3dv / BoxDreamer

Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.

☆108

Alternatives and similar repositories for BoxDreamer

Users that are interested in BoxDreamer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dadwadw233 / VibePortrait
View on GitHub
🎭 Know yourself as a developer. One command → AI analyzes your coding history → beautiful personality portrait + persona skill. Works wi…
☆25Apr 8, 2026Updated 3 months ago
dadwadw233 / central-voting-ppf
View on GitHub
🎯 Point cloud 6DoF pose estimation via Central Voting PPF (C++ reproduction of TIP 2021 paper).
☆13Nov 26, 2023Updated 2 years ago
ghy0324 / Watch2Read
View on GitHub
将 B 站视频转化为结构化的 Markdown 阅读笔记 —— 看视频太慢，不如读笔记。
☆62Updated this week
zju3dv / PointSplat
View on GitHub
[ECCV 2026] PointSplat: Compact Gaussian Splatting via Human-Centric Prediction
☆45Jul 14, 2026Updated last week
ByteDance-Seed / TraceAnything
View on GitHub
[ICLR 2026] Trace Anything: Representing Any Video in 4D via Trajectory Fields
☆543Oct 31, 2025Updated 8 months ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
HaoyiZhu / SPA
View on GitHub
[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation
☆177Jun 19, 2025Updated last year
zju3dv / Murre
View on GitHub
Code for "Multi-view Reconstruction via SfM-guided Monocular Depth Estimation". CVPR 2025 (Oral Presentation)
☆378May 24, 2026Updated 2 months ago
zju3dv / EgoAgent
View on GitHub
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆53Jun 30, 2025Updated last year
zju3dv / Hierarchy_UGP
View on GitHub
☆39Oct 30, 2025Updated 8 months ago
xiaoxiao0406 / VQ-VLA
View on GitHub
The offical repo for paper "VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers" (ICCV 2025)
☆134Nov 15, 2025Updated 8 months ago
henry123-boy / SpaTrackerV2
View on GitHub
[ICCV 2025] SpatialTrackerV2: 3D Point Tracking Made Easy
☆984Feb 27, 2026Updated 4 months ago
facebookresearch / 4DGT
View on GitHub
[NeurIPS 2025 (Spotlight)] The implementation for the paper "4DGT Learning a 4D Gaussian Transformer Using Real-World Monocular Videos"
☆468Sep 19, 2025Updated 10 months ago
ethz-vlg / mvtracker
View on GitHub
[ICCV 2025 Oral] MVTracker: Multi-view 3D Point Tracking
☆511Nov 3, 2025Updated 8 months ago
zju3dv / Diffuman4D
View on GitHub
[ICCV 2025] Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
☆615Apr 10, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HaoyiZhu / PointCloudMatters
View on GitHub
[NeurIPS 2024 D&B] Point Cloud Matters: Rethinking the Impact of Different Observation Spaces on Robot Learning
☆91Oct 14, 2024Updated last year
yangzhou24 / RealGRPO
View on GitHub
A Simple Way to Eliminate Reward Hacking in GRPO Diffusion Alignment
☆21Apr 14, 2026Updated 3 months ago
cupid3d / Cupid
View on GitHub
[CVPR'26 Highlight] Cupid: A 3D generator that links 2D image with camera
☆216Mar 3, 2026Updated 4 months ago
zehongs / RelativePose
View on GitHub
☆19Mar 9, 2025Updated last year
SpatialVision / Orient-Anything
View on GitHub
Orient Anything, ICML 2025
☆391Feb 6, 2026Updated 5 months ago
liruilong940607 / prope
View on GitHub
Cameras as Relative Positional Encoding
☆741Dec 18, 2025Updated 7 months ago
DepthAnything / PromptDA
View on GitHub
[CVPR 2025] Prompt Depth Anything
☆1,146Jan 29, 2026Updated 5 months ago
zju3dv / SAM-Graph
View on GitHub
Code for "SAM-guided Graph Cut for 3D Instance Segmentation" ECCV 2024
☆129Dec 31, 2024Updated last year
zbw001 / TAPIP3D
View on GitHub
TAPIP3D: Tracking Any Point in Persistent 3D Geometry
☆411Dec 28, 2025Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zju3dv / InfiniDepth
View on GitHub
[CVPR 2026] InfiniDepth: Arbitrary-Resolution and Fine-Grained Depth Estimation with Neural Implicit Fields
☆1,050Apr 3, 2026Updated 3 months ago
yangzhou24 / OmniWorld
View on GitHub
[ICLR 2026] OmniWorld: A Multi-Domain and Multi-Modal Dataset for 4D World Modeling
☆485Apr 16, 2026Updated 3 months ago
zju3dv / street_crafter
View on GitHub
[CVPR 2025] StreetCrafter: Street View Synthesis with Controllable Video Diffusion Models
☆327Jan 27, 2026Updated 5 months ago
ant-research / FLARE
View on GitHub
☆721May 1, 2025Updated last year
zju3dv / MotionStreamer
View on GitHub
[ICCV 2025] MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space
☆287Oct 28, 2025Updated 8 months ago
InternRobotics / Aether
View on GitHub
[ICCV 2025 & ICCV 2025 RIWM Outstanding Paper] Aether: Geometric-Aware Unified World Modeling
☆604Oct 26, 2025Updated 8 months ago
gangweix / pixel-perfect-depth
View on GitHub
[NeurIPS 2025] Pixel-Perfect Depth
☆1,059Feb 13, 2026Updated 5 months ago
jzr99 / Geo4D
View on GitHub
[ICCV 2025 Highlight] Geo4D: Leveraging Video Generators for Geometric 4D Scene Reconstruction
☆437Jun 6, 2025Updated last year
jcorsetti / oryon
View on GitHub
Official implementation of CVPR24 Highlight paper "Open-vocabulary object 6D pose estimation"
☆63May 8, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
phj128 / autoworker
View on GitHub
Auto-loop execution workflow with quality gates for Claude Code. Automatically decomposes tasks, implements code, runs tests, and iterate…
☆16Mar 28, 2026Updated 3 months ago
zju3dv / Scal3R
View on GitHub
[CVPR 2026 (Highlight)] Scal3R: Scalable Test-Time Training for Large-Scale 3D Reconstruction
☆513May 11, 2026Updated 2 months ago
phj128 / LearningMotion
View on GitHub
☆35Jul 25, 2023Updated 2 years ago
aimbot-reticle / openpi0-aimbot
View on GitHub
CoRL25-"AimBot: A Simple Auxiliary Visual Cue to Enhance Spatial Awareness of Visuomotor Policies"
☆50Aug 15, 2025Updated 11 months ago
tonghe90 / auto-hf-papers
View on GitHub
☆18Mar 25, 2026Updated 3 months ago
lnbxldn / Bridge
View on GitHub
Code of BRIDGE: Building Reinforcement-Learning Depth-to-Image Data Generation Engine for Monocular Depth Estimation
☆117Sep 30, 2025Updated 9 months ago
zju3dv / habitat-gs
View on GitHub
[ECCV 2026] Habitat-GS: A High-Fidelity Navigation Simulator with Dynamic Gaussian Splatting
☆252Jul 8, 2026Updated 2 weeks ago