zju3dv / EgoAgentLinks
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆26Updated 3 months ago
Alternatives and similar repositories for EgoAgent
Users that are interested in EgoAgent are comparing it to the libraries listed below
Sorting:
- ☆88Updated 3 weeks ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆39Updated 2 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆126Updated 6 months ago
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆85Updated last week
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆16Updated 3 weeks ago
- ☆32Updated last year
- Official code for the paper: Can3Tok (ICCV2025)☆37Updated last month
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆52Updated last month
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆58Updated 3 weeks ago
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆59Updated 2 weeks ago
- ☆38Updated 6 months ago
- ☆110Updated 3 months ago
- [ICCV 2025] This is the official implementation of POMATO: Marrying Pointmap Matching with Temporal Motions for Dynamic 3D Reconstruction☆99Updated 2 months ago
- [ICCV 2025] ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting☆54Updated 2 weeks ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆23Updated 6 months ago
- StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams☆65Updated 4 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆126Updated 4 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆179Updated 3 weeks ago
- The official implementation for "Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos".☆45Updated 4 months ago
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆46Updated 10 months ago
- Seeing World Dynamics in a Nutshell☆109Updated 6 months ago
- ☆34Updated 5 months ago
- Official implementation of "E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models"☆104Updated 4 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆43Updated last week
- (ICCV 25) MonoFusion☆47Updated 2 months ago
- Official implementation of Forge4D: Feed-Forward 4D Human Reconstruction and Interpolation from Uncalibrated Sparse Videos☆26Updated last week
- [Arxiv'24] LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding☆34Updated last month
- ConDense backbone, weights, and evaluation code.☆31Updated last year
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆59Updated 3 months ago
- [NeurIPS 2025] Streaming 3D Reconstruction with Explicit Spatial Pointer Memory☆148Updated 2 weeks ago