zju3dv / EgoAgentLinks
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆36Updated 4 months ago
Alternatives and similar repositories for EgoAgent
Users that are interested in EgoAgent are comparing it to the libraries listed below
Sorting:
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆87Updated 3 weeks ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆41Updated 2 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆188Updated 2 weeks ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆127Updated 6 months ago
- ☆91Updated last month
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆124Updated last month
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆54Updated 2 months ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 7 months ago
- ☆35Updated 5 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆23Updated 6 months ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆17Updated last month
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆46Updated 10 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆82Updated 3 months ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆37Updated last month
- ☆47Updated 4 months ago
- Trace Anything: Representing Any Video in 4D via Trajectory Fields☆352Updated this week
- Geometry-aware 4D Video Generation for Robot Manipulation☆62Updated 2 months ago
- Official implementation of GaussianProperty: Integrating Physical Properties to 3D Gaussians with LMMs.☆60Updated 3 months ago
- ☆63Updated 3 months ago
- [ICLR 2025] Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation☆40Updated 7 months ago
- Seeing World Dynamics in a Nutshell☆110Updated 7 months ago
- ConDense backbone, weights, and evaluation code.☆30Updated last year
- Unifying 2D and 3D Vision-Language Understanding☆115Updated 3 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆102Updated 7 months ago
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆76Updated last month
- ☆32Updated last year
- Self-reimplemented version of 4D-LRM.☆60Updated 5 months ago
- [ICCV 2025] ObjectGS: Object-aware Scene Reconstruction and Scene Understanding via Gaussian Splatting☆59Updated 3 weeks ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆57Updated last week
- Official code for the paper: Can3Tok (ICCV2025)☆38Updated 2 months ago