zju3dv / EgoAgentLinks
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆37Updated 4 months ago
Alternatives and similar repositories for EgoAgent
Users that are interested in EgoAgent are comparing it to the libraries listed below
Sorting:
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆88Updated last month
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆195Updated last month
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆56Updated 3 months ago
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆132Updated 7 months ago
- [ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction☆42Updated 3 months ago
- Official Reporsitory of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆39Updated 2 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆64Updated 2 weeks ago
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆56Updated 7 months ago
- ☆97Updated 2 months ago
- [ICLR 2025] Ready-to-React: Online Reaction Policy for Two-Character Interaction Generation☆40Updated 8 months ago
- ☆35Updated 6 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆83Updated 4 months ago
- [NeurIPS 2025 Spotlight] Official implementation of the SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alig…☆139Updated last month
- Self-reimplemented version of 4D-LRM.☆62Updated 5 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆23Updated 7 months ago
- Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation☆46Updated 11 months ago
- Code and data for UniEgoMotion (ICCV 2025)☆34Updated last week
- ☆58Updated 8 months ago
- [NeurIPS 25] TrackingWorld: World-centric Monocular 3D Tracking of Almost All Pixels☆20Updated 2 months ago
- [Neurips DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding☆82Updated 2 weeks ago
- [ICLR 2025] SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects☆87Updated 7 months ago
- ☆38Updated 8 months ago
- [CVPR 2024] Physical Property Understanding from Language-Embedded Feature Fields☆82Updated last week
- Seeing World Dynamics in a Nutshell☆110Updated 8 months ago
- Official code for the paper: Can3Tok (ICCV2025)☆39Updated 3 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆134Updated 5 months ago
- [CVPR 2024] Official code for EgoGen: An Egocentric Synthetic Data Generator☆86Updated 6 months ago
- [CVPR 2024] Situational Awareness Matters in 3D Vision Language Reasoning☆42Updated 11 months ago
- ☆38Updated last year
- [NeurIPS 2025] Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"☆98Updated 2 weeks ago