zju3dv / EgoAgent
Official implementation of ICCV 2025 paper "EgoAgent: A Joint Predictive Agent Model in Egocentric Worlds".
☆45Updated 6 months ago
Alternatives and similar repositories for EgoAgent
Users who are interested in EgoAgent are comparing it to the libraries listed below:
- Code for "BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation", ICCV 2025.☆101Updated 3 months ago
- [ICLR 2025] Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction☆42Updated 5 months ago
- [NeurIPS 2025] PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation☆103Updated last week
- DELTA: Dense Efficient Long-range 3D Tracking for Any video (ICLR 2025)☆137Updated 9 months ago
- Official Repository of "EgoMono4D: Self-Supervised Monocular 4D Scene Reconstruction for Egocentric Videos"☆40Updated 4 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆58Updated 5 months ago
- Official Implementation of paper "St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World"☆106Updated 4 months ago
- [NeurIPS 2025] InternScenes: A Large-scale Interactive Indoor Scene Dataset with Realistic Layouts.☆219Updated 3 months ago
- ☆107Updated 4 months ago
- Official code for paper: "RayRoPE: Projective Ray Positional Encoding for Multi-view Attention"☆34Updated this week
- [NeurIPS 24] The implementation and dataset of LiveScene: Language Embedding Interactive Radiance Fields for Physical Scene Rendering and…☆60Updated 9 months ago
- Public code for XFactor: Introduces the first geometry-free model to achieve true self-supervised / pose-free Novel View Synthesis (NVS) …☆89Updated 3 months ago
- Official implementation of Video-DPM☆134Updated last week
- Seeing World Dynamics in a Nutshell☆111Updated 10 months ago
- ☆39Updated 10 months ago
- [ARXIV’25] Learning Video Generation for Robotic Manipulation with Collaborative Trajectory Control☆88Updated 6 months ago
- Self-reimplemented version of 4D-LRM.☆65Updated 7 months ago
- Official implementation of "NoiseAR: AutoRegressing Initial Noise Prior for Diffusion Models"☆18Updated 7 months ago
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation☆135Updated 7 months ago
- SPAgent, a spatial intelligence agent designed to operate in the physical and spatial world.☆56Updated last week
- Official Code Release of NeurIPS 2025 Paper: HoloScene: Simulation‑Ready Interactive 3D Worlds from a Single Video☆83Updated 3 months ago
- ☆123Updated 7 months ago
- Code and data for UniEgoMotion (ICCV 2025)☆41Updated 2 months ago
- Official code for the paper: Can3Tok (ICCV 2025)☆39Updated 5 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆24Updated 9 months ago
- Code for "Motion-2-to-3: Leveraging 2D Motion Data to Boost 3D Motion Generation", Arxiv 2024☆103Updated last month
- ☆37Updated 2 weeks ago
- [CVPR 2025] Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields☆32Updated 3 months ago
- ☆32Updated 2 years ago
- An open-source self-reimplemented version of the paper "RayZer: A Self-supervised Large View Synthesis Model"☆31Updated 3 weeks ago