ligengen / EgoM2PLinks
[ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.
☆33Updated last week
Alternatives and similar repositories for EgoM2P
Users that are interested in EgoM2P are comparing it to the libraries listed below
Sorting:
- open-sourced video dataset with dynamic scenes and camera movements annotation☆83Updated 8 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆106Updated 9 months ago
- Seeing World Dynamics in a Nutshell☆111Updated 9 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆79Updated last year
- ☆269Updated 2 months ago
- UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation☆131Updated last month
- 📷 Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆99Updated 3 weeks ago
- Self-reimplemented version of 4D-LRM.☆65Updated 7 months ago
- The official implementation of The paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"☆92Updated 2 weeks ago
- A novel 4D reconstruction method that directly generates high-quality, animation-ready 4D mesh asset (.GLB file) from a single monocular …☆111Updated last month
- Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.☆116Updated last year
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆58Updated 4 months ago
- Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"☆42Updated last week
- [ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis☆104Updated 2 months ago
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆138Updated 6 months ago
- [ECCV 2024] Official code for: SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer☆112Updated 6 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆24Updated 9 months ago
- DreamCinema: Cinematic Transfer with Free Camera and 3D Character☆95Updated 6 months ago
- ☆70Updated last year
- [NeurIPS 2025] GeneMAN: Generalizable Single-Image 3D Human Reconstruction from Multi-Source Human Data☆79Updated 3 months ago
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆55Updated 9 months ago
- [arXiv 2025] Generative View Stitching☆99Updated 2 months ago
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking☆56Updated last year
- ☆94Updated 7 months ago
- [ECCV 2024] Official Implementation of DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.☆83Updated last year
- Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".☆133Updated 9 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆97Updated last week
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆101Updated 9 months ago
- [ICCV 2025 Findings Oral] DNF-Avatar: Distilling Neural Fields for Real-time Animatable Avatar Relighting☆40Updated last month
- An unofficial implementation of DreamScene360.☆83Updated last year