ligengen / EgoM2PLinks
[ICCV 2025] The official implementation for EgoM2P: Egocentric Multimodal Multitask Pretraining.
☆34Updated 3 weeks ago
Alternatives and similar repositories for EgoM2P
Users that are interested in EgoM2P are comparing it to the libraries listed below
Sorting:
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆106Updated 10 months ago
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti…☆78Updated last year
- open-sourced video dataset with dynamic scenes and camera movements annotation☆83Updated 9 months ago
- Seeing World Dynamics in a Nutshell☆111Updated 10 months ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography☆99Updated last month
- UniLat3D: Geometry-Appearance Unified Latents for Single-Stage 3D Generation☆133Updated 2 months ago
- ☆278Updated 3 months ago
- Implementation of paper "SceneMaker: Open-set 3D Scene Generation with Decoupled De-occlusion and Pose Estimation Model"☆70Updated last month
- ☆70Updated last year
- The official implementation of The paper "Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation"☆95Updated last month
- DreamCinema: Cinematic Transfer with Free Camera and 3D Character☆95Updated 7 months ago
- [ICCV 2025] Gaussian Variation Field Diffusion for High-fidelity Video-to-4D Synthesis☆107Updated 2 months ago
- [ECCV 2024] Official code for: SC4D: Sparse-Controlled Video-to-4D Generation and Motion Transfer☆112Updated 7 months ago
- UniUGG: Unified 3D Understanding and Generation via Geometric-Semantic Encoding☆58Updated 5 months ago
- VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model☆181Updated last year
- Official PyTorch implementation of "A Unified Approach for Text- and Image-guided 4D Scene Generation", [CVPR 2024]☆93Updated last year
- 📷 Camera-controlled text-to-video generation, now with intrinsics, distortion and orientation control!☆113Updated 3 weeks ago
- Official code for 4Diffusion: Multi-view Video Diffusion Model for 4D Generation.☆118Updated last year
- Self-reimplemented version of 4D-LRM.☆65Updated 8 months ago
- [ICLR 2025] MVTokenFlow: High-quality 4D Content Generation using Multiview Token Flow☆24Updated 9 months ago
- Omni Controllable Video Diffusion☆37Updated last month
- Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".☆134Updated 10 months ago
- Official PyTorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting☆101Updated 9 months ago
- [ICLR 2025] Diffusion²: Dynamic 3D Content Generation via Score Composition of Video and Multi-view Diffusion Models☆56Updated 10 months ago
- Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE"☆190Updated 7 months ago
- [CVPR2025] Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation☆140Updated 6 months ago
- Hyper-3DG Project; Accepted by International Journal of Computer Vision (IJCV)☆51Updated last year
- TC4D: Trajectory-Conditioned Text-to-4D Generation☆203Updated last year
- (CVPR2025 Highlight) Official repository of paper "Panorama Generation From NFoV Image Done Right"☆18Updated 8 months ago
- [CVPR 2025🔥] Official codebase for "Global-Local Tree Search in VLMs for 3D Indoor Scene Generation"☆20Updated 9 months ago