Skaldak / MfH
☆11 Updated 4 months ago
Alternatives and similar repositories for MfH:
Users who are interested in MfH are comparing it to the repositories listed below.
- [CVPR 2024] Customize your NeRF: Adaptive Source Driven 3D Scene Editing via Local-Global Iterative Training ☆40 Updated last year
- This is the official implementation of VideoMaker: Zero-shot Customized Video Generation with the Inherent Force of Video Diffusion Mode… ☆13 Updated last month
- [ICLR2025] GenPercept: Diffusion Models Trained with Large Data Are Transferable Visual Models ☆181 Updated 3 months ago
- A curated list of awesome autoregressive papers in Generative AI ☆52 Updated last week
- ☆58 Updated 2 months ago
- Pytorch implementation of GaussianToken: An Effective Image Tokenizer with 2D Gaussian Splatting ☆78 Updated 3 weeks ago
- official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model" ☆125 Updated last month
- An organized list of academic papers focused on the topic of 4D Generation. If you have any additions or suggestions, feel free to contri… ☆56 Updated last year
- [ICLR 2025] Trajectory Attention For Fine-grained Video Motion Control ☆69 Updated last week
- "Comp4D: Compositional 4D Scene Generation", Dejia Xu*, Hanwen Liang*, Neel P. Bhatt, Hezhen Hu, Hanxue Liang, Konstantinos N. Platanioti… ☆79 Updated 8 months ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes ☆84 Updated last month
- Official implementation for WorldScore: A Unified Evaluation Benchmark for World Generation ☆92 Updated last week
- Official code for "Amodal Completion via Progressive Mixed Context Diffusion" [CVPR 2024 Highlight] ☆44 Updated 9 months ago
- Official implementation of “4D LangSplat: 4D Language Gaussian Splatting via Multimodal Large Language Models” (CVPR 2025) ☆91 Updated last week
- Prometheus: 3D-Aware Latent Diffusion Models for Feed-Forward Text-to-3D Scene Generation ☆90 Updated 3 months ago
- [NeurIPS 2024] DreamScene4D: Dynamic Multi-Object Scene Generation from Monocular Videos ☆224 Updated 6 months ago
- Code release for "PISA Experiments: Exploring Physics Post-Training for Video Diffusion Models by Watching Stuff Drop" (arXiv 2025) ☆28 Updated last month
- TC4D: Trajectory-Conditioned Text-to-4D Generation ☆188 Updated 6 months ago
- Seeing World Dynamics in a Nutshell ☆105 Updated last month
- Source code for paper GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking ☆44 Updated 3 months ago
- Official implementation of "Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness". ☆19 Updated 3 weeks ago
- open-sourced video dataset with dynamic scenes and camera movements annotation ☆50 Updated this week
- [ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models ☆228 Updated 3 months ago
- [CVPR 2025] The code for paper "Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding". ☆80 Updated last month
- Official repository for "SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE" ☆140 Updated 3 weeks ago
- GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography ☆43 Updated 2 weeks ago
- Frequency Autoregressive Image Generation with Continuous Tokens ☆56 Updated last month
- [NeurIPS 2023] Free-Bloom: Zero-Shot Text-to-Video Generator with LLM Director and LDM Animator ☆96 Updated last year
- Official Implementation of VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention ☆34 Updated last week
- [ECCV 2024] Viewpoint Textual Inversion: Discovering Scene Representations and 3D View Control in 2D Diffusion Models ☆109 Updated 4 months ago