hkchengrex / MMAudio
[arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
☆966Updated this week
Alternatives and similar repositories for MMAudio:
Users that are interested in MMAudio are comparing it to the libraries listed below
- FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝☆507Updated 5 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆560Updated this week
- Taming Stable Diffusion for Lip Sync!☆1,856Updated this week
- Diffusion-based Portrait and Animal Animation☆608Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers☆1,480Updated 3 weeks ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆664Updated last month
- ☆742Updated 2 months ago
- zero-shot voice conversion & singing voice conversion, with real-time support☆904Updated last week
- A Training-free Iterative Framework for Long Story Visualization☆647Updated this week
- Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!☆606Updated this week
- Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"☆556Updated last month
- Learning Flow Fields in Attention for Controllable Person Image Generation☆944Updated last week
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆314Updated this week
- ☆2,105Updated 4 months ago
- Official implementation of Posterior-Mean Rectified Flow: Towards Minimum MSE Photo-Realistic Image Restoration☆585Updated 3 months ago
- A minimal and universal controller for FLUX.1.☆1,100Updated this week
- ☆1,272Updated this week
- Dead simple FLUX LoRA training UI with LOW VRAM support☆1,726Updated last week
- Select a portrait, click to move the head around (please use your own space / GPU!)☆796Updated last month
- Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"☆1,410Updated last week
- ComfyUI nodes for LivePortrait☆1,774Updated 5 months ago
- Official Implementations for Paper - AniDoc: Animation Creation Made Easier☆443Updated 2 weeks ago
- You can using EchoMimic in ComfyUI☆498Updated this week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆2,982Updated last month
- ☆661Updated this week
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆2,092Updated 3 months ago
- LTX-Video Support for ComfyUI☆635Updated 3 weeks ago
- a comfyui custom node for MimicMotion☆355Updated 5 months ago
- [ECCV 2024] MOFA-Video: Controllable Image Animation via Generative Motion Field Adaptions in Frozen Image-to-Video Diffusion Model.☆696Updated last month
- StoryMaker: Towards consistent characters in text-to-image generation☆628Updated last month