HumanAIGC / EMO
Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
☆7,503Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for EMO
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,652Updated 4 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,121Updated 4 months ago
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,496Updated 3 months ago
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆10,485Updated 4 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,465Updated 4 months ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,278Updated 3 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,652Updated 3 months ago
- Accepted as [NeurIPS 2024] Spotlight Presentation Paper☆5,955Updated last month
- Character Animation (AnimateAnyone, Face Reenactment)☆3,185Updated 5 months ago
- Kolors Team☆3,872Updated last week
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆11,992Updated 4 months ago
- Official implementations for paper: Anydoor: zero-shot object-level image customization☆4,006Updated 7 months ago
- EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆2,951Updated this week
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆1,617Updated 10 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,345Updated 5 months ago
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on☆5,607Updated 6 months ago
- High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance☆1,899Updated last month
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆2,859Updated this week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆4,827Updated 3 months ago
- ☆2,373Updated 6 months ago
- GUI-focused roop☆4,600Updated 5 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,589Updated 2 months ago
- Bring portraits to life!☆13,012Updated last week
- V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.☆2,255Updated last week
- Zero-Shot Speech Editing and Text-to-Speech in the Wild☆7,645Updated 4 months ago
- Inference and training library for high-quality TTS models.☆4,658Updated 3 weeks ago
- More relighting!☆5,545Updated 3 weeks ago
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆5,283Updated 4 months ago
- Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.☆3,721Updated 2 months ago
- [ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild☆3,935Updated 2 weeks ago