OpenTalker / video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
☆6,376Updated last month
Related projects: ⓘ
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆11,637Updated 2 months ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆8,881Updated last month
- Official implementation of AnimateDiff.☆10,270Updated last month
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,465Updated 2 months ago
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine☆7,201Updated last month
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,498Updated 2 months ago
- 本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形,设置面部区域可配置的增强方式进行合成唇形(人脸)区域画面增强,提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧,补充帧间合成唇形的动作过渡,使合成的唇…☆1,813Updated last year
- [CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model☆10,359Updated 2 months ago
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,216Updated 2 months ago
- MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting☆2,406Updated last month
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,507Updated 2 months ago
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆7,267Updated this week
- Next generation face swapper and enhancer☆17,808Updated this week
- GUI-focused roop☆4,360Updated 3 months ago
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆10,850Updated 2 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,553Updated 7 months ago
- Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on☆5,392Updated 4 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆4,398Updated last month
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,318Updated 2 months ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,405Updated 3 weeks ago
- 📷 EasyPhoto | Your Smart AI Photo Generator.☆4,905Updated 2 months ago
- Create Magic Story!☆5,787Updated last month
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…☆4,450Updated last week
- Real time interactive streaming digital human☆3,462Updated last week
- Outfit Anyone: Ultra-high quality virtual try-on for Any Clothing and Any Person☆5,526Updated last month
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆4,998Updated 2 months ago
- Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation☆14,346Updated last month
- 🚀🎬 ShortGPT - Experimental AI framework for youtube shorts / tiktok channel automation☆5,523Updated 7 months ago
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key☆5,259Updated 2 months ago
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆10,289Updated last week