OpenTalker / SadTalker
[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆12,930Updated last year
Alternatives and similar repositories for SadTalker
Users who are interested in SadTalker are comparing it to the repositories listed below:
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆7,092Updated 10 months ago
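- This project implements Wav2lip video lip synthesis on top of SadTalkers. Lip shapes are generated by driving a video file with audio, and a configurable enhancement method for the face region sharpens the synthesized lip (face) area to improve the clarity of the generated lips. The DAIN deep-learning frame-interpolation algorithm adds frames to the generated video, filling in the motion transitions of the synthesized lips between frames, so that the synthesized lips…☆1,971Updated 2 years ago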
- Industry leading face manipulation platform☆23,565Updated this week
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,608Updated 8 months ago
- [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.☆3,581Updated last year
- WebUI extension for ControlNet☆17,699Updated 10 months ago
- MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting☆4,378Updated 2 months ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,638Updated 10 months ago
- VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models☆4,881Updated 11 months ago
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆9,452Updated 3 weeks ago
- Official implementation of AnimateDiff.☆11,536Updated 11 months ago
- AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation☆4,965Updated last year
- roop extension for StableDiffusion web-ui☆3,504Updated last year
- The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.☆6,067Updated last year
- ☆10,939Updated this week
- Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>☆4,698Updated 3 months ago
- ☆7,824Updated last year
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…☆12,117Updated last week
- Wav2Lip UHQ extension for Automatic1111☆1,384Updated last year
- InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥☆11,685Updated 11 months ago
- GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code☆1,712Updated 8 months ago
- [AAAI 2025] EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning☆3,944Updated 6 months ago
- MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising☆2,732Updated last year
- This is the Mov2mov plugin for Automatic1111/stable-diffusion-webui.☆2,203Updated 5 months ago
- Official Code for Stable Cascade☆6,588Updated 11 months ago
- 🔊 Text-Prompted Generative Audio Model☆38,091Updated 10 months ago
- Nightly release of ControlNet 1.1☆5,031Updated 10 months ago
- [ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators☆4,193Updated 2 years ago
- MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation☆2,558Updated 3 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.☆6,209Updated 6 months ago