Rudrabha / Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
☆10,289Updated last week
Related projects: ⓘ
- [CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation☆11,637Updated 2 months ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild☆6,376Updated last month
- Let us control diffusion models!☆29,761Updated 6 months ago
- 🔊 Text-Prompted Generative Audio Model☆35,297Updated last month
- Real-time face swap for PC streaming or video calls☆26,048Updated last year
- [CVPR 2022] Thin-Plate Spline Motion Model for Image Animation.☆3,433Updated 7 months ago
- one-click face swap☆27,866Updated last month
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…☆20,607Updated 2 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆33,451Updated last month
- Next generation face swapper and enhancer☆17,808Updated this week
- Official implementation of AnimateDiff.☆10,270Updated last month
- A multi-voice TTS system trained with an emphasis on quality☆12,898Updated last month
- Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)☆7,267Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/☆7,553Updated 7 months ago
- WebUI extension for ControlNet☆16,812Updated last month
- StableLM: Stability AI Language Models☆15,842Updated 5 months ago
- InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and cre…☆22,843Updated this week
- GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code☆2,507Updated 2 months ago
- Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)☆25,294Updated 2 weeks ago
- 🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.☆25,168Updated this week
- A Gradio web UI for Large Language Models.☆39,557Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translation☆10,755Updated last month
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.☆50,778Updated this week
- FaceChain is a deep-learning toolchain for generating your Digital-Twin.☆8,881Updated last month
- so-vits-svc fork with realtime support, improved interface and more features.☆8,674Updated this week
- Community interface for generative AI☆8,677Updated 4 months ago
- Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions☆7,405Updated 3 weeks ago
- ☆7,642Updated 5 months ago
- Generate 3D objects conditioned on text or images☆11,553Updated 2 months ago
- Easily train a good VC model with voice data <= 10 mins!☆22,944Updated 2 weeks ago