tanjimin / unsupervised-video-dubbing
Unsupervised video dubbing project
☆38Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for unsupervised-video-dubbing
- Controllable Face Generation via pretrained Conditional Adversarial Latent Autoencoder (ALAE)☆19Updated 4 years ago
- Audio Demo for "FastSVC: Fast Cross-Domain Singing Voice Conversion with Feature-wise Linear Modulation"☆19Updated 3 years ago
- Contrastive Language-Audio Pretraining☆15Updated 3 years ago
- ☆28Updated 2 years ago
- ☆16Updated 2 years ago
- Talking Face Generation system☆19Updated last year
- Unofficial implementation of Neural Analysis and Synthesis☆7Updated 3 years ago
- The project page repo for Neural Dubber.☆29Updated last year
- An implementation of simple diffusion in PyTorch (and JAX)☆35Updated last year
- Implementation of NWT, audio-to-video generation, in Pytorch☆87Updated 2 years ago
- Aligns faces to the canonical face in both videos and images☆17Updated 2 years ago
- A Versatile Face Encoder for Zero-Shot Diffusion Model Personalization☆19Updated this week
- PyTorch implementation of MelNet☆10Updated 5 years ago
- [NCMMSC'2024] Emotion-Aware Prosodic Phrasing for Expressive Text-to-Speech☆22Updated 3 months ago
- ☆23Updated last year
- [ICCV'21] The Right to Talk: An Audio-Visual Transformer Approach☆20Updated 3 years ago
- Implementation of Taming Transformers for High-Resolution Image Synthesis (https://arxiv.org/abs/2012.09841) in PyTorch☆16Updated 3 years ago
- Code for "SelfTalk: A Self-Supervised Commutative Training Diagram to Comprehend 3D Talking Faces" ACM MM 2023☆30Updated last year
- Official PyTorch implementation of TTS Style Transfer☆24Updated 2 years ago
- Video examples of "Appearance Composing GAN: A General Method for Appearance-Controllable Human Video Motion Transfer"☆15Updated 3 years ago
- Code for "Animating Portrait Line Drawings from a Single Face Photo and a Speech Signal"☆55Updated 2 years ago
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines☆18Updated 2 years ago
- Implementation of VAE and Style-GAN Architecture Achieving State of the Art Reconstruction☆30Updated last year
- ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 3 years ago
- Deep Automodulators☆13Updated 2 years ago
- Audio-conditioned video texture generation☆25Updated 2 years ago