shawwn / Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆53Updated 5 years ago
Alternatives and similar repositories for Real-Time-Voice-Cloning:
Users that are interested in Real-Time-Voice-Cloning are comparing it to the libraries listed below
- A gui to help make a text to speech dataset.☆18Updated 2 years ago
- Convert text into beautiful artistic images☆58Updated last year
- This repository is a repository for the paper, "Irgun: Improved residue based gradual up-scaling network for single image super resolutio…☆12Updated 4 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆359Updated last year
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆168Updated 4 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆229Updated 2 years ago
- Unsupervised Any-to-many Audiovisual Synthesis via Exemplar Autoencoders☆120Updated 2 years ago
- Non official project based on original /r/Deepfakes thread. Many thanks to him!☆15Updated 4 years ago
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆97Updated 2 years ago
- Minecraft GAN☆44Updated 3 years ago
- PoP ArT☆21Updated 2 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆435Updated 3 years ago
- Simple deepfake pet-project☆18Updated 5 years ago
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆208Updated 6 months ago
- GANfolk are AI-generated renderings of fictional people. Each image in the collection was created by a pair of Generative Adversarial Net…☆35Updated 3 years ago
- Keras implementations of Tacotron-2☆27Updated 4 years ago
- Live real-time avatars from your webcam in the browser. No dedicated hardware or software installation needed. A pure Google Colab wrappe…☆352Updated last month
- PyTorch Implementation of "Facial Image-to-Video Translation by a Hidden Affine Transformation" in MM'19.☆55Updated 5 years ago
- AutomEditor is an AI based video editor that helps video bloggers to remove bloopers automatically. It uses multimodal spatio-temporal bl…☆47Updated 5 years ago
- ☆64Updated 4 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 3 years ago
- Synthesizing and manipulating 2048x1024 images with conditional GANs☆61Updated 5 years ago
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-S…☆52Updated 4 years ago
- Image Processing, Speech Processing, Encoder Decoder, Research Paper implementation☆60Updated 4 years ago
- Pytorch implementation of Dance Dance Generation: Motion Transfer for Internet Videos☆44Updated 5 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- ☆22Updated 3 years ago
- AI-generated talking head video of fake people responding to your input question text.☆68Updated 3 years ago