Dont-Copy-That-Floppy / Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆41 Updated 5 years ago
Alternatives and similar repositories for Real-Time-Voice-Cloning
Users who are interested in Real-Time-Voice-Cloning are comparing it to the repositories listed below
- This repository has implementation for "Neural Voice Cloning With Few Samples" ☆435 Updated 4 years ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu ☆253 Updated 4 years ago
- ☆62 Updated 4 years ago
- Audio style transfer with shallow random parameters CNN. ☆406 Updated 9 months ago
- ⏩ Generating speech in a single forward pass without any attention! ☆579 Updated last year
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework ☆48 Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing t… ☆862 Updated 2 years ago
- Generating Digital Painting Lighting Effects via RGB-space Geometry (SIGGRAPH2020/TOG2020) ☆45 Updated 5 years ago
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain" ☆255 Updated 6 years ago
- Wiki for Neural Networks. Head over to the wiki tab at the top of this site. In the future the wiki might be moved to its own website. B… ☆44 Updated 6 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit… ☆170 Updated 5 years ago
- 🐸TTS recipes for different datasets ☆86 Updated 3 years ago
- Text to Speech engine based on the Tacotron architecture, initially implemented by Keith Ito. ☆585 Updated 4 years ago
- Shell scripts to prepare textures for ESR/SFTGAN etc. ☆30 Updated 6 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning. ☆360 Updated 2 years ago
- A web app that lets you play around with TalkNet models ☆124 Updated 2 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style tr… ☆899 Updated 2 years ago
- A GUI to help make a text to speech dataset. ☆18 Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆57 Updated 6 years ago
- Papagayo is a lip-syncing program designed to help you line up phonemes (mouth shapes) with the actual recorded sound of actors speaking.… ☆280 Updated 2 years ago
- [CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting ☆46 Updated 4 years ago
- Desktop application for neural speech synthesis written in C++ ☆213 Updated 2 years ago
- Copy the voice of anyone ☆50 Updated 8 years ago
- Python3 Text to Speech Video Sample ☆93 Updated 8 years ago
- speech synthesis program ☆21 Updated 7 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages ☆473 Updated 5 years ago
- ObamaNet : Photo-realistic lip-sync from audio (Unofficial port) ☆238 Updated 7 years ago
- AI Video Processing/Upscaling With VapourSynth in Google Colab ☆114 Updated 5 years ago
- VCTK multi-speaker tacotron for ICASSP 2020 ☆266 Updated 3 years ago
- A TensorFlow Implementation of DC-TTS: yet another text-to-speech model ☆1,162 Updated 2 years ago