This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to…
☆170Sep 25, 2020Updated 5 years ago
Alternatives and similar repositories for Voice-synthesis
Users that are interested in Voice-synthesis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆34Jun 22, 2022Updated 4 years ago
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆432Feb 23, 2021Updated 5 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.☆844Oct 10, 2023Updated 2 years ago
- Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)☆17Oct 12, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆359Mar 25, 2023Updated 3 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Mar 29, 2022Updated 4 years ago
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- One Shot Voice Cloning base on Unet-TTS☆245Mar 22, 2022Updated 4 years ago
- GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification☆14May 17, 2020Updated 6 years ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu☆253Feb 23, 2021Updated 5 years ago
- Implementation of Global Style Token Tacotron in TensorFlow2☆26Sep 28, 2020Updated 5 years ago
- A Python/Pytorch app for easily synthesising human voices☆1,438Dec 2, 2024Updated last year
- Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"☆169Jul 6, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆24May 6, 2025Updated last year
- TPSE-GST Tacotron2☆14May 1, 2019Updated 7 years ago
- Official code for Cotatron @ INTERSPEECH 2020☆214Jul 25, 2024Updated last year
- Phoneme multilingual(Russian-English) voice cloning based on☆394Feb 7, 2021Updated 5 years ago
- Tacotron2 with Global Style Tokens☆64Apr 19, 2019Updated 7 years ago
- voice conversion system☆25Jun 10, 2020Updated 6 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling☆190Nov 18, 2021Updated 4 years ago
- Voice Cloning using SV with GE2E and Tacotron☆12Mar 25, 2023Updated 3 years ago
- ☆21Jun 16, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Apr 9, 2021Updated 5 years ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,097Oct 23, 2024Updated last year
- ☆15May 8, 2021Updated 5 years ago
- VAE Tacotron 2, an alternative of GST Tacotron☆91Jul 6, 2023Updated 2 years ago
- Remotion text-to-speech template using Google Cloud and Firebase☆18Feb 20, 2026Updated 4 months ago
- This repository will help you to clone voice to generate an arbitrary speech in real time☆12Apr 24, 2020Updated 6 years ago
- PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models☆1,978Dec 19, 2023Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆55Oct 15, 2021Updated 4 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Aug 15, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- chinese_tacotron-2☆12Feb 27, 2018Updated 8 years ago
- Demo for Shell Fur Add-on for Godot☆12Aug 8, 2022Updated 3 years ago
- ☆10Sep 17, 2021Updated 4 years ago
- text to speech☆10Mar 19, 2024Updated 2 years ago
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…☆3,993Jul 5, 2024Updated last year
- Full LAKH MIDI dataset converted to MuseNet MIDI output format (9 instruments + drums)☆18Jan 12, 2022Updated 4 years ago
- This utility allows one to cut multiple clips from a single or multiple audio files.☆19Apr 13, 2026Updated 2 months ago