smoke-trees/Voice-synthesis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/smoke-trees/Voice-synthesis)

smoke-trees / Voice-synthesis

This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) with a vocoder that works in real-time. SV2TTS is a three-stage deep learning framework that allows to create a numerical representation of a voice from a few seconds of audio, and to use it to condition a text-to…

☆170

Alternatives and similar repositories for Voice-synthesis

Users that are interested in Voice-synthesis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

caizexin / tf_multispeakerTTS_fc
View on GitHub
the Tensorflow version of multi-speaker TTS training with feedback constraint
☆40Oct 12, 2020Updated 5 years ago
SforAiDl / Neural-Voice-Cloning-With-Few-Samples
View on GitHub
This repository has implementation for "Neural Voice Cloning With Few Samples"
☆433Feb 23, 2021Updated 5 years ago
Tomiinek / Multilingual_Text_to_Speech
View on GitHub
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
☆844Oct 10, 2023Updated 2 years ago
VisionBrain / Neural_Voice_Cloning
View on GitHub
Open Source Implementation of Neural Voice Cloning with Few Audio Samples (Baidu Research)
☆17Oct 12, 2020Updated 5 years ago
deterministic-algorithms-lab / Cross-Lingual-Voice-Cloning
View on GitHub
Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.
☆359Mar 25, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
sos1sos2Sixteen / aishell-3-baseline-fc
View on GitHub
The code for aishell-3 baseline acoustic model
☆70Nov 30, 2020Updated 5 years ago
nii-yamagishilab / multi-speaker-tacotron
View on GitHub
VCTK multi-speaker tacotron for ICASSP 2020
☆266Mar 29, 2022Updated 4 years ago
CMsmartvoice / One-Shot-Voice-Cloning
View on GitHub
One Shot Voice Cloning base on Unet-TTS
☆243Mar 22, 2022Updated 4 years ago
Edresson / GE2E-Speaker-Encoder
View on GitHub
GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification
☆14May 17, 2020Updated 6 years ago
Sharad24 / Neural-Voice-Cloning-with-Few-Samples
View on GitHub
Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu
☆253Feb 23, 2021Updated 5 years ago
CODEJIN / GST_Tacotron
View on GitHub
Implementation of Global Style Token Tacotron in TensorFlow2
☆26Sep 28, 2020Updated 5 years ago
dmse4tts / DMSE4TTS
View on GitHub
☆24May 6, 2025Updated last year
jinhan / tacotron2-vae
View on GitHub
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
☆169Jul 6, 2023Updated 3 years ago
voice-cloning-app / Voice-Cloning-App
View on GitHub
A Python/Pytorch app for easily synthesising human voices
☆1,441Dec 2, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cnlinxi / tpse_tacotron2
View on GitHub
TPSE-GST Tacotron2
☆14May 1, 2019Updated 7 years ago
maum-ai / cotatron
View on GitHub
Official code for Cotatron @ INTERSPEECH 2020
☆213Jul 25, 2024Updated last year
vlomme / Multi-Tacotron-Voice-Cloning
View on GitHub
Phoneme multilingual(Russian-English) voice cloning based on
☆395Feb 7, 2021Updated 5 years ago
jinhan / tacotron2-gst
View on GitHub
Tacotron2 with Global Style Tokens
☆64Apr 19, 2019Updated 7 years ago
freenowill / AutoVC-WavRNN
View on GitHub
voice conversion system
☆25Jun 10, 2020Updated 6 years ago
hs-oh-prml / EmotionControllableTextToSpeech
View on GitHub
☆21Jun 16, 2021Updated 5 years ago
Talish-wiz / Voice-Cloning
View on GitHub
This repository will help you to clone voice to generate an arbitrary speech in real time
☆12Apr 24, 2020Updated 6 years ago
keonlee9420 / Parallel-Tacotron2
View on GitHub
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191Nov 18, 2021Updated 4 years ago
sagar-spkt / SV2MTTS
View on GitHub
Voice Cloning using SV with GE2E and Tacotron
☆12Mar 25, 2023Updated 3 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ide8 / tacotron2
View on GitHub
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
☆128Apr 9, 2021Updated 5 years ago
auspicious3000 / autovc
View on GitHub
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,099Oct 23, 2024Updated last year
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
ob1y2k / publitio_android_sdk
View on GitHub
Simple Android SDK for Publitio
☆10Jan 16, 2021Updated 5 years ago
rishikksh20 / vae_tacotron2
View on GitHub
VAE Tacotron 2, an alternative of GST Tacotron
☆91Jul 6, 2023Updated 3 years ago
keonlee9420 / Daft-Exprt
View on GitHub
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆55Oct 15, 2021Updated 4 years ago
RandomInternetPreson / text-generation-webui-barktts
View on GitHub
A simple extension that uses Bark Text-to-Speech for audio output
☆10Nov 20, 2023Updated 2 years ago
arcosx / CodeInterpreter
View on GitHub
The Best Open Source LLM Code Interpreter
☆17Sep 2, 2023Updated 2 years ago
r9y9 / deepvoice3_pytorch
View on GitHub
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
☆1,978Dec 19, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
meelement / noise_adversarial_tacotron
View on GitHub
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17Aug 15, 2019Updated 6 years ago
awesome-archive / tacotron_cn
View on GitHub
chinese_tacotron-2
☆12Feb 27, 2018Updated 8 years ago
kstoneriv3 / Fake-Voice-Detection
View on GitHub
For "Deep Learning class" at ETHZ. Evaluate how well the fake voice of Barack Obama 1. confuses the voice verification system, 2. can be …
☆34May 22, 2023Updated 3 years ago
chenjiaxiang / Chinese-dataset-for-speaker-identification
View on GitHub
☆10Sep 17, 2021Updated 4 years ago
jxzhanggg / nonparaSeq2seqVC_code
View on GitHub
Implementation code of non-parallel sequence-to-sequence VC
☆248Mar 24, 2023Updated 3 years ago
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
TensorSpeech / TensorFlowTTS
View on GitHub
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…
☆3,992Jul 5, 2024Updated 2 years ago