caizexin/tf_multispeakerTTS_fc

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/caizexin/tf_multispeakerTTS_fc)

caizexin / tf_multispeakerTTS_fc

the Tensorflow version of multi-speaker TTS training with feedback constraint

☆40

Alternatives and similar repositories for tf_multispeakerTTS_fc

Users that are interested in tf_multispeakerTTS_fc are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sos1sos2Sixteen / aishell-3-baseline-fc
View on GitHub
The code for aishell-3 baseline acoustic model
☆70Nov 30, 2020Updated 5 years ago
Lukelluke / MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
View on GitHub
Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…
☆22Sep 4, 2020Updated 5 years ago
smoke-trees / Voice-synthesis
View on GitHub
This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…
☆170Sep 25, 2020Updated 5 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
nii-yamagishilab / multi-speaker-tacotron
View on GitHub
VCTK multi-speaker tacotron for ICASSP 2020
☆266Mar 29, 2022Updated 4 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
bajibabu / make_full_labels
View on GitHub
how to generate the full-contextual labels from un-seen text for the application of HMM-based speech synthesis (HTS)
☆12Nov 22, 2019Updated 6 years ago
BridgetteSong / ExpressiveTacotron
View on GitHub
This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…
☆74Sep 21, 2022Updated 3 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
maum-ai / cotatron
View on GitHub
Official code for Cotatron @ INTERSPEECH 2020
☆213Jul 25, 2024Updated 2 years ago
meelement / noise_adversarial_tacotron
View on GitHub
Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…
☆17Aug 15, 2019Updated 6 years ago
bigpon / vcc20_baseline_cyclevae
View on GitHub
Voice Conversion Challenge 2020 CycleVAE baseline system
☆131Oct 19, 2020Updated 5 years ago
ttslr / python-MCD
View on GitHub
☆49May 3, 2020Updated 6 years ago
SJTMusicTeam / SVS_system
View on GitHub
A system works on singing voice synthesis
☆79Jan 11, 2023Updated 3 years ago
shackysureshot / Mel-Cepstral-Distortion
View on GitHub
Calculation of MCD (dB) between two speech waveforms
☆57Sep 26, 2020Updated 5 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
sony / bigvsan_eval
View on GitHub
Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated 2 years ago
shaojinding / Adversarial-Many-to-Many-VC
View on GitHub
[InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …
☆39Mar 24, 2023Updated 3 years ago
LeoniusChen / Attentions-in-Tacotron
View on GitHub
☆69Mar 31, 2021Updated 5 years ago
CMsmartvoice / Unet-TTS
View on GitHub
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37Mar 2, 2022Updated 4 years ago
inconnu11 / Objective-evaluation_speech_synthesis
View on GitHub
☆17Mar 24, 2022Updated 4 years ago
Yablon / auorange
View on GitHub
Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet
☆62Jun 8, 2021Updated 5 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
dipjyoti92 / SC-WaveRNN
View on GitHub
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110Jun 22, 2022Updated 4 years ago
CSTR-Edinburgh / ophelia
View on GitHub
Sequence-to-sequence TTS based on Kyubyong's dc_tts
☆61Feb 2, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
View on GitHub
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Jan 27, 2020Updated 6 years ago
BridgetteSong / BunchedLPCnet
View on GitHub
This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.
☆14Jun 17, 2021Updated 5 years ago
mutiann / few-shot-transformer-tts
View on GitHub
Byte-based multilingual transformer TTS for low-resource/few-shot language adaptation.
☆87Jul 25, 2022Updated 4 years ago
andi611 / ZeroSpeech-TTS-without-T
View on GitHub
A Pytorch implementation for the ZeroSpeech 2019 challenge.
☆112Nov 12, 2019Updated 6 years ago
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
jinhan / tacotron2-vae
View on GitHub
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
☆170Jul 6, 2023Updated 3 years ago
Tomiinek / Multilingual_Text_to_Speech
View on GitHub
An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.
☆844Oct 10, 2023Updated 2 years ago
ide8 / tacotron2
View on GitHub
Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow
☆128Apr 9, 2021Updated 5 years ago
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
One-Shot-Voice-Conversion-with-WIN / WINVC
View on GitHub
Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".
☆30Nov 13, 2021Updated 4 years ago
thuhcsi / Crystal
View on GitHub
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆230Aug 17, 2020Updated 5 years ago
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
MingjieChen / VoiceConversionGANs
View on GitHub
GAN series for voice conversion on VCC2018 dataset
☆17Aug 27, 2020Updated 5 years ago
bshall / UniversalVocoding
View on GitHub
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Nov 14, 2020Updated 5 years ago
thuhcsi / icassp2021-emotion-tts
View on GitHub
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34Mar 17, 2023Updated 3 years ago
janvainer / speedyspeech
View on GitHub
☆262Dec 8, 2022Updated 3 years ago