vliu15 / tts-gan

End-to-end Text-to-Speech with Generative Adversarial Networks

☆20

Related projects ⓘ

Alternatives and complementary repositories for tts-gan

speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22Updated 8 months ago
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆64Updated last year
francislata / unicats
An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".
☆22Updated last year
hhguo / SoCodec
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆63Updated 2 months ago
Joshua-1995 / LearnableUpsamplingLayer-Pytorch
Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)
☆54Updated 8 months ago
rhoposit / icassp2021
☆15Updated 3 years ago
shang0712 / HierTTS
☆44Updated last year
tts-tutorial / icassp2022
☆64Updated 2 years ago
rishikksh20 / Phone-Level-Mixture-Density-Network-for-TTS
Rich Prosody Diversity Modelling with Phone-level Mixture Density Network
☆45Updated 2 years ago
rishikksh20 / Zero-Shot-TTS
Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration
☆34Updated 3 years ago
choiHkk / CVAEJETS
Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech
☆46Updated 2 years ago
prml-lab-speech-team / demo
☆25Updated 3 months ago
sarulab-speech / multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Updated 3 years ago
LAION-AI / emotional-speech-annotations
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆30Updated last month
miccio-dk / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆16Updated 2 years ago
liuhuadai / ViT-TTS
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆10Updated last year
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆66Updated last year
rishikksh20 / iSTFT-Avocodo-pytorch
Ultrafast GAN based Vocoder for Text to Speech
☆50Updated 2 years ago
choiHkk / VAEJETS
Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
☆22Updated 2 years ago
keonlee9420 / Daft-Exprt
PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆56Updated 3 years ago
haiciyang / LaDiffCodec
☆47Updated last week
AlanBaade / SyllableLM
Official Code for SyllableLM: Learning Coarse Semantic Units for Speech Language Models
☆35Updated last month
Dapwner / CVAE-Tacotron
☆23Updated 5 months ago
rishikksh20 / UnivNet-pytorch
UnivNet: A Neural Vocoder with Multi-Resolution Spectrogram Discriminators for High-Fidelity Waveform Generation
☆72Updated 3 years ago
lifeiteng / TTS-TextAnalyzer
TTS Text Analyzer
☆32Updated last year
insunhwang89 / StyleVC
☆30Updated last year
ex3ndr / supervoice-librilight-preprocessed
60k hours of phoneme-aligned audio from audio books
☆18Updated 3 months ago
RF5 / simple-asgan
Training code and trained checkpoints for ASGAN.
☆60Updated 10 months ago