π€π¬ Transformer TTS: Implementation of a non-autoregressive Transformer based neural network for text to speech.
β1,159May 3, 2024Updated last year
Alternatives and similar repositories for TransformerTTS
Users that are interested in TransformerTTS are comparing it to the libraries listed below
Sorting:
- A Pytorch Implementation of "Neural Speech Synthesis with Transformer Network"β690Nov 8, 2023Updated 2 years ago
- β© Generating speech in a single forward pass without any attention!β581Feb 20, 2026Updated last week
- TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germaβ¦β3,994Jul 5, 2024Updated last year
- MelGAN vocoder (compatible with NVIDIA/tacotron2)β650Oct 3, 2020Updated 5 years ago
- A Generative Flow for Text-to-Speech via Monotonic Alignment Searchβ702Jul 12, 2022Updated 3 years ago
- The Implementation of FastSpeech based on pytorch.β880Jul 6, 2023Updated 2 years ago
- An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.β844Oct 10, 2023Updated 2 years ago
- Mellotron: a multispeaker voice synthesis model based on Tacotron 2 GST that can make a voice emote and sing without emotive or singing tβ¦β866Jul 22, 2023Updated 2 years ago
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorchβ1,638Apr 22, 2024Updated last year
- List of speech synthesis papers.β1,066Jul 24, 2023Updated 2 years ago
- WaveRNN Vocoder + TTSβ2,177Jul 2, 2022Updated 3 years ago
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"β2,157Oct 27, 2023Updated 2 years ago
- Flowtron is an auto-regressive flow-based generative network for text to speech synthesis with control over speech variation and style trβ¦β900Jul 6, 2023Updated 2 years ago
- A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised durationβ¦β328Sep 24, 2022Updated 3 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesisβ2,320Jul 27, 2024Updated last year
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.β360Mar 25, 2023Updated 2 years ago
- β262Dec 8, 2022Updated 3 years ago
- PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modelingβ191Nov 18, 2021Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020β266Mar 29, 2022Updated 3 years ago
- Unsupervised Speech Decomposition Via Triple Information Bottleneckβ698Oct 23, 2024Updated last year
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Networkβ321Jul 25, 2024Updated last year
- Implementation of "Duration Informed Attention Network for Multimodal Synthesis" paper in PyTorch.β184Aug 12, 2020Updated 5 years ago
- πΈ collection of TTS papersβ724Jul 4, 2024Updated last year
- PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speechβ232Jun 22, 2022Updated 3 years ago
- A Neural Grapheme-to-Phoneme Conversion Package for Mandarin Chinese Based on a New Open Benchmark Datasetβ361Dec 24, 2021Updated 4 years ago
- PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generationβ197Feb 10, 2022Updated 4 years ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!β360Apr 27, 2022Updated 3 years ago
- This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.β601Sep 18, 2023Updated 2 years ago
- A pytroch implementation of the EETS: End-to-End Adversarial Text-to-Speechβ127Jul 16, 2020Updated 5 years ago
- Include Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.β157Jul 2, 2021Updated 4 years ago
- Simple text to phones converter for multiple languagesβ1,513Sep 26, 2024Updated last year
- End-to-End Speech Processing Toolkitβ9,747Updated this week
- A summary on our attempts at using Deep Learning approaches for Emotional Text to Speechβ458Jun 26, 2024Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesisβ1,037Aug 28, 2023Updated 2 years ago
- Official implementation of Meta-StyleSpeech and StyleSpeechβ252Feb 9, 2022Updated 4 years ago
- VQ-VAE for Acoustic Unit Discovery and Voice Conversionβ339Jul 6, 2023Updated 2 years ago
- PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,β¦β317Aug 25, 2021Updated 4 years ago
- Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.β408Jul 7, 2021Updated 4 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.β229Aug 17, 2020Updated 5 years ago