TTS Text Analyzer
☆31Jul 20, 2023Updated 2 years ago
Alternatives and similar repositories for TTS-TextAnalyzer
Users that are interested in TTS-TextAnalyzer are comparing it to the libraries listed below
- ☆33Jun 29, 2023Updated 2 years ago
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Aug 7, 2023Updated 2 years ago
- ☆21Feb 27, 2024Updated 2 years ago
- ☆25Mar 12, 2022Updated 4 years ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆103Feb 5, 2024Updated 2 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Sep 21, 2022Updated 3 years ago
- ☆16Apr 4, 2022Updated 3 years ago
- ☆45Dec 16, 2019Updated 6 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- ☆69Mar 31, 2021Updated 4 years ago
- ☆80Aug 8, 2025Updated 7 months ago
- ☆26Sep 22, 2022Updated 3 years ago
- ☆56Jul 17, 2023Updated 2 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Jun 17, 2025Updated 9 months ago
- ☆55Jan 13, 2023Updated 3 years ago
- An attempt to replicate the architecture of MiniMaxTTS described in its technical report☆48Sep 2, 2025Updated 6 months ago
- ☆71Jul 13, 2023Updated 2 years ago
- Unofficial implementation of Megatts2☆288Mar 23, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated 2 years ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆56Dec 11, 2022Updated 3 years ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆35Aug 30, 2025Updated 6 months ago
- 《SpeechGen: Unlocking the Generative Power of Speech Language Models with Prompts》☆77Jun 9, 2023Updated 2 years ago
- Avocodo: Generative Adversarial Network for Artifact-free Vocoder☆122Jul 14, 2022Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- Implementation of "DurIAN: Duration Informed Attention Network For Multimodal Synthesis".☆14Jul 6, 2020Updated 5 years ago
- ☆39Oct 1, 2023Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- PyTorch implementation of BigVSAN☆203Dec 9, 2025Updated 3 months ago
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- The reproduced code for Google's SoundStorm☆272Oct 7, 2023Updated 2 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- GPT-style network for phonemization with durations of text☆68Mar 21, 2024Updated 2 years ago
- ☆19Mar 22, 2024Updated 2 years ago
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆237Feb 29, 2024Updated 2 years ago