lingjzhu / probing-TTS-models
Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf
☆32Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for probing-TTS-models
Users who are interested in probing-TTS-models are comparing it to the libraries listed below.
- ☆12Jul 6, 2023Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Jul 31, 2024Updated last year
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Jun 2, 2020Updated 5 years ago
- Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge☆21Jul 25, 2022Updated 3 years ago
- Code for paper titled "Using generative modelling to produce varied intonation for speech synthesis" submitted to the Speech Synthesis Wo…☆24Dec 8, 2019Updated 6 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- Implementation of the AlignTTS☆77Jul 6, 2023Updated 2 years ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Dec 30, 2023Updated 2 years ago
- ☆26Sep 22, 2022Updated 3 years ago
- ☆15May 8, 2021Updated 4 years ago
- Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.☆15Nov 25, 2023Updated 2 years ago
- Model Fusion Based Prosody Prediction☆17Mar 18, 2018Updated 7 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- An unofficial PyTorch implementation of Mix-Phoneme-Bert☆40Jul 10, 2023Updated 2 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Aug 30, 2019Updated 6 years ago
- Compressed version of Tacotron 2 using Tensor Train + Waveglow.☆22Dec 26, 2019Updated 6 years ago
- ☆37May 8, 2021Updated 4 years ago
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Dec 17, 2020Updated 5 years ago
- ☆23Oct 17, 2024Updated last year
- Gaussian Mixture VAE Tacotron☆53Jul 6, 2023Updated 2 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- ☆134Feb 4, 2023Updated 3 years ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 4 months ago
- ☆21Jun 16, 2021Updated 4 years ago
- ☆26Jun 5, 2024Updated last year
- Framework for Deep Speech Recognition☆11Nov 22, 2022Updated 3 years ago
- Code for ACL 2023 main conference paper "Back Translation for Speech-to-text Translation Without Transcripts".☆12Oct 25, 2023Updated 2 years ago
- Urdu Word Segmentation using Conditional Random Fields (CRFs)☆12Oct 3, 2018Updated 7 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- text to speech☆10Mar 19, 2024Updated last year
- ☆25Mar 12, 2022Updated 3 years ago
- A collection of examples demonstrating how we can build speech synthesis systems using nnmnkwii.☆71May 15, 2020Updated 5 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Aug 17, 2020Updated 5 years ago
- A collection of utilities for handling IPA phones.☆26Sep 24, 2023Updated 2 years ago
- ☆80Aug 8, 2025Updated 6 months ago
- ☆14Aug 16, 2023Updated 2 years ago
- Text normalization before TTS: converts digits and letters into Chinese characters☆12Apr 27, 2024Updated last year
- ☆45Dec 16, 2019Updated 6 years ago