Neural Lexicon Reader: Reduce Pronunciation Errors in End-to-end TTS by Leveraging External Textual Knowledge
☆21Jul 25, 2022Updated 3 years ago
Alternatives and similar repositories for neural-lexicon-reader
Users that are interested in neural-lexicon-reader are comparing it to the libraries listed below
Sorting:
- Chinese polyphone disambiguation for Text-to-Speech application☆42Jun 11, 2024Updated last year
- ☆39Apr 15, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆42Oct 13, 2023Updated 2 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- ☆25Mar 12, 2022Updated 3 years ago
- ☆25Mar 6, 2024Updated 2 years ago
- TTS前,文本标准化,将数字字母处理转化为汉字☆12Apr 27, 2024Updated last year
- ☆14Aug 16, 2023Updated 2 years ago
- A simple app for recording speech datasets.☆26Jun 27, 2022Updated 3 years ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆81Jun 7, 2024Updated last year
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).☆51Jun 11, 2024Updated last year
- ☆16Feb 19, 2026Updated 2 weeks ago
- A repository comprising of code for generation of noisy speech data from clean data using deep learning methods☆16Jul 12, 2021Updated 4 years ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- ☆31Nov 7, 2018Updated 7 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64May 30, 2023Updated 2 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Aug 31, 2020Updated 5 years ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆59Jun 20, 2024Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆57Oct 31, 2023Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Sep 24, 2021Updated 4 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Link to paper: https://www.isca-speech.org/archive_v0/SpeechProsody_2020/pdfs/51.pdf☆32Jul 6, 2023Updated 2 years ago
- Digital Audio Effects in Python (material for MUSI6202@Georgiatech)☆15Nov 30, 2014Updated 11 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Apr 11, 2020Updated 5 years ago
- This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"☆16Jun 27, 2018Updated 7 years ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆68Dec 13, 2021Updated 4 years ago
- Audio samples from ICML2019 "Almost Unsupervised Text to Speech and Automatic Speech Recognition"☆17May 14, 2019Updated 6 years ago
- ASR & TTS joint training, asr, tts, machine speech chain☆16Oct 16, 2021Updated 4 years ago
- A Python3 program for converting Japanese words and numbers into phonemes.☆18Apr 24, 2018Updated 7 years ago
- All generative model in one for better TTS model☆74Sep 8, 2024Updated last year
- A tool for assignment to a slice in TensorFlow☆20Jul 23, 2021Updated 4 years ago
- ☆46Apr 16, 2023Updated 2 years ago
- EMPHASIS: An Emotional Phoneme-based Acoustic Model for Speech Synthesis System☆15Mar 31, 2019Updated 6 years ago