Convert English text from written expressions into spoken forms
★28 · Jun 22, 2022 · Updated 3 years ago
Alternatives and similar repositories for TTSTextNormalization
Users who are interested in TTSTextNormalization compare it to the libraries listed below.
- A small dictation app using OpenAI's Whisper speech recognition model. ★11 · Sep 13, 2024 · Updated last year
- ★10 · Sep 19, 2022 · Updated 3 years ago
- Google's SoundStorm: Efficient Parallel Audio Generation ★131 · Aug 8, 2023 · Updated 2 years ago
- MOS score prediction by fine-tuned wav2vec2.0 model ★176 · Oct 20, 2022 · Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ★48 · Mar 25, 2022 · Updated 3 years ago
- ★55 · Jan 13, 2023 · Updated 3 years ago
- ★23 · Oct 17, 2024 · Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions ★268 · Jan 13, 2025 · Updated last year
- ★11 · Sep 9, 2019 · Updated 6 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ★26 · Aug 5, 2024 · Updated last year
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using β-VAE" ★44 · Apr 10, 2023 · Updated 2 years ago
- A collection of utilities for handling IPA phones. ★26 · Sep 24, 2023 · Updated 2 years ago
- Labeled data for homograph disambiguation ★62 · Jun 1, 2023 · Updated 2 years ago
- ★68 · Jul 16, 2023 · Updated 2 years ago
- NeMo text processing for ASR and TTS ★443 · Mar 13, 2026 · Updated last week
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models ★176 · Dec 18, 2023 · Updated 2 years ago
- ★32 · Jan 6, 2022 · Updated 4 years ago
- RSVPMaker Events and Registration Plugin for WordPress ★15 · Updated this week
- Reimplementation of Miipher ★29 · Aug 16, 2023 · Updated 2 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion ★104 · Mar 10, 2026 · Updated last week
- ★69 · May 19, 2023 · Updated 2 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system ★36 · Oct 28, 2019 · Updated 6 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023 ★252 · Jun 5, 2025 · Updated 9 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis ★34 · Jul 31, 2024 · Updated last year
- Phoneme alignment representation compatible with multiple forced aligners ★22 · Apr 12, 2024 · Updated last year
- Project for HIDING SPEAKER'S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE ★15 · Nov 30, 2022 · Updated 3 years ago
- A repository for dictionaries to be used with the Prosodylab-Aligner ★17 · May 13, 2014 · Updated 11 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies. ★14 · Jan 14, 2021 · Updated 5 years ago
- ★140 · Jan 7, 2024 · Updated 2 years ago
- sound stretch python module ★11 · May 1, 2019 · Updated 6 years ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud… ★111 · Dec 20, 2024 · Updated last year
- Grapheme to phoneme conversion with deep learning. ★421 · Dec 8, 2023 · Updated 2 years ago
- ★26 · Jun 5, 2024 · Updated last year
- proof of concept conversation orchestrator with a speech-language model ★20 · Oct 19, 2024 · Updated last year
- ★26 · Sep 22, 2022 · Updated 3 years ago
- ★28 · Nov 15, 2023 · Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech … ★28 · Nov 7, 2025 · Updated 4 months ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization ★103 · Feb 5, 2024 · Updated 2 years ago
- Incorporating AutoVocoder to MB-iSTFT-VITS ★48 · Dec 1, 2022 · Updated 3 years ago