Convert English text from written expressions into spoken forms
β28Jun 22, 2022Updated 3 years ago
Alternatives and similar repositories for TTSTextNormalization
Users that are interested in TTSTextNormalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π¬π A small dictation app using OpenAI's Whisper speech recognition model.β11Sep 13, 2024Updated last year
- β10Sep 19, 2022Updated 3 years ago
- Google's SoundStorm: Efficient Parallel Audio Generationβ131Aug 8, 2023Updated 2 years ago
- MOS score prediction by fine-tuned wav2vec2.0 modelβ177Oct 20, 2022Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme modelsβ48Mar 25, 2022Updated 4 years ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- β55Jan 13, 2023Updated 3 years ago
- β23Oct 17, 2024Updated last year
- Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictionsβ268Jan 13, 2025Updated last year
- β11Sep 9, 2019Updated 6 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into oneβ26Aug 5, 2024Updated last year
- Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using Γ-VAE"β44Apr 10, 2023Updated 3 years ago
- A collection of utilities for handling IPA phones.β26Sep 24, 2023Updated 2 years ago
- Labeled data for homograph disambiguationβ62Jun 1, 2023Updated 2 years ago
- β67Jul 16, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- NeMo text processing for ASR and TTSβ450Updated this week
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsβ176Dec 18, 2023Updated 2 years ago
- RSVPMaker Events and Registration Plugin for WordPressβ15Mar 28, 2026Updated last week
- β32Jan 6, 2022Updated 4 years ago
- Reimplementation of Miipherβ30Aug 16, 2023Updated 2 years ago
- Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversionβ104Mar 10, 2026Updated last month
- β69May 19, 2023Updated 2 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline systemβ36Oct 28, 2019Updated 6 years ago
- Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023β252Jun 5, 2025Updated 10 months ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Covering grammars for English and Russian text normalizationβ61Sep 15, 2019Updated 6 years ago
- Phoneme alignment representation compatible with multiple forced alignersβ22Apr 12, 2024Updated last year
- Project for HIDING SPEAKERβS SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINEβ15Nov 30, 2022Updated 3 years ago
- Simple tool for speech dataset augmentation for modeling various prosodies.β14Jan 14, 2021Updated 5 years ago
- A repository for dictionaries to be used with the Prosodylab-Alignerβ17May 13, 2014Updated 11 years ago
- β139Jan 7, 2024Updated 2 years ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loudβ¦β111Dec 20, 2024Updated last year
- sound stretch python moduleβ11May 1, 2019Updated 6 years ago
- Grapheme to phoneme conversion with deep learning.β426Dec 8, 2023Updated 2 years ago
- Open source password manager - Proton Pass β’ AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- CML-TTS: A Multilingual Dataset for Speech Synthesisβ37Jul 31, 2024Updated last year
- β26Jun 5, 2024Updated last year
- proof of concept conversation orchestrator with a speech-language modelβ20Oct 19, 2024Updated last year
- β26Sep 22, 2022Updated 3 years ago
- β28Nov 15, 2023Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech β¦β28Nov 7, 2025Updated 5 months ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalizationβ103Feb 5, 2024Updated 2 years ago