tomaarsen/TTSTextNormalization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tomaarsen/TTSTextNormalization)

tomaarsen / TTSTextNormalization

Convert English text from written expressions into spoken forms

☆32

Alternatives and similar repositories for TTSTextNormalization

Users that are interested in TTSTextNormalization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dariox1337 / whisper-writer
View on GitHub
💬📝 A small dictation app using OpenAI's Whisper speech recognition model.
☆11Sep 13, 2024Updated last year
Chung-I / youtube-asr-crawler
View on GitHub
☆10Sep 19, 2022Updated 3 years ago
facebookresearch / lst
View on GitHub
Code for Latent Speech-Text Transformer (LST)
☆35Mar 12, 2026Updated 4 months ago
rishikksh20 / SoundStorm-pytorch
View on GitHub
Google's SoundStorm: Efficient Parallel Audio Generation
☆131Aug 8, 2023Updated 2 years ago
Reeceobligacion / Reserve-California-Bot
View on GitHub
☆12Sep 9, 2019Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
AndreevP / wvmos
View on GitHub
MOS score prediction by fine-tuned wav2vec2.0 model
☆180Oct 20, 2022Updated 3 years ago
ldong1111 / GraphemeBERT
View on GitHub
This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models
☆48Mar 25, 2022Updated 4 years ago
pengzhendong / audio-pipeline
View on GitHub
☆23Oct 17, 2024Updated last year
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
yl4579 / PL-BERT
View on GitHub
Phoneme-Level BERT for Enhanced Prosody of Text-to-Speech with Grapheme Predictions
☆269Jan 13, 2025Updated last year
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
CODEJIN / HierSpeech
View on GitHub
☆67Jul 16, 2023Updated 3 years ago
light1726 / BetaVAE_VC
View on GitHub
Implementation for paper "Disentangled Speech Representation Learning for One-Shot Cross-Lingual Voice Conversion Using ß-VAE"
☆43Apr 10, 2023Updated 3 years ago
google-research-datasets / WikipediaHomographData
View on GitHub
Labeled data for homograph disambiguation
☆62Jun 1, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
neonbjb / tts-scores
View on GitHub
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆175Dec 18, 2023Updated 2 years ago
MiniXC / phones
View on GitHub
A collection of utilities for handling IPA phones.
☆27Sep 24, 2023Updated 2 years ago
NVIDIA / NeMo-text-processing
View on GitHub
NeMo text processing for ASR and TTS
☆484Updated this week
davidfcarr / rsvpmaker
View on GitHub
RSVPMaker Events and Registration Plugin for WordPress
☆14Updated this week
ajaybati / miipher2.0
View on GitHub
Reimplementation of Miipher
☆30Aug 16, 2023Updated 2 years ago
MTG / PodcastMix-inference
View on GitHub
☆32Jan 6, 2022Updated 4 years ago
bshall / acoustic-model
View on GitHub
Acoustic models for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆104Mar 10, 2026Updated 4 months ago
IDEA-Emdoor-Lab / DistilCodec
View on GitHub
A Neural Audio Codec (NAC) for Universal Audio
☆46May 30, 2025Updated last year
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
rishikksh20 / NaturalSpeech2
View on GitHub
☆69May 19, 2023Updated 3 years ago
keonlee9420 / DailyTalk
View on GitHub
Official repository of DailyTalk: Spoken Dialogue Dataset for Conversational Text-to-Speech, ICASSP 2023
☆259Jun 5, 2025Updated last year
maxrmorrison / pypar
View on GitHub
Phoneme alignment representation compatible with multiple forced aligners
☆22Apr 12, 2024Updated 2 years ago
google-research-datasets / TextNormalizationCoveringGrammars
View on GitHub
Covering grammars for English and Russian text normalization
☆61Sep 15, 2019Updated 6 years ago
CODEJIN / NaturalSpeech2
View on GitHub
☆139Jan 7, 2024Updated 2 years ago
ScottishFold007 / TTSAudioNormalizer
View on GitHub
TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…
☆112Dec 20, 2024Updated last year
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
prosodylab / prosodylab.dictionaries
View on GitHub
A repository for dictionaries to be used with the Prosodylab-Aligner
☆17May 13, 2014Updated 12 years ago
Jackson-Kang / Prosody-augmentation-for-Text-to-speech
View on GitHub
Simple tool for speech dataset augmentation for modeling various prosodies.
☆14Jan 14, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆35Jul 31, 2024Updated last year
jbvolmer / soundstretch
View on GitHub
sound stretch python module
☆11May 1, 2019Updated 7 years ago
Tele-AI / TELEVAL
View on GitHub
☆24Jun 10, 2026Updated last month
axelspringer / DeepPhonemizer
View on GitHub
Grapheme to phoneme conversion with deep learning.
☆432Dec 8, 2023Updated 2 years ago
tincans-ai / gazelle-inference
View on GitHub
proof of concept conversation orchestrator with a speech-language model
☆20Oct 19, 2024Updated last year
vtuber-plan / NSF-HiFiGAN
View on GitHub
Vocoder NSF-HiFiGAN (Moved into deepaudio)
☆56Dec 11, 2022Updated 3 years ago
p0p4k / vits3_pytorch
View on GitHub
☆28Nov 15, 2023Updated 2 years ago