Links to data used in Sproat & Jaitly (https://arxiv.org/abs/1611.00068) experiments.
☆77Jul 9, 2021Updated 4 years ago
Alternatives and similar repositories for text-normalization-data
Users that are interested in text-normalization-data are comparing it to the libraries listed below
Sorting:
- RNNs for Text Normalization☆40Dec 12, 2017Updated 8 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45May 25, 2021Updated 4 years ago
- ☆213Jun 16, 2018Updated 7 years ago
- Covering grammars for English and Russian text normalization☆60Sep 15, 2019Updated 6 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Sep 3, 2013Updated 12 years ago
- Estonian text-to-speech text normalization pipeline☆12Dec 17, 2025Updated 3 months ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation☆21Mar 29, 2019Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for Text-to-speech system☆37Apr 12, 2018Updated 7 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Oct 16, 2017Updated 8 years ago
- Various scripts that facilitate the preparation of Automatic Speech Recognition related resources☆17Apr 16, 2020Updated 5 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Speech waveform synthesis filters☆13Jul 21, 2017Updated 8 years ago
- A module for normalising text.☆172Oct 27, 2021Updated 4 years ago
- Multilingual Grapheme to Phoneme☆51Feb 23, 2016Updated 10 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 11 years ago
- ☆21Apr 4, 2015Updated 10 years ago
- ☆26Apr 21, 2021Updated 4 years ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain☆60Jul 2, 2019Updated 6 years ago
- PolEval 2021 Task 1☆15Jun 28, 2022Updated 3 years ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney☆175Dec 16, 2025Updated 3 months ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Aug 6, 2022Updated 3 years ago
- Sequence-to-sequence TTS based on Kyubyong's dc_tts☆61Feb 2, 2023Updated 3 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Text☆247Oct 30, 2019Updated 6 years ago
- ☆10Mar 20, 2021Updated 5 years ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆82Dec 24, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- A TensorFlow Implementation of Punctuation Restoration.☆18Nov 9, 2020Updated 5 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- Crawling engine that crawls a set of top-level domains looking for documents in a list of languages☆11Feb 6, 2024Updated 2 years ago
- ☆47May 22, 2017Updated 8 years ago
- PAVOQUE Corpus of Expressive Speech☆12Aug 2, 2016Updated 9 years ago
- G2P with Tensorflow☆681Jul 29, 2024Updated last year
- ☆17Nov 25, 2019Updated 6 years ago
- ☆42Jul 17, 2018Updated 7 years ago
- A GPU language model, based on btree backed tries.☆29Mar 6, 2018Updated 8 years ago