matthewmorrone / cmudict-ipa
CMU dictionary in IPA instead of their subset of Arpabet
☆14Updated last month
Related projects ⓘ
Alternatives and complementary repositories for cmudict-ipa
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆40Updated 2 months ago
- ☆19Updated 6 years ago
- Self-contained Python package for OpenFst☆50Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 4 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine☆14Updated 7 years ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Updated last year
- Labeled data for homograph disambiguation☆53Updated last year
- Easier analysis of large speech corpora☆21Updated 3 years ago
- A collection of utilities for handling IPA phones.☆24Updated last year
- Simple Kaldi recipe for forced alignment☆10Updated last year
- ☆31Updated 2 months ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- The CMU Pronouncing Dictionary converted to IPA☆78Updated 5 years ago
- BurrMill core☆21Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated last year
- Word Error Rate Estimation☆10Updated 4 years ago
- Grapheme-to-Phoneme conversion with Joint-Sequence RnnLMs☆29Updated 9 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆33Updated 2 years ago
- Multilingual Grapheme to Phoneme☆49Updated 8 years ago
- A phoneme-allophone database for many languages☆48Updated 4 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 7 months ago
- ☆22Updated 3 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆58Updated 2 months ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- ☆56Updated last year
- Convert words to numbers☆20Updated 2 years ago