matthewmorrone/cmudict-ipa

CMU dictionary in IPA instead of their subset of Arpabet

☆16

Alternatives and similar repositories for cmudict-ipa

Users who are interested in cmudict-ipa are comparing it to the libraries listed below.

menelik3 / cmudict-ipa
The CMU Pronouncing Dictionary converted to IPA
☆97 · Jun 29, 2019 · Updated 7 years ago
kate-egorova / ASR-hybrid-decoding
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11 · Feb 4, 2020 · Updated 6 years ago
wwesantos / arpabet-to-ipa
Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…
☆43 · Jul 31, 2020 · Updated 6 years ago
Connum / npm-pinyin2ipa
Converts Mandarin Chinese pinyin notation to IPA (international phonetic alphabet) notation
☆19 · Nov 28, 2023 · Updated 2 years ago
coryshain / dnnseg
☆11 · Mar 20, 2021 · Updated 5 years ago
talker93 / oneMinTTS
Launch your speech synthesis within one minute.
☆12 · May 6, 2024 · Updated 2 years ago
lociko / ukraine_itn_wfst
Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini
☆19 · Oct 21, 2025 · Updated 9 months ago
dogancan / expected-edit-distance
Expected edit distance implementation using OpenFst tools
☆11 · May 13, 2015 · Updated 11 years ago
markusdr / transducersaurus
Automatically exported from code.google.com/p/transducersaurus
☆11 · Apr 1, 2015 · Updated 11 years ago
danijel3 / SparrowhawkTest
A simple tutorial on setting up Sparrowhawk - a text-to-speech normalization engine
☆14 · Oct 16, 2017 · Updated 8 years ago
DanielSWolf / wiki-pronunciation-dict
Pronunciation dictionaries for several languages, based on Wiktionary data.
☆21 · Nov 28, 2021 · Updated 4 years ago
tiro-is / tiro-speech-core
This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core
☆15 · Jun 19, 2023 · Updated 3 years ago
motazsaad / ara-pronunciation-tool
A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …
☆15 · Sep 5, 2017 · Updated 8 years ago
rhdunn / amepd
American English Pronunciation Dictionary
☆35 · Apr 16, 2018 · Updated 8 years ago
ccoreilly / deepspeech-catala
Deepspeech ASR Model for the Catalan Language
☆17 · Feb 15, 2021 · Updated 5 years ago
mozilla / g2p
INACTIVE - http://mzl.la/ghe-archive - Tools to create ARPA models from cmu pocketsphinx dictionaries for proper g2p generation
☆21 · Mar 29, 2019 · Updated 7 years ago
MLSpeech / DeepPhoneticToolsTutorial
Tutorial on {Deep} Phonetic Tools given in BigPhon @ LabPhon15
☆12 · Apr 17, 2017 · Updated 9 years ago
jacquelineCelia / lexicon_discovery
Source code for "Unsupervised Lexicon Discovery from Acoustic Input", Lee et al, 2015 TACL
☆10 · Aug 11, 2016 · Updated 9 years ago
open-dict-data / wikidict-en
Wikipedia Bilingual Reference Data (English)
☆18 · Jun 17, 2016 · Updated 10 years ago
abuccts / wikt2pron
A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format
☆34 · Jul 5, 2019 · Updated 7 years ago
CUNY-CL / wikipron-modeling
Proposed splits for the LREC Wikipron paper
☆15 · Apr 7, 2020 · Updated 6 years ago
gullabi / STT-align
Coqui STT (🐸STT) based forced alignment tool
☆13 · Feb 24, 2022 · Updated 4 years ago
TalnUPF / praat_web
☆13 · Jun 30, 2026 · Updated last month
clbecker / perl-wiktionary-parser
Client and parser of documents pulled from the wiktionary api
☆15 · Jun 8, 2016 · Updated 10 years ago
dan-wells / kiss-aligner
Simple Kaldi recipe for forced alignment
☆11 · Jul 16, 2023 · Updated 3 years ago
sigmorphon / 2020
SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…
☆36 · Apr 25, 2025 · Updated last year
athena-team / athena-transform
☆21 · Jan 13, 2020 · Updated 6 years ago
mcfletch / listener
Pocketsphinx-based Linux Voice Dictation
☆26 · Jun 12, 2020 · Updated 6 years ago
amazon-science / proteno
This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…
☆45 · May 25, 2021 · Updated 5 years ago
CoEDL / kaldi_helpers
A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.
☆15 · May 19, 2020 · Updated 6 years ago
ucam-smt / ucam-smt
Cambridge SMT System
☆18 · Aug 1, 2017 · Updated 8 years ago
burrmill / burrmill
BurrMill core
☆22 · Nov 2, 2021 · Updated 4 years ago
turnerdan / joethecorpusrogan
A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a…
☆21 · Jan 26, 2020 · Updated 6 years ago
rhasspy / phonetisaurus-pypi
Python wrapper for phonetisaurus grapheme to phoneme tool
☆12 · Mar 11, 2021 · Updated 5 years ago
isi-nlp / carmel
finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests
☆15 · Jan 24, 2017 · Updated 9 years ago
hlt-bme-hu / hunspeech
☆14 · Jan 24, 2017 · Updated 9 years ago
noajshu / scotus-speech
Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court
☆22 · Updated this week
JoseLlarena / Britfone
British English pronunciation dictionary
☆107 · Oct 19, 2017 · Updated 8 years ago
danijel3 / ClarinStudioKaldi
A baseline Automatic Speech Recognition system for Polish based on Kaldi.
☆18 · Dec 21, 2021 · Updated 4 years ago