microsoft / PhoneticMatching
A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to characters.
☆157Updated last year
Alternatives and similar repositories for PhoneticMatching:
Users that are interested in PhoneticMatching are comparing it to the libraries listed below
- A tool for automatic phoneme transcription☆157Updated last year
- pronunciation dictionaries for multiple languages☆86Updated 7 years ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Updated 4 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆82Updated 10 months ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆79Updated 8 years ago
- A Collection of Speech Corpus for ASR and TTS☆113Updated 7 years ago
- Fast approximate strings search & spelling correction☆58Updated 3 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆41Updated 6 years ago
- Server framework for Kaldi ASR Toolkit☆98Updated last year
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 4 years ago
- Program to benchmark various speech recognition APIs☆80Updated 5 years ago
- DeepSpeech based forced alignment tool☆237Updated 4 years ago
- g2p: English Grapheme To Phoneme Conversion☆839Updated 2 years ago
- Multilingual Grapheme to Phoneme☆49Updated 9 years ago
- Port of PragmaticSegmenter for sentence boundary detection☆34Updated 3 years ago
- Command line tool to create corpora for Common Voice☆75Updated 9 months ago
- Labeled data for homograph disambiguation☆56Updated last year
- Word Segmentation with Dynamic Programming☆20Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆35Updated 2 weeks ago
- Fast Word Segmentation with Triangular Matrix☆81Updated 3 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- A phoneme-allophone database for many languages☆50Updated 4 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆131Updated 11 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆149Updated this week
- Experimental project to punctuate text using a embedding layer, single convolutional layer and output softmax layer written in Keras.☆83Updated 4 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆156Updated last year
- Grapheme To Phoneme☆70Updated 7 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videos☆154Updated 5 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 10 months ago
- CMUdict maintenance, and tools☆210Updated 2 months ago