microsoft / PhoneticMatching
A phonetic matching library. Includes text utilities to do string comparisons on phonemes (the sound of the string), as opposed to characters.
☆157Updated last year
Related projects ⓘ
Alternatives and complementary repositories for PhoneticMatching
- Port of PragmaticSegmenter for sentence boundary detection☆33Updated 3 years ago
- Unsupervised factor-based text tokenizer for natural-language processing applications☆17Updated 4 years ago
- A tool for automatic phoneme transcription☆157Updated last year
- Program to benchmark various speech recognition APIs☆79Updated 5 years ago
- Pure C# port of the Pocketsphinx keyword spotter☆12Updated 4 years ago
- Microsoft Speech Language Translation (MSLT) Corpus☆19Updated 7 years ago
- Proof of concept of training a simple Region Classifier using PdfPig and ML.NET (LightGBM). The objective is to classify each text block…☆23Updated 4 years ago
- pronunciation dictionaries for multiple languages☆83Updated 7 years ago
- DeepSpeech based forced alignment tool☆235Updated 3 years ago
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- A tool for transcribing orthographic text as IPA (International Phonetic Alphabet)☆655Updated 2 months ago
- C# Word2Vec object with fast neighbor search. Format compatible with gensim☆25Updated 4 years ago
- SpacyDotNet is a .NET wrapper for the popular natural language library spaCy☆32Updated 3 years ago
- Model for recasing and repunctuating ASR transcripts☆129Updated 7 months ago
- The CMU Pronouncing Dictionary converted to IPA☆78Updated 5 years ago
- Text and Punctuation correction with Deep Learning☆129Updated 4 years ago
- Fast Word Segmentation with Triangular Matrix☆77Updated 3 years ago
- Covering grammars for English and Russian text normalization☆60Updated 5 years ago
- An LSTM RNN for restoring missing punctuation in unsegmented text.☆79Updated 8 years ago
- British English pronunciation dictionary☆89Updated 7 years ago
- An off-the-shelf client-side language identification module for JavaScript.☆14Updated 10 years ago
- Automatically exported from code.google.com/p/m2m-aligner☆42Updated 8 years ago
- Tool for creation, manipulation and maintenance of voice corpora☆81Updated 6 months ago
- Massively multilingual pronunciation mining☆321Updated this week
- Python interface to ISLEX, an English IPA pronunciation dictionary with syllable and stress marking.☆47Updated 11 months ago
- .NET Standard wrapper for fastText library. Now works on Windows, Linux and MacOs!☆70Updated last month
- Fast approximate strings search & spelling correction☆57Updated 3 years ago
- Labeled data for homograph disambiguation☆53Updated last year
- A sentence segmenter that actually works!☆302Updated 4 years ago
- CMUdict maintenance, and tools☆202Updated 5 months ago