elazarg / nakdimonLinks
Hebrew Diacritizer
☆45Updated last month
Alternatives and similar repositories for nakdimon
Users that are interested in nakdimon are comparing it to the libraries listed below
Sorting:
- Hebrew grapheme to phoneme (G2P)☆79Updated last month
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Hebrew nikud with transfomers☆21Updated 10 months ago
- phone inventory library☆17Updated 2 years ago
- ☆56Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆18Updated 3 years ago
- Python module for syllabifying English ARPABET transcriptions☆71Updated 6 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 9 months ago
- ☆13Updated 3 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆90Updated last year
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆25Updated 3 years ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- ☆19Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Updated 3 years ago
- Audiobook alignment for Indigenous languages☆45Updated this week
- Model for recasing and repunctuating ASR transcripts☆142Updated last year
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Updated 8 years ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆22Updated last month
- A JavaScript-based converter for transliterating Amharic text into Latin characters☆19Updated 3 years ago
- CMU dictionary in IPA instead of their subset of Arpabet☆16Updated last year
- IPA tokeniser☆17Updated 4 months ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆96Updated 2 years ago
- Workflow for forced alignment between languages☆23Updated last year
- Timething is a library for aligning text transcripts with their audio recordings.☆126Updated last year
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 6 years ago
- ☆38Updated last year
- Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court☆22Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆31Updated 5 months ago