morrisalp / unikudLinks
Hebrew nikud with transfomers
☆21Updated 11 months ago
Alternatives and similar repositories for unikud
Users that are interested in unikud are comparing it to the libraries listed below
Sorting:
- Hebrew Diacritizer☆48Updated 3 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆41Updated last year
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Updated 2 years ago
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆26Updated 3 years ago
- Hebrew grapheme to phoneme (G2P)☆85Updated last month
- 📝An easy-to-use package to restore punctuation of the text.☆119Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆34Updated 7 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆54Updated 2 years ago
- An NLP pipeline for Hebrew☆40Updated 7 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆67Updated 3 weeks ago
- A python module to reduce Unicode to a 'good enough' ASCII representation (outdated Github copy)☆44Updated 15 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆112Updated 2 weeks ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 9 months ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- ☆39Updated this week
- Massively multilingual pronunciation mining☆360Updated 2 weeks ago
- ☆81Updated last week
- Model for recasing and repunctuating ASR transcripts☆143Updated last year
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆57Updated 3 months ago
- universal syllabification algorithms☆46Updated 3 years ago
- Labeled data for homograph disambiguation☆63Updated 2 years ago
- now you can even use apertium from python☆35Updated last year
- Finite-state script normalization and processing utilities☆46Updated 2 weeks ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆90Updated last year
- British English pronunciation dictionary☆99Updated 8 years ago
- Python Finite-State Toolkit☆60Updated last month
- Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words☆99Updated 4 years ago
- A python true casing utility that restores case information for texts☆88Updated 3 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 11 months ago
- Self-contained Python package for OpenFst☆51Updated 2 years ago