morrisalp / unikud
Hebrew nikud with transformers
☆21Updated 10 months ago
Alternatives and similar repositories for unikud
Users who are interested in unikud are comparing it to the libraries listed below
- Hebrew Diacritizer☆48Updated 2 months ago
- An NLP pipeline for Hebrew☆40Updated 6 months ago
- Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.☆14Updated 2 years ago
- Fast syllable estimation library based on pattern matching.☆40Updated 2 weeks ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Python Finite-State Toolkit☆60Updated last week
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆112Updated 7 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆32Updated 6 months ago
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict).☆66Updated last week
- A Typescript package for getting syllabic data about Hebrew text with niqqud.☆15Updated last month
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated 9 months ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆108Updated last month
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆56Updated 3 months ago
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆25Updated 3 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆182Updated 7 months ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆52Updated 2 years ago
- Transliteration for languages and dialects☆44Updated 3 years ago
- Hebrew word lists☆48Updated last year
- A comprehensive list of Hebrew NLP resources.☆283Updated 7 months ago
- Multilingual syllable annotation pipeline component for spacy☆39Updated 2 years ago
- Massively multilingual pronunciation mining☆359Updated 4 months ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆41Updated last year
- now you can even use apertium from python☆35Updated last year
- downloads and parses subtitle dataset from opensubtitles.org☆16Updated last year
- Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words☆99Updated 4 years ago
- 📝An easy-to-use package to restore punctuation of the text.☆119Updated 2 years ago
- Hebrew grapheme to phoneme (G2P)☆81Updated last week
- Targeted language identifier, based on FastText and Hunspell.☆38Updated 4 months ago
- ☆81Updated 3 weeks ago
- Prosodic: a metrical-phonological parser, written in Python. For English and Finnish, with flexible language support.☆291Updated 9 months ago