morrisalp / unikud
Hebrew nikud with transfomers
☆15Updated 2 years ago
Alternatives and similar repositories for unikud:
Users that are interested in unikud are comparing it to the libraries listed below
- Hebrew Diacritizer☆32Updated 4 months ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 5 years ago
- ☆30Updated 7 months ago
- Python transliteration library (mostly from non-latin scripts, such as Arabic, Japanese, etc.)☆20Updated 6 years ago
- An NLP pipeline for Hebrew☆36Updated 9 months ago
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆22Updated 2 years ago
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.☆21Updated 2 years ago
- Read, write, and manipulate Praat TextGrid files with Python☆126Updated last year
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆20Updated last year
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆14Updated 2 years ago
- IPA Pronunciation Dictionaries in DSL format☆39Updated 8 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆30Updated 5 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆43Updated 4 years ago
- A python package for deep multilingual punctuation prediction.☆111Updated 4 months ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 3 years ago
- This is an implementation of the audio source separation model as well as the evaluation metrics proposed in the paper "Weakly Informed A…☆9Updated 5 years ago
- Python Finite-State Toolkit☆47Updated last week
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆82Updated 8 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆144Updated this week
- A field-tested Hebrew tokenizer for dirty texts (ben-yehuda project, bible, cc100, mc4, opensubs, oscar, twitter) focused on multi-word e…☆22Updated 2 years ago
- Grapheme To Phoneme☆70Updated 5 months ago
- The CMU Pronouncing Dictionary converted to IPA☆78Updated 5 years ago
- ☆11Updated 2 years ago
- Transliteration for languages and dialects☆42Updated 2 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆104Updated 2 months ago
- ☆16Updated 5 years ago
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.☆31Updated 11 months ago
- ☆49Updated 2 years ago
- ☆22Updated 2 years ago