morrisalp / unikud
Hebrew nikud with transfomers
☆16Updated last week
Alternatives and similar repositories for unikud:
Users that are interested in unikud are comparing it to the libraries listed below
- AlephBertGimmel - Modern Hebrew pretrained BERT model with a 128K token vocabulary.☆22Updated 2 years ago
- Hebrew Diacritizer☆35Updated 5 months ago
- An NLP pipeline for Hebrew☆36Updated 10 months ago
- ivrit.ai codebase☆29Updated this week
- Fast syllable estimation library based on pattern matching.☆37Updated last month
- ☆13Updated 6 years ago
- Modified version of RusStress (https://github.com/MashaPo/russtress) — python package for placing stress in Russian text using RNN (BiLST…☆31Updated 6 months ago
- ☆33Updated 8 months ago
- Hebrew word lists☆42Updated 3 months ago
- This is an open-source effort for making Hebrew properly searchable by various IR software libraries, while maintaining decent recall, pr…☆102Updated 2 years ago
- Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.☆21Updated 2 years ago
- ☆9Updated 3 months ago
- ☆49Updated 2 years ago
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA)☆86Updated last year
- OCTRA is a web-application for the orthographic transcription of audio files.☆37Updated this week
- Wrapper for pydub AudioSegment objects☆96Updated 2 years ago
- ☆33Updated last year
- Data and code for grapheme-to-phoneme transducers in lots of languages☆132Updated 10 months ago
- Dictionary of pairs of Korean word and IPA crawled from Wiktionary (Korean edition)☆20Updated last year
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆82Updated 9 months ago
- Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignme…☆56Updated 4 years ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 5 years ago
- SubER - Subtitle Edit Rate☆22Updated 6 months ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆147Updated last month
- downloads and parses subtitle dataset from opensubtitles.org☆15Updated 10 months ago
- The official implementation of "A Language Modeling Approach to Diacritic-Free Hebrew TTS"☆82Updated 7 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆24Updated last week
- Get phonetic spellings and syllable counts for any english word. Works with made-up and non-dictionary words☆93Updated 3 years ago
- Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆80Updated last year
- A python library for real-time audio time-scale modification procedures☆88Updated 7 years ago