morrisalp / taatiknetLinks
Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.
☆14Updated 2 years ago
Alternatives and similar repositories for taatiknet
Users that are interested in taatiknet are comparing it to the libraries listed below
Sorting:
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Hebrew grapheme to phoneme (G2P)☆85Updated last month
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆34Updated 7 months ago
- An NLP pipeline for Hebrew☆40Updated 7 months ago
- Translation demonstrator☆37Updated 5 years ago
- Audiobook alignment for Indigenous languages☆45Updated 3 weeks ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 11 months ago
- A sentence segmentation library with wide language support optimized for speed and utility.☆84Updated last week
- Bilingual sentence similarity classifier using Tensorflow☆24Updated 6 years ago
- Hebrew Diacritizer☆48Updated 3 months ago
- Finite-state script normalization and processing utilities☆46Updated 2 weeks ago
- An advanced, extensible web front-end for the Manatee-open corpus search engine☆78Updated this week
- A free & open tool for transcribing audio interviews with offline ASR support☆25Updated 2 years ago
- Extracts plain text, language identification and more metadata from WARC records☆23Updated 3 months ago
- Aksharamukha Python Library☆56Updated 11 months ago
- phone inventory library☆17Updated 2 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Updated 9 months ago
- Transliteration data and models☆56Updated 9 years ago
- Jason Riggle's chart of phonological features in JSON format + extras☆54Updated last year
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.☆37Updated this week
- Evaluation of STT models for german language☆15Updated 4 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated this week
- scipts for working with open.bible data☆26Updated 4 years ago
- Speech recognition module for Python, supporting several engines and APIs, online and offline.☆13Updated 3 years ago
- An even smaller speech recognizer / force aligner☆37Updated last year
- 📈 A forced aligner intended for synchronization of narrated text☆102Updated 5 months ago
- UDAR Does Accented Russian: A finite-state morphological analyzer of Russian that handles stressed wordforms.☆29Updated 8 months ago
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 6 years ago
- A database of number names for 186 languages, locales, and scripts☆67Updated 2 years ago
- Curated corpus of parallel data derived from versions of the Bible provided by eBible.org.☆80Updated 8 months ago