morrisalp / taatiknetLinks
Character-level conversion between Hebrew text and Latin transliteration using deep learning - a demonstration of seq2seq training.
☆13Updated last year
Alternatives and similar repositories for taatiknet
Users that are interested in taatiknet are comparing it to the libraries listed below
Sorting:
- Hebrew grapheme to phoneme (g2p)☆17Updated this week
- A sentence segmentation library with wide language support optimized for speed and utility.☆65Updated 9 months ago
- Audiobook alignment for Indigenous languages☆40Updated 2 weeks ago
- In-browser OCR of Ancient Greek and Latin☆26Updated last month
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆34Updated this week
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- A set of pipelines for performing experiments on various NLP tasks with a focus on resource-poor/minority languages.☆36Updated this week
- ☆14Updated 2 years ago
- An NLP pipeline for Hebrew☆38Updated this week
- Cog is a tool for comparing languages using lexicostatistics and comparative linguistics techniques.☆23Updated last year
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet…☆43Updated 4 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 3 years ago
- OCR-D post-correction module based on weighted finite-state transducers☆11Updated last year
- Hebrew nikud with transfomers☆19Updated 3 months ago
- Extracts plain text, language identification and more metadata from WARC records☆22Updated 3 months ago
- Mission to create a Hebrew TTS model as powerful and user-friendly as WaveNet☆34Updated 5 months ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 4 years ago
- A character-wise tokenizer for morphologically rich languages☆27Updated 2 months ago
- An even smaller speech recognizer / force aligner☆33Updated 5 months ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last month
- Next-generation Punkt sentence boundary detection with zero dependencies☆17Updated 2 months ago
- METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)☆53Updated 2 years ago
- Ergonomic line-by-line transcription of scanned text.☆51Updated 4 years ago
- Loadable spellfix1 extension for sqlite as python package☆26Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 3 months ago
- WordNet-LMF formats☆21Updated 2 weeks ago
- The Unicode Cookbook for Linguists☆54Updated 4 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆28Updated 3 months ago