mush42 / libtashkeel
Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models
☆19Updated last month
Related projects: ⓘ
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch☆12Updated last year
- Several deep learning models for restoring Arabic diacritics using Pytorch.☆32Updated 2 years ago
- Neural Arabic text diacritization☆82Updated last year
- TTS models for Arabic (Tacotron2, FastPitch)☆81Updated 4 months ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆58Updated 7 years ago
- A comprehensive list of Arabic NLP resources.☆12Updated last year
- A Docker image for a relatively light-weight full Arabic speech synthesis system☆29Updated 3 years ago
- Benchmark Arabic text diacritization dataset☆70Updated 5 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆14Updated 2 years ago
- Pronounce Arabic words☆17Updated 5 years ago
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆34Updated last year
- Arabic Transliteration in Python☆33Updated 11 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆15Updated 2 months ago
- ☆40Updated last year
- Spell check for Arabic text using python☆14Updated 5 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Updated 7 years ago
- AsoSoft Speech Corpus for Central-Kurdish Text-To-Speech☆12Updated 2 years ago
- Qalsadi: Arabic mophological analyzer Library for python.☆34Updated 2 weeks ago
- End to end Arabic TTS system based on tacotron☆116Updated 5 months ago
- Includes an Arabic diacritizer, IPA converter, and arabic-only filter☆14Updated 7 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- Official Repository of the Deep Diacritization Paper☆16Updated 3 years ago
- The official implementation of CATT Arabic diacritization models.☆30Updated last month
- Python library used for Arabic NLP to process, prepare and clean the Arabic text☆16Updated 2 months ago
- Arabic Tokenization Library. It provides many tokenization algorithms.☆85Updated 8 months ago
- The largest public catalogue for Arabic NLP and speech datasets. There are +500 datasets annotated with more than 25 attributes.☆146Updated 2 months ago
- Arabic TTS ( الناطق العربي )☆28Updated 5 years ago
- تفريغ النصوص وإنشاء ملفات SRT و VTT باستخدام نماذج Whisper وتقنية wit.ai.☆98Updated this week
- ☆20Updated 3 months ago
- Extract plain text from Arabic Wikipedia dumps.☆13Updated 10 years ago