nawarhalabi / Arabic-PhonetiserLinks
Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK
☆63Updated 8 years ago
Alternatives and similar repositories for Arabic-Phonetiser
Users that are interested in Arabic-Phonetiser are comparing it to the libraries listed below
Sorting:
- Automatic Dialect Detection Repository☆39Updated 3 years ago
- Pronounce Arabic words☆19Updated 6 years ago
- This repository☆30Updated 3 years ago
- Country-level Arabic dialect identification (17 Arabic countries)☆53Updated 6 years ago
- End to end Arabic TTS system based on tacotron☆125Updated last year
- Benchmark Arabic text diacritization dataset☆77Updated 6 years ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Updated 3 years ago
- Arabic Phonetic Dictionary Generator Tool for Automatic Speech Recognition Applications☆12Updated 4 years ago
- End to End Dialect Identification using Convolutional Neural Network☆53Updated 6 years ago
- A python tool that converts Arabic diacritised text to a sequence of phonemes and creates a pronunciation dictionary. This code is based …☆16Updated 8 years ago
- Official Repository of the Deep Diacritization Paper☆17Updated 5 years ago
- Grapheme To Phoneme☆74Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆55Updated 3 years ago
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch☆14Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- asr2k☆52Updated last year
- Linguistic processing for Common Voice☆58Updated 2 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆33Updated 5 years ago
- A Docker image for a relatively light-weight full Arabic speech synthesis system☆31Updated 4 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- Code for AccentDB.☆23Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆106Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 11 months ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆40Updated 3 years ago
- A phoneme-allophone database for many languages☆53Updated 5 years ago
- scipts for working with open.bible data☆26Updated 3 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46Updated 2 years ago
- 24-hour Automatic Speech Recognition☆27Updated 4 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆143Updated last year