mush42 / hareef
State-of-the-art models for diacritics restoration for the Arabic language
☆15Updated 9 months ago
Alternatives and similar repositories for hareef
Users who are interested in hareef are comparing it to the repositories listed below
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models☆40Updated 2 months ago
- This is the official repository for the HUI-Audio-Corpus-German. The corresponding paper is in the process of publication. With the repo…☆33Updated 2 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆45Updated 2 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languages☆140Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆41Updated 2 weeks ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆63Updated 8 years ago
- Universal multilingual automatic speech transcription into IPA☆72Updated 9 months ago
- Linguistic processing for Common Voice☆58Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆172Updated 2 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆119Updated 2 years ago
- Country-level Arabic dialect identification (17 Arabic countries)☆51Updated 5 years ago
- Word Error Rate Estimation☆15Updated 5 years ago
- A toolkit to calculate speech audio quality. Not affiliated with the original authors☆63Updated last year
- Colab notebooks for Next-gen Kaldi☆30Updated last month
- Keyword spotting and forced alignment in any language☆79Updated 3 months ago
- 🎙️ Arabic TTS models (Tacotron2, FastPitch)☆130Updated last week
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆88Updated 5 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆77Updated 5 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 4 years ago
- The VoxTube dataset official repository☆71Updated last year
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 8 months ago
- 🎙️ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format — Python package for offline speech synthesis 🚀📦☆31Updated 3 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆153Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆28Updated last month
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆100Updated 8 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year