mbzuai-nlp / ArTSTLinks
☆49Updated last week
Alternatives and similar repositories for ArTST
Users that are interested in ArTST are comparing it to the libraries listed below
Sorting:
- The official implementation of CATT Arabic diacritization models.☆46Updated last month
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆61Updated 8 years ago
- ☆43Updated 2 years ago
- ☆42Updated 2 years ago
- ☆47Updated 2 years ago
- Country-level Arabic dialect identification (17 Arabic countries)☆47Updated 5 years ago
- TTS models for Arabic (Tacotron2, FastPitch)☆119Updated 8 months ago
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch☆14Updated 2 years ago
- Various speech datasets made available to the public☆123Updated 7 months ago
- Finetune VITS and MMS using HuggingFace's tools☆159Updated last year
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆147Updated last year
- Add n-gram and large language model (LLM) support to Whisper models.☆29Updated 2 months ago
- NPTEL2020: Speech2Text dataset for Indian-English Accent☆78Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated last year
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- A merged version of multiple open-source German speech datasets.☆31Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆82Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆13Updated 3 years ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆59Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer☆53Updated last month
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 2 years ago
- asr2k☆51Updated last year
- Universal multilingual automatic speech transcription into IPA☆65Updated 4 months ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆96Updated last month
- Pronounce Arabic words☆19Updated 6 years ago
- Advanced data structures for handling temporal segments with attached labels.☆114Updated 5 months ago