mbzuai-nlp / ArTST
☆59Updated 4 months ago
Alternatives and similar repositories for ArTST
Users who are interested in ArTST are comparing it to the libraries listed below.
- Official Repository of the Deep Diacritization Paper☆16Updated 4 years ago
- The official implementation of CATT Arabic diacritization models.☆54Updated 4 months ago
- TTS models for Arabic (Tacotron2, FastPitch)☆128Updated last year
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch☆14Updated 2 years ago
- ☆48Updated 2 years ago
- Finetune VITS and MMS using HuggingFace's tools☆176Updated last year
- TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format☆31Updated 2 months ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTK☆63Updated 8 years ago
- Country-level Arabic dialect identification (17 Arabic countries)☆51Updated 5 years ago
- ☆42Updated 2 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Updated last year
- Several deep learning models for restoring Arabic diacritics using PyTorch.☆35Updated 3 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆23Updated last year
- ☆44Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆12Updated 3 years ago
- An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.☆85Updated last year
- Code, models, and data for "Advancements in Arabic Grammatical Error Detection and Correction: An Empirical Investigation". EMNLP 2023.☆17Updated last year
- Fine-Tune Whisper with Transformers and PEFT☆57Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM☆38Updated 2 years ago
- ☆177Updated 11 months ago
- Fine-tune the LLM part of the Spark-TTS model☆110Updated 7 months ago
- Whisper-Flamingo [Interspeech 2024] and mWhisper-Flamingo [IEEE SPL 2025] for Audio-Visual Speech Recognition and Translation☆192Updated 3 months ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Updated 3 years ago
- Benchmark Arabic text diacritization dataset☆76Updated 6 years ago
- Code-Switched translations with Large Language models☆24Updated 11 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated 2 years ago
- Add n-gram and large language model (LLM) support to Whisper models.☆35Updated 6 months ago
- Finetune Wav2vec 2.0 For Speech Recognition☆142Updated 9 months ago
- ☆151Updated 3 weeks ago
- PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection☆97Updated 5 months ago