mbzuai-nlp / ArTSTLinks
β60Updated 5 months ago
Alternatives and similar repositories for ArTST
Users that are interested in ArTST are comparing it to the libraries listed below
Sorting:
- The official implementation of CATT Arabic diacritization models.β56Updated 5 months ago
- ποΈ Arabic TTS models (Tacotron2, FastPitch)β133Updated 2 weeks ago
- Official Repository of the Deep Diacritization Paperβ16Updated 5 years ago
- β45Updated 3 years ago
- β49Updated 3 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translationβ151Updated last year
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorchβ14Updated 2 years ago
- β41Updated 3 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNsβ23Updated last year
- Finetune VITS and MMS using HuggingFace's toolsβ184Updated last year
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLMβ37Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://β¦β13Updated 3 years ago
- Convert Arabic diacritised text to a sequence of phonemes and create a pronunciation dictionary from them for alignment using HTKβ63Updated 8 years ago
- ποΈ Arabic TTS models (FastPitch, Mixer-TTS) in the ONNX format β Python package for offline speech synthesis ππ¦β32Updated 2 weeks ago
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.β86Updated last year
- Fine-Tune Whisper with Transformers and PEFTβ58Updated 2 years ago
- β156Updated 3 weeks ago
- Several deep learning models for restoring Arabic diacritics using Pytorch.β36Updated 3 years ago
- Finetune Wa2vec 2.0 For Speech Recognitionβ145Updated 10 months ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguisβ¦β15Updated 3 years ago
- ChunkFormer: Masked Chunking Conformer For Long-Form Speech Transcriptionβ73Updated last month
- Various speech datasets made available to the publicβ130Updated last year
- finetune llm part for spark-tts modelβ116Updated 9 months ago
- β185Updated last year
- NPTEL2020: Speech2Text dataset for Indian-English Accentβ79Updated 4 years ago
- A python package for deep multilingual punctuation prediction.β152Updated last year
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfacβ¦β125Updated last year
- Add n-gram and large language model (LLM) support to Whisper models.β40Updated 7 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student trainiβ¦β13Updated last year
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problemβ97Updated 7 months ago