yousefkotp / Egyptian-Arabic-ASR-and-DiarizationLinks
The official submission from Speech Squad team for the MTC-AIC 2 competition of 2024 where an ASR model is developed tailored for the Egyptian dialect, utilizing the FastConformer architecture. Our four-stage training pipeline achieved a Mean Levenshtein Distance score of 9.58 on the test set.
☆11Updated 7 months ago
Alternatives and similar repositories for Egyptian-Arabic-ASR-and-Diarization
Users that are interested in Egyptian-Arabic-ASR-and-Diarization are comparing it to the libraries listed below
Sorting:
- TTS models for Arabic (Tacotron2, FastPitch)☆123Updated 11 months ago
- Arabic speech recognition, classification and text-to-speech.☆406Updated 2 years ago
- ☆58Updated 3 months ago
- ☆51Updated last year
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆12Updated 3 years ago
- ☆171Updated 10 months ago
- Arabic deep-learning based diacritization models (Shakkala, Shakkelha) ported to PyTorch☆14Updated 2 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆22Updated last year
- TTS for Arabic (FastPitch, Mixer-TTS) in the ONNX format☆31Updated 2 months ago
- Deep Visual Speech Recognition in arabic words☆16Updated 2 years ago
- Audio deepfake detection sytem on CNN☆62Updated 2 years ago
- ☆34Updated 8 months ago
- Finetune Wa2vec 2.0 For Speech Recognition☆140Updated 8 months ago
- The first Dialectal Arabic Code Switching - DACS corpus from broadcast speech. Annotated at the token-level, considering both the linguis…☆15Updated 3 years ago
- Fine-tune Bangla ASR model which was trained Bangla Mozilla Common Voice Dataset☆12Updated last year
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR☆62Updated 4 months ago
- Text to speech alignment using CTC forced alignment☆371Updated 2 months ago
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.☆345Updated 2 years ago
- Code-Switched translations with Large Language models☆22Updated 10 months ago
- Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) of Long Short Memory Term (LSTM).☆113Updated 3 years ago
- Developed and trained Gated-CNN models to detect types of stutter in speech and SVM classifier to suggest new therapies to the user accor…☆19Updated 4 years ago
- A morphosyntactic analyzer for the Arabic language.☆24Updated 5 years ago
- ☆24Updated last year
- This is a diacritization model for Arabic language. This model was built/trained using the Tashkeela: the Arabic diacritization corpus on…☆44Updated 2 years ago
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆14Updated last year
- Indic TTS for Indian Languages: This is a project on developing text-to-speech (TTS) synthesis systems for Indian languages, improving qu…☆45Updated last month
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer☆29Updated 4 years ago
- We'll look into audio categorization using deep learning principles like Artificial Neural Networks (ANN), 1D Convolutional Neural Networ…☆49Updated 3 years ago
- ☆42Updated 2 years ago
- A comprehensive list of Arabic NLP resources.☆35Updated last month