OmarMohammed88 / AR-Emotion-Recognition
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://journals.scholarpublishing.org/index.php/TMLAI/article/view/11039
☆13 Updated 3 years ago
Alternatives and similar repositories for AR-Emotion-Recognition
Users who are interested in AR-Emotion-Recognition are comparing it to the libraries listed below.
- ☆60 Updated 5 months ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning … ☆33 Updated 3 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024 ☆13 Updated last year
- Finetune Wav2vec 2.0 For Speech Recognition ☆142 Updated 10 months ago
- ☆44 Updated 2 years ago
- Official Repository of the Deep Diacritization Paper ☆16 Updated 4 years ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation ☆17 Updated 2 years ago
- ☆10 Updated 2 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%) ☆31 Updated 2 years ago
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer ☆29 Updated 4 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder ☆19 Updated 3 years ago
- ☆48 Updated 2 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs ☆23 Updated last year
- Wav2Vec for speech recognition, classification, and audio classification ☆269 Updated 3 years ago
- ☆14 Updated 2 years ago
- PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problems ☆97 Updated 6 months ago
- Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR ☆65 Updated 6 months ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large… ☆42 Updated last year
- This repository contains code for fine-tuning the Whisper speech-to-text model. ☆19 Updated last month
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features ☆10 Updated 3 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆37 Updated 2 years ago
- ☆69 Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆43 Updated 2 years ago
- A unified dataset of multilingual emotional human utterances ☆28 Updated 3 years ago
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP. ☆16 Updated 4 years ago
- An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper ☆21 Updated 3 years ago
- ☆49 Updated 3 years ago
- Word-level language identification for Bangla-English code-mixed social media data, using a BiLSTM with subword embeddings. ☆10 Updated 2 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages ☆14 Updated last year
- A multimodal approach on emotion recognition using audio and text. ☆185 Updated 5 years ago