OmarMohammed88 / AR-Emotion-RecognitionLinks
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://journals.scholarpublishing.org/index.php/TMLAI/article/view/11039
☆12Updated 3 years ago
Alternatives and similar repositories for AR-Emotion-Recognition
Users that are interested in AR-Emotion-Recognition are comparing it to the libraries listed below
Sorting:
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆32Updated 3 years ago
- ☆54Updated 2 months ago
- ☆14Updated 2 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆138Updated 7 months ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆19Updated 3 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆29Updated last year
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆31Updated last year
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆17Updated 2 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆267Updated 3 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆12Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆22Updated last year
- ☆43Updated 2 years ago
- ☆48Updated last year
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆97Updated 3 months ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆31Updated last year
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 2 years ago
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.☆13Updated 3 years ago
- This project is about performing Speaker diarization for Hindi Language.☆51Updated 4 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated 4 months ago
- A unified dataset of multilingual emotional human utterances☆28Updated 3 years ago
- ☆17Updated 2 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆42Updated 2 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Updated 4 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆17Updated 2 years ago
- A multimodal approach on emotion recognition using audio and text.☆184Updated 5 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆207Updated 2 years ago
- Hosts text-to-speech corpus and speech synthesizers for African languages.☆17Updated 2 years ago
- This repository contains code for fine-tuning the Whisper speech-to-text model.☆15Updated 3 weeks ago
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated 2 years ago
- [Computer Speech & Language] A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages☆13Updated last year