OmarMohammed88 / AR-Emotion-Recognition
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://journals.scholarpublishing.org/index.php/TMLAI/article/view/11039
☆13Updated 3 years ago
Alternatives and similar repositories for AR-Emotion-Recognition
Users who are interested in AR-Emotion-Recognition are comparing it to the repositories listed below
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆10Updated last year
- ☆49Updated last month
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆31Updated last year
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 3 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆265Updated 3 years ago
- This project performs speaker diarization for the Hindi language.☆50Updated 4 years ago
- ☆14Updated 2 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆19Updated 3 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆28Updated last year
- Finetune Wav2vec 2.0 For Speech Recognition☆140Updated 6 months ago
- ☆49Updated last year
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆11Updated 2 years ago
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆87Updated 2 years ago
- ☆43Updated 2 years ago
- PyTorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problems☆96Updated 2 months ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras.☆24Updated 4 years ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆16Updated 2 years ago
- This repository is the implementation of the paper, "Score-balanced Loss for Multi-aspect Pronunciation Assessment" (Interspeech 2023).☆20Updated last year
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆29Updated last year
- [ICASSP 2025] Official PyTorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners".☆28Updated last month
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like" Self Supervised Models to Improve Multimodal Speech Emotion R…☆120Updated 4 years ago
- ☆47Updated 2 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆10Updated 2 months ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆36Updated last year
- The official code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated last year
- A multimodal approach on emotion recognition using audio and text.☆182Updated 5 years ago
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer☆30Updated 4 years ago
- The implementation of a chunk-level attention-based temporal aggregation framework for sequence-to-one recognition tasks☆9Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆21Updated last year
- Alzheimer's Dementia Recognition through Spontaneous Speech: The ADReSSo Challenge☆11Updated 2 years ago