OmarMohammed88 / AR-Emotion-RecognitionLinks
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://journals.scholarpublishing.org/index.php/TMLAI/article/view/11039
☆13Updated 3 years ago
Alternatives and similar repositories for AR-Emotion-Recognition
Users that are interested in AR-Emotion-Recognition are comparing it to the libraries listed below
Sorting:
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆14Updated last year
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆43Updated last year
- Wav2Vec for speech recognition, classification, and audio classification☆269Updated 3 years ago
- ☆60Updated 5 months ago
- Finetune Wa2vec 2.0 For Speech Recognition☆145Updated 10 months ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆19Updated 3 years ago
- ☆14Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 3 years ago
- ☆45Updated 3 years ago
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆97Updated 7 months ago
- This project is about performing Speaker diarization for Hindi Language.☆58Updated 4 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆32Updated 2 years ago
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 3 years ago
- The offical code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Updated 2 years ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆17Updated 2 years ago
- Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.☆24Updated 5 years ago
- Speech to Text with self-supervised learning based on wav2vec 2.0 framework using Hugging Face's Transformer☆29Updated 4 years ago
- ☆16Updated 3 years ago
- Promting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Updated last year
- ☆27Updated 4 years ago
- ☆49Updated 2 years ago
- [Interspeech22]Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass…☆34Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆91Updated 5 years ago
- ☆49Updated 3 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆24Updated 2 years ago
- A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs☆23Updated last year
- Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.☆13Updated 4 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion R…☆119Updated 4 years ago
- The project is related to the development of labs for the ITMO Speaker Recognition Course.☆15Updated last month
- Time series course Fall 2019 project☆53Updated 5 years ago