Identify the emotion of multiple speakers in an Audio Segment
☆179Feb 12, 2023Updated 3 years ago
Alternatives and similar repositories for MevonAI-Speech-Emotion-Recognition
Users that are interested in MevonAI-Speech-Emotion-Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last month
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- Building and training Speech Emotion Recognizer that predicts human emotions using Python, Sci-kit learn and Keras☆673Nov 3, 2023Updated 2 years ago
- Speech Emotion Recognition (SER) in real-time, using Deep Neural Networks (DNN) of Long Short Memory Term (LSTM).☆115Mar 6, 2022Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)☆1,404Feb 7, 2023Updated 3 years ago
- Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.☆48Aug 2, 2024Updated last year
- [ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture☆12Jan 17, 2025Updated last year
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆409Sep 30, 2024Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 2 years ago
- Understanding emotions from audio files using neural networks and multiple datasets.☆432Jul 1, 2023Updated 2 years ago
- Using Convolutional Neural Networks in speech emotion recognition on the RAVDESS Audio Dataset.☆146Apr 12, 2021Updated 4 years ago
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 4 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- Speaker independent emotion recognition☆330Jun 26, 2024Updated last year
- Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP) | 语音情感识别☆1,291Mar 25, 2023Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Dec 10, 2020Updated 5 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- ☆80Aug 8, 2025Updated 7 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- AI voicebox on Raspberry Pi☆12Jan 27, 2026Updated 2 months ago
- Project Made during Virtual Summer Internship under leadingindia.ai and BENNETT UNIVERSITY.☆98Feb 12, 2023Updated 3 years ago
- Speech Emotion Recognition☆28Jun 19, 2020Updated 5 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Sep 16, 2022Updated 3 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Oct 26, 2021Updated 4 years ago
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆446Dec 21, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆40Jan 14, 2022Updated 4 years ago
- feature extraction from speech signals☆395Jun 15, 2025Updated 9 months ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆81Mar 12, 2024Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Nov 1, 2023Updated 2 years ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion R…☆121Feb 26, 2021Updated 5 years ago
- ☆67Aug 16, 2023Updated 2 years ago