LetianLee/Speech-Emotion-Recognition

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/LetianLee/Speech-Emotion-Recognition)

LetianLee / Speech-Emotion-Recognition

An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning on the RAVDESS dataset.

☆34

Alternatives and similar repositories for Speech-Emotion-Recognition

Users that are interested in Speech-Emotion-Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Jason-Oleana / speech-emotion-classification
View on GitHub
MFCC features + SVM for speech emotion classification
☆16Oct 21, 2020Updated 5 years ago
gianscuri / Emotion-Recognition_SER-FER_RAVDESS
View on GitHub
Multi-modal Human Emotion Recognition of speech clips (audio + video) contained in RAVDESS dataset using a two stream architecture
☆32Mar 2, 2023Updated 3 years ago
HoseinAzad / Transformer-based-SER
View on GitHub
Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch
☆42Apr 12, 2024Updated 2 years ago
OmarMohammed88 / AR-Emotion-Recognition
View on GitHub
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…
☆16Feb 17, 2022Updated 4 years ago
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
tuncayka / speech_emotion
View on GitHub
The Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS)
☆19Dec 8, 2022Updated 3 years ago
OSU-slatelab / LibriStutter
View on GitHub
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11Mar 13, 2021Updated 5 years ago
AndreaLombax / Speech_emotion_recognition
View on GitHub
In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound fil…
☆10Feb 27, 2022Updated 4 years ago
amritkromana / disfluency_detection_from_audio
View on GitHub
☆35Aug 22, 2024Updated last year
akhil2495 / multi-modal-emotion-recognition
View on GitHub
A repository for emotion recognition from speech, text and mocap data from IEMOCAP dataset
☆13Dec 12, 2018Updated 7 years ago
kosuke-kitahara / xlsr-wav2vec2-phoneme-recognition
View on GitHub
☆27Mar 29, 2021Updated 5 years ago
piotrkawa / deepfake-whisper-features
View on GitHub
Implementation of the paper "Improved DeepFake Detection Using Whisper Features"
☆117Apr 9, 2025Updated last year
LorenzoGianassi / Land-Diffuser
View on GitHub
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13Dec 23, 2023Updated 2 years ago
trinhtuanvubk / Diff-VC
View on GitHub
Diffusion Model for Voice Conversion
☆72Mar 14, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
NeuroByte-Consulting / Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
View on GitHub
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…
☆12Apr 28, 2025Updated last year
WangHelin1997 / DuTa-VC
View on GitHub
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38Dec 5, 2023Updated 2 years ago
hirokisince1998 / jasj-bibtex
View on GitHub
日本音響学会誌用BibTeXスタイルファイル
☆11Jan 24, 2022Updated 4 years ago
WangHelin1997 / SpeechTasks
View on GitHub
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆83Jun 7, 2024Updated 2 years ago
Meatfucker / metatron2
View on GitHub
A Multimodal Discord bot with machine learning functions, including LLM chat, Image generation, and Speech Generation capabilities
☆12Jan 7, 2024Updated 2 years ago
rithiksachdev / PostASR-Correction-SLT2024
View on GitHub
☆18Jul 22, 2024Updated 2 years ago
habla-liaa / ser-with-w2v2
View on GitHub
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
☆140Jan 6, 2025Updated last year
b04901014 / FT-w2v2-ser
View on GitHub
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
☆153Oct 26, 2021Updated 4 years ago
Hypotheses-Paradise / UADF
View on GitHub
☆17May 5, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
mzarvandi / SER-wav2vec
View on GitHub
Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17Aug 8, 2021Updated 4 years ago
hwang9u / emocatcher
View on GitHub
[RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)
☆31Sep 29, 2023Updated 2 years ago
samsad35 / VQ-MAE-S-code
View on GitHub
[ICASSPW] A Vector Quantized Masked AutoEncoder for speech emotion recognition
☆30Mar 4, 2024Updated 2 years ago
WangHelin1997 / Automatic_Speech_Annotator
View on GitHub
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Jun 14, 2024Updated 2 years ago
LibreCV / blog
View on GitHub
Blog of the LibreCV.org
☆10May 17, 2021Updated 5 years ago
Data-Science-kosta / Speech-Emotion-Classification-with-PyTorch
View on GitHub
This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.
☆213Nov 10, 2022Updated 3 years ago
Opla / SmallData-Augmentation-MachineLearning
View on GitHub
Text Augmentation for Machine Learning tasks. Small data: How to grow your text dataset for classification ?
☆22Jan 18, 2019Updated 7 years ago
tira-io / tira
View on GitHub
The source code for the TIRA Shared Task Platform
☆19Jul 14, 2026Updated 2 weeks ago
odunola499 / f5-lora
View on GitHub
☆19Nov 18, 2025Updated 8 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
daanzu / kaldi_ag_training
View on GitHub
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…
☆21Jan 24, 2022Updated 4 years ago
PiotrSobczak / speech-emotion-recognition
View on GitHub
Multi-modal Speech Emotion Recogniton on IEMOCAP dataset
☆96Jul 6, 2023Updated 3 years ago
bagustris / deep-mlp-ser
View on GitHub
Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition
☆11Oct 24, 2023Updated 2 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
xiuwenz2 / SAPC-template
View on GitHub
☆17Mar 20, 2026Updated 4 months ago
ftshijt / Interspeech2024_DiscreteSpeechChallenge
View on GitHub
This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.
☆32Jan 26, 2024Updated 2 years ago
mansi-k / Stutter-Therapy
View on GitHub
Developed and trained Gated-CNN models to detect types of stutter in speech and SVM classifier to suggest new therapies to the user accor…
☆21Aug 20, 2021Updated 4 years ago