HappyColor/SpeechFormer

HappyColor / SpeechFormer

Official implementation of SpeechFormer, written in Python (PyTorch).

☆78

Alternatives and similar repositories for SpeechFormer

Users who are interested in SpeechFormer are comparing it to the repositories listed below.

HappyColor / SpeechFormer2
SpeechFormer++ in PyTorch
☆50 · Jul 21, 2023 · Updated 3 years ago
HappyColor / Vesper
A Compact and Effective Pretrained Model for Speech Emotion Recognition
☆54 · Apr 10, 2026 · Updated 3 months ago
ECNU-Cross-Innovation-Lab / ENT
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28 · Apr 11, 2024 · Updated 2 years ago
ECNU-Cross-Innovation-Lab / ShiftSER
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
☆39 · Dec 18, 2023 · Updated 2 years ago
speechandlanguageprocessing / ICASSP2022-Depression
Automatic Depression Detection: A GRU/BiLSTM-based Model and an Emotional Audio-Textual Corpus
☆216 · Jul 10, 2023 · Updated 3 years ago
mortezaro / ad-recognition-from-speech
☆12 · Apr 18, 2021 · Updated 5 years ago
MengboLi / MS-SENet
☆11 · Jul 16, 2024 · Updated 2 years ago
ASolitaryMan / HFLEA
Frame-Level Emotional State Alignment Method for Speech Emotion Recognition
☆23 · Dec 22, 2024 · Updated last year
chuyuanli / MTL4Depr
Source code for the paper "Multi-Task Learning for Depression Detection in Dialogs" (SIGDial 2022)
☆12 · Jan 18, 2025 · Updated last year
scutcsq / DWFormer
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)
☆69 · Jul 8, 2024 · Updated 2 years ago
HappyColor / DST
Deformable Speech Transformer (DST)
☆35 · Aug 8, 2024 · Updated last year
AryaAftab / LIGHT-SERNET
Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
☆83 · May 25, 2022 · Updated 4 years ago
Jiaxin-Ye / TIM-Net_SER
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…
☆191 · May 15, 2024 · Updated 2 years ago
usc-sail / peft-ser
[ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…
☆60 · Jul 1, 2024 · Updated 2 years ago
AliceOTHMANI / EmoAudioNet
Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)
☆14 · Jul 13, 2020 · Updated 6 years ago
adbailey1 / DepAudioNet_reproduction
Reproduction of DepAudioNet by Ma et al. {DepAudioNet: An Efficient Deep Model for Audio based Depression Classification,(https://dl.acm.…
☆83 · Sep 9, 2021 · Updated 4 years ago
lixiangucas01 / GLAM
This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…
☆49 · Apr 11, 2022 · Updated 4 years ago
Vincent-ZHQ / CA-MSER
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
☆163 · Nov 27, 2023 · Updated 2 years ago
mmakiuchi / multimodal_emotion_recognition
Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in…
☆52 · Sep 14, 2021 · Updated 4 years ago
TideDancer / interspeech21_emotion
☆111 · Aug 10, 2022 · Updated 3 years ago
aris-ai / Audio-and-text-based-emotion-recognition
A multimodal approach to emotion recognition using audio and text.
☆187 · Jun 15, 2020 · Updated 6 years ago
Strong-AI-Lab / emotion
Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,
☆57 · Oct 19, 2025 · Updated 9 months ago
glam-imperial / semantic_speech_emotion_recognition
This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…
☆27 · Mar 18, 2021 · Updated 5 years ago
xirongc / watermark-audio-diffusion
☆12 · Nov 21, 2023 · Updated 2 years ago
nextDeve / Depression-detect-ResNet
depression-detect: predicting depression from AVEC2014 using ResNet18.
☆60 · Jun 17, 2024 · Updated 2 years ago
fchest / CSENet
CSENet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)
☆14 · Jun 23, 2022 · Updated 4 years ago
biggytruck / SpeechSplit2
Official implementation of SpeechSplit2
☆135 · Oct 22, 2022 · Updated 3 years ago
Takaaki-Saeki / ssl_speech_restoration_v2
☆17 · Dec 18, 2023 · Updated 2 years ago
b04901014 / FT-w2v2-ser
Official implementation for the paper "Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition"
☆152 · Oct 26, 2021 · Updated 4 years ago
NeuroByte-Consulting / Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…
☆12 · Apr 28, 2025 · Updated last year
vocaliodmiku / wav2vec2mdd-Text
☆19 · Jun 28, 2022 · Updated 4 years ago
voidful / MMLM
Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra
☆16 · Dec 10, 2024 · Updated last year
ZhiqiWang12-hash / text_audio_classification
Chinese BERT classification with TF 2.0 and audio classification with MFCC
☆14 · Dec 2, 2020 · Updated 5 years ago
usc-sail / trust-ser
Trustworthy Speech Emotion Recognition
☆13 · May 22, 2023 · Updated 3 years ago
kingformatty / NUSD
☆25 · Nov 3, 2024 · Updated last year
Sreyan88 / MMER
Code for the Interspeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition
☆83 · Mar 12, 2024 · Updated 2 years ago
v-manhlt3 / m-LTM-Audio-Text-Retrieval
☆13 · Jan 5, 2025 · Updated last year
Janie1996 / MSRFG
Code for "Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations"
☆11 · Jan 17, 2023 · Updated 3 years ago
thuhcsi / SECap
☆181 · Jul 9, 2024 · Updated 2 years ago