scutcsq/DWFormer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/scutcsq/DWFormer)

scutcsq / DWFormer

DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)

☆69

Alternatives and similar repositories for DWFormer

Users that are interested in DWFormer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ECNU-Cross-Innovation-Lab / ShiftSER
View on GitHub
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
☆39Dec 18, 2023Updated 2 years ago
ECNU-Cross-Innovation-Lab / ENT
View on GitHub
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28Apr 11, 2024Updated 2 years ago
HappyColor / DST
View on GitHub
Deformable Speech Transformer (DST)
☆35Aug 8, 2024Updated last year
Jiaxin-Ye / TIM-Net_SER
View on GitHub
[ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…
☆191May 15, 2024Updated 2 years ago
AryaAftab / LIGHT-SERNET
View on GitHub
Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
☆83May 25, 2022Updated 4 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
ASolitaryMan / HFLEA
View on GitHub
FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
☆23Dec 22, 2024Updated last year
HappyColor / Vesper
View on GitHub
A Compact and Effective Pretrained Model for Speech Emotion Recognition
☆55Apr 10, 2026Updated 3 months ago
Vincent-ZHQ / CA-MSER
View on GitHub
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
☆163Nov 27, 2023Updated 2 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
lixiangucas01 / GLAM
View on GitHub
This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…
☆49Apr 11, 2022Updated 4 years ago
Sreyan88 / MMER
View on GitHub
Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition
☆83Mar 12, 2024Updated 2 years ago
HappyColor / SpeechFormer2
View on GitHub
SpeechFormer++ in PyTorch
☆51Jul 21, 2023Updated 3 years ago
bubaimaji / cmt-mser
View on GitHub
"MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23
☆24Feb 26, 2023Updated 3 years ago
usc-sail / trust-ser
View on GitHub
Trustworthy Speech Emotion Recognition
☆13May 22, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
BenoitWang / Speech_Emotion_Diarization
View on GitHub
☆71Sep 13, 2024Updated last year
HappyColor / SpeechFormer
View on GitHub
Official implement of SpeechFormer written in Python (PyTorch).
☆78Apr 1, 2023Updated 3 years ago
jh-cha-prml / JELLY
View on GitHub
Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"
☆14Nov 5, 2024Updated last year
b04901014 / FT-w2v2-ser
View on GitHub
Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition
☆152Oct 26, 2021Updated 4 years ago
zxzhao0 / C2SER
View on GitHub
We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…
☆49Mar 3, 2025Updated last year
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
zyh9929 / RL-EMO
View on GitHub
☆15Sep 2, 2023Updated 2 years ago
suhitaghosh10 / emo-stargan
View on GitHub
Implementation of Emo-StarGAN
☆48Dec 19, 2023Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
diaoenmao / Speech-Emotion-Recognition-with-Dual-Sequence-LSTM-Architecture
View on GitHub
[ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture
☆12Jan 17, 2025Updated last year
slSeanWU / beats-conformer-bart-audio-captioner
View on GitHub
PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv…
☆41Jan 6, 2024Updated 2 years ago
lessonxmk / Optimized_attention_for_SER
View on GitHub
☆41Oct 13, 2020Updated 5 years ago
Fsoft-AIC / ConxGNN
View on GitHub
[ICASSP 2025] Effective Context Modeling Framework for Emotion Recognition in Conversations
☆15Mar 14, 2025Updated last year
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
julianyulu / icassp2021-mscnn-spu
View on GitHub
Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)
☆28Jun 8, 2021Updated 5 years ago
declare-lab / speech-adapters
View on GitHub
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech und…
☆43Mar 12, 2023Updated 3 years ago
AnkushMalaker / pretrained-dcnn-attention-ser
View on GitHub
Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"
☆10Dec 19, 2021Updated 4 years ago
wonjune-kang / expressive-speech-retrieval
View on GitHub
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15Aug 18, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
razvan404 / multimodal-speech-emotion-recognition
View on GitHub
Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…
☆11Jun 19, 2024Updated 2 years ago
ddlBoJack / emotion2vec
View on GitHub
[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training fo…
☆1,163Dec 23, 2024Updated last year
EIHW / EmoNet
View on GitHub
☆29Mar 8, 2022Updated 4 years ago
zerohd4869 / MM-DFN
View on GitHub
Source code for ICASSP 2022 paper "MM-DFN: Multimodal Dynamic Fusion Network For Emotion Recognition in Conversations".
☆94Apr 21, 2023Updated 3 years ago
habla-liaa / ser-with-w2v2
View on GitHub
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
☆140Jan 6, 2025Updated last year
JabuMlDev / Speaker-VGG-CCT
View on GitHub
Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…
☆25Feb 17, 2023Updated 3 years ago
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year