ECNU-Cross-Innovation-Lab / ENT
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆17Updated 6 months ago
Related projects ⓘ
Alternatives and complementary repositories for ENT
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆34Updated 10 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆27Updated 4 months ago
- SpeechFormer++ in PyTorch☆41Updated last year
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆50Updated 4 months ago
- ☆19Updated last year
- MSP-Podcast Challenge Baseline Code☆16Updated 4 months ago
- Trustworthy Speech Emotion Recognition☆13Updated last year
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆43Updated 2 years ago
- ☆41Updated last year
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆18Updated last year
- ☆9Updated 3 months ago
- ☆10Updated 11 months ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆47Updated 4 months ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated last year
- Multi-Task Speech classification of accent and gender of an english speaker on Mozilla's common voice dataset☆23Updated 2 months ago
- ☆59Updated last month
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆31Updated 2 years ago
- Official implement of SpeechFormer written in Python (PyTorch).☆75Updated last year
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆11Updated 5 months ago
- EMO-SUPERB submission☆28Updated 2 months ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆41Updated 3 years ago
- Csenet: Complex Squeeze-and-Excitation Network for Speech Depression Level Prediction (ICASSP 2022)☆14Updated 2 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆16Updated last year
- ☆12Updated 3 weeks ago
- Learning differentiable temporal resolution on time-series data.☆32Updated last year
- ☆26Updated last year
- ☆45Updated last year
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆38Updated last year
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆18Updated 10 months ago
- ☆17Updated 10 months ago