ETZET / SpeechEmotionAVLearning
☆11Updated last year
Alternatives and similar repositories for SpeechEmotionAVLearning:
Users that are interested in SpeechEmotionAVLearning are comparing it to the libraries listed below
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆32Updated 7 months ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆21Updated 10 months ago
- ☆19Updated last year
- MSP-Podcast Challenge Baseline Code☆20Updated 8 months ago
- Learning differentiable temporal resolution on time-series data.☆35Updated 2 years ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆22Updated 2 months ago
- ☆43Updated 2 years ago
- ☆63Updated 5 months ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆38Updated 2 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆41Updated 3 years ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆36Updated last year
- Official implement of SpeechFormer written in Python (PyTorch).☆77Updated last year
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆37Updated 4 years ago
- For students who would like to apply for RA, PhD, postdoc in audio research.☆24Updated 3 months ago
- ☆30Updated last year
- SpeechFormer++ in PyTorch☆47Updated last year
- ☆49Updated last year
- ☆104Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆18Updated last year
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆86Updated 3 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆24Updated 10 months ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆16Updated last year
- A Diffusion Probabilistic Model for Target Sound Extraction☆36Updated 4 months ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆39Updated last year
- COG-MHEAR Audio-Visual Speech Enhancement Challenge☆34Updated 10 months ago
- ☆32Updated 2 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago