habla-liaa / ser-with-w2v2View external linksLinks
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
☆140Jan 6, 2025Updated last year
Alternatives and similar repositories for ser-with-w2v2
Users that are interested in ser-with-w2v2 are comparing it to the libraries listed below
Sorting:
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Oct 26, 2021Updated 4 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion R…☆119Feb 26, 2021Updated 4 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆274Apr 2, 2022Updated 3 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 4 years ago
- ☆112Aug 10, 2022Updated 3 years ago
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆442Dec 21, 2023Updated 2 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆399Sep 30, 2024Updated last year
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Dec 20, 2020Updated 5 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆81Mar 12, 2024Updated last year
- Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.☆17Aug 8, 2021Updated 4 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 2 years ago
- ☆10Sep 6, 2020Updated 5 years ago
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆21Aug 2, 2024Updated last year
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Apr 11, 2022Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80May 20, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- LoRA-based phoneme/prosody control for LLM-based TTS with no G2P - Lightweight adapter for edit and control the target language's phoneme…☆22Aug 14, 2025Updated 6 months ago
- speech emotion recognition using a convolutional recurrent networks based on IEMOCAP☆406Jul 8, 2019Updated 6 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆187May 15, 2024Updated last year
- [ACII 2023] PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Spe…☆60Jul 1, 2024Updated last year
- ☆42Oct 13, 2020Updated 5 years ago
- DWFormer: Dynamic Window Transformer for Speech Emotion Recognition(ICASSP 2023 Oral)☆69Jul 8, 2024Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆83May 25, 2022Updated 3 years ago
- PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m…☆63Mar 29, 2025Updated 10 months ago
- Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)☆28Jun 8, 2021Updated 4 years ago
- AuxFormer: Robust Approach to Audiovisual Emotion Recognition☆14Mar 14, 2023Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Nov 27, 2023Updated 2 years ago
- A tool to collect/validate audio recordings from workers on Amazon Mechanical Turk. Written in Python/Flask. (originally hosted on github…☆14Dec 19, 2022Updated 3 years ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆27Apr 11, 2024Updated last year
- Official implement of SpeechFormer written in Python (PyTorch).☆78Apr 1, 2023Updated 2 years ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆40Dec 18, 2023Updated 2 years ago
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Sep 4, 2023Updated 2 years ago
- How to use our public wav2vec2 dimensional emotion model☆539May 22, 2023Updated 2 years ago
- 1st Place Public Leaderboard Solution for ERC2019☆70Jan 13, 2020Updated 6 years ago
- ☆17Nov 30, 2021Updated 4 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- ☆16Oct 7, 2022Updated 3 years ago