Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17Aug 8, 2021Updated 4 years ago
Alternatives and similar repositories for SER-wav2vec
Users that are interested in SER-wav2vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆16Feb 17, 2022Updated 4 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆152Oct 26, 2021Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆139Jan 6, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆96Jul 6, 2023Updated 2 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆83Mar 12, 2024Updated 2 years ago
- VQCPC-GAN: Variable-length Adversarial Audio Synthesis using Vector-Quantized Contrastive Predictive Coding☆14Apr 27, 2021Updated 5 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆414Sep 30, 2024Updated last year
- Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.☆49Aug 2, 2024Updated last year
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Nov 15, 2025Updated 6 months ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Jan 14, 2026Updated 4 months ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Nov 27, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆54Apr 10, 2026Updated last month
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- mmyun☆17Aug 4, 2025Updated 9 months ago
- ☆18Aug 29, 2022Updated 3 years ago
- Text Augmentation for Machine Learning tasks. Small data: How to grow your text dataset for classification ?☆22Jan 18, 2019Updated 7 years ago
- Real-time version of sound_classification_demo in OpenVINO toolkit. Captures audio from microphone, do classification, and display result…☆12Jul 28, 2021Updated 4 years ago
- Domain Adaptation with Adversarial Training on Penultimate Activations (AAAI 2023)☆11Aug 1, 2023Updated 2 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- Reimplementation of speech decoding 2022 paper by MetaAI☆14Oct 17, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 解决Cursor在免费订阅期间出现以下提示的问题: Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to preve…☆10Dec 14, 2024Updated last year
- Gradual Source Domain Expansion for Unsupervised Domain Adaptation☆14Jun 10, 2025Updated 11 months ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 6 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- ☆29Dec 23, 2025Updated 5 months ago
- ☆13Jan 11, 2024Updated 2 years ago
- Minimal module for computing audio spectrograms☆15Feb 28, 2019Updated 7 years ago
- ☆15Oct 15, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Multimodal preprocessing on IEMOCAP dataset☆13Jun 8, 2018Updated 7 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆52Oct 8, 2021Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆17Jul 8, 2020Updated 5 years ago
- Transfer learning exploration of dc_tts text-to-speech model☆21Mar 5, 2019Updated 7 years ago
- Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition (TMM 2024)☆17Aug 13, 2024Updated last year
- docker for HF wav2vec2-sprint☆13Mar 26, 2021Updated 5 years ago