Speech Emotion Recognition using transfer learning with wav2vec on IEMOCAP.
☆17Aug 8, 2021Updated 4 years ago
Alternatives and similar repositories for SER-wav2vec
Users that are interested in SER-wav2vec are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Oct 26, 2021Updated 4 years ago
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆96Jul 6, 2023Updated 2 years ago
- ECTD Document Classification using sklearn and Machine Learning☆10Nov 14, 2019Updated 6 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆83Mar 12, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- VQCPC-GAN: Variable-length Adversarial Audio Synthesis using Vector-Quantized Contrastive Predictive Coding☆14Apr 27, 2021Updated 5 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆413Sep 30, 2024Updated last year
- Predicting various emotion in human speech signal by detecting different speech components affected by human emotion.☆49Aug 2, 2024Updated last year
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆74Sep 26, 2022Updated 3 years ago
- Multi-Label Text Classification by fine-tuning BERT and XLNet and deployment using Flask☆15Apr 13, 2021Updated 5 years ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆14Nov 15, 2025Updated 5 months ago
- A lightweight library to compute Diarization Error Rate (DER).☆62Jan 14, 2026Updated 3 months ago
- The Hybrid Image Matching (HIM) method that combines the deep learning approach with the feature point matching to image classification.☆15Jan 9, 2019Updated 7 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Nov 27, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆54Apr 10, 2026Updated 3 weeks ago
- Extract frequency, power, width and dissonance of formants from wav files☆28Jun 3, 2022Updated 3 years ago
- ☆18Aug 29, 2022Updated 3 years ago
- Text Augmentation for Machine Learning tasks. Small data: How to grow your text dataset for classification ?☆22Jan 18, 2019Updated 7 years ago
- Domain Adaptation with Adversarial Training on Penultimate Activations (AAAI 2023)☆11Aug 1, 2023Updated 2 years ago
- Code for "Phoneme Segmentation Using Self-Supervised Speech Models", Strgar & Harwath, Proceedings of the IEEE Spoken Language Technology…☆55Nov 4, 2022Updated 3 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆50Apr 11, 2022Updated 4 years ago
- 解决Cursor在免费订阅期间出现以下提示的问题: Too many free trial accounts used on this machine. Please upgrade to pro. We have this limit in place to preve…☆10Dec 14, 2024Updated last year
- Library & parser for IBM Advanced Function Presentation (AFP) document/print stream format☆19Oct 27, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆10Aug 14, 2023Updated 2 years ago
- Gradual Source Domain Expansion for Unsupervised Domain Adaptation☆14Jun 10, 2025Updated 10 months ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- T5Voice is a lightweight PyTorch implementation of T5-based text-to-speech synthesis, supporting both streaming and non-streaming speech …☆28Nov 7, 2025Updated 5 months ago
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- ☆13Jan 11, 2024Updated 2 years ago
- Minimal module for computing audio spectrograms☆15Feb 28, 2019Updated 7 years ago
- ☆15Oct 15, 2020Updated 5 years ago
- Multimodal preprocessing on IEMOCAP dataset☆13Jun 8, 2018Updated 7 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech☆52Oct 8, 2021Updated 4 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"☆17Jul 8, 2020Updated 5 years ago
- Adaptive Global-Local Representation Learning and Selection for Cross-Domain Facial Expression Recognition (TMM 2024)☆17Aug 13, 2024Updated last year
- Pair Trading Analysis & Exercises Toolkit [Jupyter Notebook]☆12Nov 3, 2023Updated 2 years ago
- docker for HF wav2vec2-sprint☆13Mar 26, 2021Updated 5 years ago
- This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…☆12Jan 24, 2024Updated 2 years ago