Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recognition
☆20Mar 13, 2024Updated 2 years ago
Alternatives and similar repositories for ccc_mse_ser
Users that are interested in ccc_mse_ser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Oct 24, 2023Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆22Aug 9, 2023Updated 2 years ago
- ☆10Aug 16, 2024Updated last year
- MSP-Podcast Challenge Baseline Code☆31Jun 12, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 3 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 3 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- How to use our public wav2vec2 age and gender model☆55Sep 4, 2023Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆43Feb 16, 2022Updated 4 years ago
- A unified dataset of multilingual emotional human utterances☆29Jan 16, 2026Updated 5 months ago
- Trustworthy Speech Emotion Recognition☆13May 22, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…☆11Jun 19, 2024Updated last year
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆27Mar 11, 2022Updated 4 years ago
- ☆13Nov 25, 2023Updated 2 years ago
- VAD analysis of text using some affective lexicon (ANEW, SENTIWORDNET, and VADER)☆28Mar 17, 2022Updated 4 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"☆11Dec 16, 2020Updated 5 years ago
- Kalman Filter for ARX or NARX models' parameters estimation.☆15Dec 10, 2019Updated 6 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Mar 25, 2024Updated 2 years ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆34Nov 29, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- ☆16Apr 4, 2022Updated 4 years ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆47Nov 29, 2024Updated last year
- A C++ implementation of stft, melspectrogram and mel_to_stft☆11Jun 2, 2022Updated 4 years ago
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…☆49Apr 11, 2022Updated 4 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆18Aug 26, 2025Updated 9 months ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- This is a code repository for Relation Transformer Network☆13Nov 30, 2021Updated 4 years ago
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.☆13Nov 17, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Maltab code for extraction of Mel Frequency Cepstral Coefficients☆13Mar 18, 2016Updated 10 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated 2 years ago
- cross-modal model between audio(MFCC) and text(KoBERT)☆12Jan 14, 2021Updated 5 years ago
- Histopathologic Cancer Detection model based on Kaggle Challenge https://www.kaggle.com/c/histopathologic-cancer-detection (top 1%)☆11Feb 16, 2021Updated 5 years ago
- MFCC features + SVM for speech emotion classification☆16Oct 21, 2020Updated 5 years ago
- ☆12Nov 28, 2022Updated 3 years ago
- The electronic Holly Quran browser Elforkane☆11Nov 14, 2021Updated 4 years ago