Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recognition
☆20 · Mar 13, 2024 · Updated 2 years ago
Alternatives and similar repositories for ccc_mse_ser
Users who are interested in ccc_mse_ser are comparing it to the libraries listed below.
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition ☆11 · Oct 24, 2023 · Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning ☆17 · Aug 2, 2024 · Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition ☆22 · Aug 9, 2023 · Updated 2 years ago
- MSP-Podcast Challenge Baseline Code ☆31 · Jun 12, 2024 · Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆15 · Feb 28, 2026 · Updated 2 months ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 · Mar 28, 2023 · Updated 3 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th… ☆11 · Feb 23, 2024 · Updated 2 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 · Oct 19, 2022 · Updated 3 years ago
- How to use our public wav2vec2 age and gender model ☆55 · Sep 4, 2023 · Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS" ☆23 · Mar 6, 2023 · Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC). ☆43 · Feb 16, 2022 · Updated 4 years ago
- A unified dataset of multilingual emotional human utterances ☆28 · Jan 16, 2026 · Updated 3 months ago
- Trustworthy Speech Emotion Recognition ☆13 · May 22, 2023 · Updated 2 years ago
- Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp… ☆11 · Jun 19, 2024 · Updated last year
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET ☆27 · Mar 11, 2022 · Updated 4 years ago
- ☆12 · Nov 25, 2023 · Updated 2 years ago
- Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation" ☆11 · Dec 16, 2020 · Updated 5 years ago
- Kalman Filter for ARX or NARX models' parameters estimation. ☆15 · Dec 10, 2019 · Updated 6 years ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In… ☆45 · Mar 25, 2024 · Updated 2 years ago
- ☆10 · Dec 17, 2020 · Updated 5 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss ☆11 · Mar 1, 2021 · Updated 5 years ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition ☆33 · Nov 29, 2024 · Updated last year
- ☆16 · Apr 4, 2022 · Updated 4 years ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention" ☆47 · Nov 29, 2024 · Updated last year
- This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published… ☆50 · Apr 11, 2022 · Updated 4 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging" ☆31 · Apr 29, 2022 · Updated 4 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting ☆17 · Aug 26, 2025 · Updated 8 months ago
- This is a code repository for Relation Transformer Network ☆13 · Nov 30, 2021 · Updated 4 years ago
- Mask Attention Networks: Rethinking and Strengthen Transformer in NAACL2021 ☆14 · Jun 3, 2021 · Updated 4 years ago
- Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020. ☆13 · Nov 17, 2020 · Updated 5 years ago
- MATLAB code for extraction of Mel Frequency Cepstral Coefficients ☆13 · Mar 18, 2016 · Updated 10 years ago
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022) ☆12 · Mar 12, 2024 · Updated 2 years ago
- Cross-modal model between audio (MFCC) and text (KoBERT) ☆12 · Jan 14, 2021 · Updated 5 years ago
- Histopathologic Cancer Detection model based on Kaggle Challenge https://www.kaggle.com/c/histopathologic-cancer-detection (top 1%) ☆11 · Feb 16, 2021 · Updated 5 years ago
- VS Code Extension for Multipass ☆10 · Sep 25, 2024 · Updated last year
- MFCC features + SVM for speech emotion classification ☆16 · Oct 21, 2020 · Updated 5 years ago
- ☆12 · Nov 28, 2022 · Updated 3 years ago
- 【Star us and watch this project grow! 🌱⭐️】A Spring Boot-based e-commerce microservices template with comprehensive setup guides. Ideal f… ☆21 · Apr 7, 2026 · Updated 3 weeks ago
- ☆12 · Aug 5, 2022 · Updated 3 years ago