bagustris / ccc_mse_serView external linksLinks
Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recognition
☆20Mar 13, 2024Updated last year
Alternatives and similar repositories for ccc_mse_ser
Users that are interested in ccc_mse_ser are comparing it to the libraries listed below
Sorting:
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Oct 24, 2023Updated 2 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆18Aug 9, 2023Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- ☆10Aug 16, 2024Updated last year
- MSP-Podcast Challenge Baseline Code☆30Jun 12, 2024Updated last year
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 5, 2025Updated last year
- Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)☆12Mar 12, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Mar 28, 2023Updated 2 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- How to use our public wav2vec2 age and gender model☆53Sep 4, 2023Updated 2 years ago
- Workflow for forced alignment between languages☆23Jan 13, 2026Updated last month
- A unified dataset of multilingual emotional human utterances☆29Jan 16, 2026Updated 3 weeks ago
- VAD analysis of text using some affective lexicon (ANEW, SENTIWORDNET, and VADER)☆28Mar 17, 2022Updated 3 years ago
- Collection of scripts from mHuBERT-147.☆32Nov 19, 2024Updated last year
- Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET☆27Mar 11, 2022Updated 3 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆31Apr 29, 2022Updated 3 years ago
- Training material on writing machine learning code with PyTorch by ICCS☆39Sep 11, 2025Updated 5 months ago
- FG2021: Cross Attentional AV Fusion for Dimensional Emotion Recognition☆33Nov 29, 2024Updated last year
- VS Code Extension for Multipass☆10Sep 25, 2024Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆12Apr 15, 2025Updated 9 months ago
- This is "ready from box" face recognition app, based on Mediapipe, dlib and face_recognition modules.☆11Dec 31, 2023Updated 2 years ago
- Punch Out Model Synthesis - a program for constraint based tiling generation☆18Feb 1, 2026Updated 2 weeks ago
- SocksSharp provides support for Socks4/4a/5 proxy servers to HttpClient☆12Feb 3, 2021Updated 5 years ago
- ☆10Dec 17, 2020Updated 5 years ago
- The electronic Holly Quran browser Elforkane☆11Nov 14, 2021Updated 4 years ago
- PASE: Phonologically Anchored Speech Enhancer☆37Dec 10, 2025Updated 2 months ago
- Package containing the tools necessary for decomposing a speech signal into its modulated components (also known as AM-FM decomposition).…☆92May 23, 2025Updated 8 months ago
- ☆46Nov 2, 2025Updated 3 months ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION☆46May 12, 2023Updated 2 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆43Feb 16, 2022Updated 3 years ago
- IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"☆45Nov 29, 2024Updated last year
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated last year
- ☆12Aug 5, 2022Updated 3 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- A C++ implementation of stft, melspectrogram and mel_to_stft☆10Jun 2, 2022Updated 3 years ago