bagustris/ccc_mse_ser

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bagustris/ccc_mse_ser)

bagustris / ccc_mse_ser

Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recognition

☆20

Alternatives and similar repositories for ccc_mse_ser

Users that are interested in ccc_mse_ser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bagustris / deep-mlp-ser
View on GitHub
Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition
☆11Oct 24, 2023Updated 2 years ago
bagustris / dimensional-ser
View on GitHub
Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning
☆17Aug 2, 2024Updated last year
msplabresearch / MSP-Podcast_Challenge
View on GitHub
MSP-Podcast Challenge Baseline Code
☆31Jun 12, 2024Updated 2 years ago
jayaneetha / emoDARTS
View on GitHub
☆10Aug 16, 2024Updated last year
bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
liyunlongaaa / AD-TUNING
View on GitHub
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…
☆11Feb 23, 2024Updated 2 years ago
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
RicherMans / UIT_Mobile
View on GitHub
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24Mar 6, 2023Updated 3 years ago
audeering / w2v2-age-gender-how-to
View on GitHub
How to use our public wav2vec2 age and gender model
☆55Sep 4, 2023Updated 2 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
michen00 / unified_multilingual_dataset_of_emotional_human_utterances
View on GitHub
A unified dataset of multilingual emotional human utterances
☆31Jan 16, 2026Updated 6 months ago
razvan404 / multimodal-speech-emotion-recognition
View on GitHub
Multimodal SER Model meant to be trained on recognising emotions from speech (text + acoustic data). Fine-tuned the DeBERTaV3 model, resp…
☆11Jun 19, 2024Updated 2 years ago
usc-sail / trust-ser
View on GitHub
Trustworthy Speech Emotion Recognition
☆13May 22, 2023Updated 3 years ago
flaviorainhoavila / IEMOCAPspeechEmotionRecognition
View on GitHub
Automatic speech emotion recognition based on transfer learning from spectrograms using ResNET
☆27Mar 11, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
utter-project / mHuBERT-147-scripts
View on GitHub
Collection of scripts from mHuBERT-147.
☆35Nov 19, 2024Updated last year
bagustris / text-vad
View on GitHub
VAD analysis of text using some affective lexicon (ANEW, SENTIWORDNET, and VADER)
☆28Mar 17, 2022Updated 4 years ago
ZhecanJamesWang / GLAT_SGG
View on GitHub
Code for GLAT (Global Local Transformer), ECCV 2020 "Learning Visual Commonsense for Robust Scene Graph Generation"
☆11Dec 16, 2020Updated 5 years ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
AutuanLiu / Kalman-Filter
View on GitHub
Kalman Filter for ARX or NARX models' parameters estimation.
☆15Dec 10, 2019Updated 6 years ago
ddlBoJack / MT4SSL
View on GitHub
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…
☆45Mar 25, 2024Updated 2 years ago
XinJiang1994 / HFmaml
View on GitHub
☆10Dec 17, 2020Updated 5 years ago
praveena2j / Joint-Cross-Attention-for-Audio-Visual-Fusion
View on GitHub
IEEE T-BIOM : "Audio-Visual Fusion for Emotion Recognition in the Valence-Arousal Space Using Joint Cross-Attention"
☆48Nov 29, 2024Updated last year
zzpDapeng / Transformer-Transducer
View on GitHub
A streamable speech recognition model with transformer encoders and RNN-T loss
☆11Mar 1, 2021Updated 5 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
View on GitHub
☆16Apr 4, 2022Updated 4 years ago
Dev-Bobbie / Color-Similarity-Calculation
View on GitHub
【算法】通过图像颜色计算图像的相似度
☆11Sep 16, 2020Updated 5 years ago
lixiangucas01 / GLAM
View on GitHub
This is the official code for paper "Speech Emotion Recognition with Global-Aware Fusion on Multi-scale Feature Representation" published…
☆49Apr 11, 2022Updated 4 years ago
RicherMans / PSL
View on GitHub
Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"
☆31Apr 29, 2022Updated 4 years ago
rajatkoner08 / rtn
View on GitHub
This is a code repository for Relation Transformer Network
☆13Nov 30, 2021Updated 4 years ago
IIGROUP / PUM
View on GitHub
[CVPR 2021] Pytorch implementation for Probabilistic Modeling of Semantic Ambiguity for Scene Graph Generation
☆19May 7, 2021Updated 5 years ago
wxjiao / Pre-CODE
View on GitHub
Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.
☆13Nov 17, 2020Updated 5 years ago
LibertFan / MAN
View on GitHub
Mask Attention Networks: Rethinking and Strengthen Transformer in NAACL2021
☆14Jun 3, 2021Updated 5 years ago
yashbhalgat / Emotion-from-speech-MFCC
View on GitHub
Maltab code for extraction of Mel Frequency Cepstral Coefficients
☆13Mar 18, 2016Updated 10 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
juice500ml / xlm_to_xlsr
View on GitHub
Official implementation of the paper "Distilling a Pretrained Language Model to a Multilingual ASR Model" (Interspeech 2022)
☆12Mar 12, 2024Updated 2 years ago
ivanpanshin / hist_cancer
View on GitHub
Histopathologic Cancer Detection model based on Kaggle Challenge https://www.kaggle.com/c/histopathologic-cancer-detection (top 1%)
☆11Feb 16, 2021Updated 5 years ago
Donghwa-KIM / audiotext-transformer
View on GitHub
cross-modal model between audio(MFCC) and text(KoBERT)
☆12Jan 14, 2021Updated 5 years ago
Jason-Oleana / speech-emotion-classification
View on GitHub
MFCC features + SVM for speech emotion classification
☆16Oct 21, 2020Updated 5 years ago
ChanMeng666 / douyin-mall-java-template
View on GitHub
【Star us and watch this project grow! 🌱⭐️】A Spring Boot-based e-commerce microservices template with comprehensive setup guides. Ideal f…
☆24Jun 16, 2026Updated last month
haoyanbin918 / Attention-in-Attention
View on GitHub
☆12Aug 5, 2022Updated 3 years ago
torchopenl3 / torchopenl3
View on GitHub
☆20Aug 26, 2022Updated 3 years ago