SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆15Jan 23, 2024Updated 2 years ago
Alternatives and similar repositories for mmser
Users that are interested in mmser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data☆12May 16, 2022Updated 3 years ago
- Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…☆12Apr 28, 2025Updated last year
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Sep 29, 2023Updated 2 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Attention Based Multi-modal Emotion Recognition; Stanford Emotional Narratives Dataset☆17Aug 21, 2019Updated 6 years ago
- SpeechFormer++ in PyTorch☆50Jul 21, 2023Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆25Feb 17, 2023Updated 3 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆12Sep 30, 2021Updated 4 years ago
- PorSimplesSent - A Portuguese corpus of aligned sentences pairs to investigate sentence readability assessment☆14Jan 15, 2020Updated 6 years ago
- Models Supported: VGG11, VGG13, VGG16, VGG16_v2, VGG19 (1D and 2D versions with DEMO for Classification and Regression).☆17Nov 25, 2021Updated 4 years ago
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- ☆12Nov 25, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆18Aug 15, 2024Updated last year
- Companion repository for the EUSIPCO-24 accepted paper "Pre-Training Music Classification Models via Music Source Separation"☆12Aug 30, 2024Updated last year
- PySYCL is an open-source Python interface for SYCL.☆15Apr 18, 2025Updated last year
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Oct 24, 2023Updated 2 years ago
- Paper list of compositional zero-shot learning☆11Jul 5, 2022Updated 3 years ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- ☆10Aug 16, 2024Updated last year
- [CVPR 2025] Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation☆28Dec 9, 2025Updated 5 months ago
- Code for reproducing the experiments and results of "Multi-Source Contrastive Learning from Musical Audio", accepted for publication in S…☆17Nov 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- CycleTransGAN-EVC: A CycleGAN-based Emotional Voice Conversion Model with Transformer☆35Feb 4, 2025Updated last year
- Motion U-Net is multi-cue autoencoder deep architecture for robust moving object detection☆15Jul 19, 2023Updated 2 years ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus☆16Aug 4, 2023Updated 2 years ago
- O primeiro e mais querido podcast sobre ciência de dados no Brasil☆14Jun 21, 2023Updated 2 years ago
- Random collection of code snippets used to create Deep Fakes☆15Jun 30, 2019Updated 6 years ago
- ☆12Nov 10, 2024Updated last year
- ☆14Oct 10, 2024Updated last year
- Python3, NetworkX, Java, MLlib, Spark, Cassandra, Neo4j 3.0, Gephi, Docker☆11Jul 18, 2017Updated 8 years ago
- ☆20Jun 4, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for NAACL 2019 paper: "Bridging the Gap: Attending to Discontinuity in Identification of Multiword Expressions"☆17Nov 21, 2022Updated 3 years ago
- Datasets of Neuropsychological Language Tests in Brazilian Portuguese☆13Oct 14, 2025Updated 6 months ago
- A set of examples for basic audio data handling☆13Aug 15, 2020Updated 5 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆212Nov 10, 2022Updated 3 years ago
- ☆24Feb 3, 2026Updated 3 months ago
- ☆15Sep 2, 2023Updated 2 years ago
- 基于梅尔频谱的信号分类和识别☆23Mar 31, 2023Updated 3 years ago