SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆15Jan 23, 2024Updated 2 years ago
Alternatives and similar repositories for mmser
Users that are interested in mmser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data☆12May 16, 2022Updated 3 years ago
- The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…☆13Dec 23, 2023Updated 2 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Sep 29, 2023Updated 2 years ago
- Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…☆25Jun 23, 2021Updated 4 years ago
- Respiratory Disorder Classification Based on Lung Auscultation sounds☆13Oct 22, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- SpeechFormer++ in PyTorch☆50Jul 21, 2023Updated 2 years ago
- A Multimodal Discord bot with machine learning functions, including LLM chat, Image generation, and Speech Generation capabilities☆12Jan 7, 2024Updated 2 years ago
- Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…☆25Feb 17, 2023Updated 3 years ago
- Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …☆41Mar 7, 2024Updated 2 years ago
- Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.☆12Sep 30, 2021Updated 4 years ago
- ☆11Jul 16, 2024Updated last year
- Example of application of genetic algorithm for evolution kart navigation.☆11Nov 21, 2019Updated 6 years ago
- ☆12Nov 25, 2023Updated 2 years ago
- ☆17Aug 15, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Companion repository for the EUSIPCO-24 accepted paper "Pre-Training Music Classification Models via Music Source Separation"☆12Aug 30, 2024Updated last year
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Oct 24, 2023Updated 2 years ago
- Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.☆13Apr 15, 2020Updated 5 years ago
- Paper list of compositional zero-shot learning☆11Jul 5, 2022Updated 3 years ago
- "From ViT Features to Training-free Video Object Segmentation via Streaming-data Mixture Models" [Uziel, Dinari, and Freifeld, NeurIPS 20…☆13Jan 16, 2024Updated 2 years ago
- ☆10Aug 16, 2024Updated last year
- ☆15Jun 8, 2023Updated 2 years ago
- PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m…☆64Mar 29, 2025Updated last year
- ☆13Oct 22, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Motion U-Net is multi-cue autoencoder deep architecture for robust moving object detection☆15Jul 19, 2023Updated 2 years ago
- A URDF Version of MANO hand.☆17Nov 25, 2022Updated 3 years ago
- O primeiro e mais querido podcast sobre ciência de dados no Brasil☆14Jun 21, 2023Updated 2 years ago
- [TOMM 2023] Emotion recognition methods through facial expression, speeches, audios, and multimodal data☆19Oct 25, 2023Updated 2 years ago
- decision support system for robotic sampling in precision agriculture☆14Dec 13, 2018Updated 7 years ago
- ☆16Jan 30, 2019Updated 7 years ago
- Random collection of code snippets used to create Deep Fakes☆15Jun 30, 2019Updated 6 years ago
- ☆12Nov 10, 2024Updated last year
- Tunable-Q Wavelet Transform and Resonance-based Signal Decomposition Toolkit☆33Mar 20, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆14Oct 10, 2024Updated last year
- A multimodal fine-grained correlation fusion network with attention mechanisms for visual-textual sentiment analysis☆10Jan 13, 2024Updated 2 years ago
- ☆20Jun 4, 2024Updated last year
- Datasets of Neuropsychological Language Tests in Brazilian Portuguese☆13Oct 14, 2025Updated 5 months ago
- A set of examples for basic audio data handling☆13Aug 15, 2020Updated 5 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆212Nov 10, 2022Updated 3 years ago
- ☆18Jan 24, 2022Updated 4 years ago