nhattruongpham/mmser

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nhattruongpham/mmser)

nhattruongpham / mmser

SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings

☆15

Alternatives and similar repositories for mmser

Users that are interested in mmser are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NeuroByte-Consulting / Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
View on GitHub
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…
☆12Apr 28, 2025Updated last year
LorenzoGianassi / Land-Diffuser
View on GitHub
The Land-Diffuser is a novel application of the Denoising Diffusion Probabilistic Model (DDPM) in the realm of 3D Talking Head generation…
☆13Dec 23, 2023Updated 2 years ago
HoseinAzad / Transformer-based-SER
View on GitHub
Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch
☆42Apr 12, 2024Updated 2 years ago
hwang9u / emocatcher
View on GitHub
[RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)
☆31Sep 29, 2023Updated 2 years ago
paladinarcher / padawan
View on GitHub
An application for developing developers.
☆14Dec 8, 2022Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
frankaging / Multimodal-Transformer
View on GitHub
Attention Based Multi-modal Emotion Recognition; Stanford Emotional Narratives Dataset
☆17Aug 21, 2019Updated 6 years ago
vaibhavsundharam / Speech-Emotion-Analysis
View on GitHub
Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…
☆25Jun 23, 2021Updated 5 years ago
adityahbapat / Respiratory-Disorder-Classification-Based-on-Lung-Auscultation-Sounds
View on GitHub
Respiratory Disorder Classification Based on Lung Auscultation sounds
☆13Oct 22, 2024Updated last year
HappyColor / SpeechFormer2
View on GitHub
SpeechFormer++ in PyTorch
☆51Jul 21, 2023Updated 3 years ago
Meatfucker / metatron2
View on GitHub
A Multimodal Discord bot with machine learning functions, including LLM chat, Image generation, and Speech Generation capabilities
☆12Jan 7, 2024Updated 2 years ago
PySYCL / PySYCL
View on GitHub
PySYCL is an open-source Python interface for SYCL.
☆15Apr 18, 2025Updated last year
JabuMlDev / Speaker-VGG-CCT
View on GitHub
Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…
☆25Feb 17, 2023Updated 3 years ago
jayanth-mkv / emotion2vec-speech-emotion-detection-api
View on GitHub
This API utilizes a pre-trained model for emotion recognition from audio files. It accepts audio files as input, processes them using the…
☆14Jul 19, 2026Updated last week
MengboLi / MS-SENet
View on GitHub
☆11Jul 16, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
surelybassy / SportStatsAnalysis
View on GitHub
Project exploring data collection, visualisation and analysis of Sports Statistics.
☆14Dec 17, 2020Updated 5 years ago
RauldeQueirozMendes / VSDataset
View on GitHub
Dataset for visual servoing and camera pose estimation. The images were obtained by a manipulator robot with an eye-in-hand camera in dif…
☆17Jun 21, 2021Updated 5 years ago
ShaheenPerveen / speech-emotion-recognition-iemocap
View on GitHub
Detect emotion from audio signals of IEMOCAP dataset using multi-modal approach. Utilized acoustic features, mel-spectrogram and text as …
☆41Mar 7, 2024Updated 2 years ago
bolajixi / Mulitimodal-Speech-Emotion-Recognition
View on GitHub
A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data
☆12May 16, 2022Updated 4 years ago
trecpodcasts / podcast-audio-feature-extraction
View on GitHub
Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.
☆12Sep 30, 2021Updated 4 years ago
stan-hua / CytoImageNet
View on GitHub
A large-scale pretraining dataset for bioimage transfer learning
☆18Jul 31, 2022Updated 3 years ago
sidleal / porsimplessent
View on GitHub
PorSimplesSent - A Portuguese corpus of aligned sentences pairs to investigate sentence readability assessment
☆13Jan 15, 2020Updated 6 years ago
Sakib1263 / VGG-1D-2D-Tensorflow-Keras
View on GitHub
Models Supported: VGG11, VGG13, VGG16, VGG16_v2, VGG19 (1D and 2D versions with DEMO for Classification and Regression).
☆17Nov 25, 2021Updated 4 years ago
Janie1996 / MSRFG
View on GitHub
The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations
☆11Jan 17, 2023Updated 3 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
eXascaleInfolab / fashion_nlp_v2
View on GitHub
FashionBrain D2.1: Named Entity Recognition and Linking Methods
☆11Jun 26, 2019Updated 7 years ago
kleberandrade / evolve-kart-unity
View on GitHub
Example of application of genetic algorithm for evolution kart navigation.
☆11Nov 21, 2019Updated 6 years ago
ETZET / SpeechEmotionAVLearning
View on GitHub
☆13Nov 25, 2023Updated 2 years ago
NLP-kr / tensorflow-ml-nlp-tf2-colab
View on GitHub
☆10Mar 6, 2022Updated 4 years ago
akshaypunwatkar / Sound_classification_urbansound8k
View on GitHub
Classification of Urban sounds using several classification methods, namely SVM, MLP and CNN using MFCC features.
☆13Apr 15, 2020Updated 6 years ago
bagustris / deep-mlp-ser
View on GitHub
Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition
☆11Oct 24, 2023Updated 2 years ago
uqzhichen / Awesome-compositional-zero-shot-learning
View on GitHub
Paper list of compositional zero-shot learning
☆11Jul 5, 2022Updated 4 years ago
cgaroufis / MSSPT
View on GitHub
Companion repository for the EUSIPCO-24 accepted paper "Pre-Training Music Classification Models via Music Source Separation"
☆12Aug 30, 2024Updated last year
NariFan2002 / AttA-NET
View on GitHub
ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION
☆14Sep 25, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
aws-cloud-clubs / 2024-aws-ml-camp-workshop-guide
View on GitHub
2024 AWS ML camp Workshop Studio Documentation (Translated in Korean)
☆17Jan 3, 2024Updated 2 years ago
Ahmed-Hereiz / Computer-Vision-Filters
View on GitHub
☆15Jun 8, 2023Updated 3 years ago
smartcameras / SelfCrossAttn
View on GitHub
PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m…
☆67Mar 29, 2025Updated last year
rsforbes / pro_sports_transactions
View on GitHub
A Python Library for Consuming Transactions from Pro Sports Transactions (https://www.prosportstransactions.com)
☆22Updated this week
BGU-CS-VIL / Training-Free-VOS
View on GitHub
"From ViT Features to Training-free Video Object Segmentation via Streaming-data Mixture Models" [Uziel, Dinari, and Freifeld, NeurIPS 20…
☆14Jan 16, 2024Updated 2 years ago
aziele / statistical-distance
View on GitHub
Measures of distance between two probability density functions
☆17Mar 19, 2023Updated 3 years ago
clemkoa / u-net
View on GitHub
Simple pytorch implementation of the u-net model for image segmentation
☆15Feb 21, 2024Updated 2 years ago