JabuMlDev/Speaker-VGG-CCT

Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transformers, 2022"

☆25

Alternatives and similar repositories for Speaker-VGG-CCT

Users who are interested in Speaker-VGG-CCT are comparing it to the repositories listed below.

xiaomi1024 / code_SAMS
☆13 · Jan 11, 2024 · Updated 2 years ago
Vincent-ZHQ / CA-MSER
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
☆163 · Nov 27, 2023 · Updated 2 years ago
HoseinAzad / Transformer-based-SER
Transformer-based model for Speech Emotion Recognition (SER), implemented in PyTorch
☆42 · Apr 12, 2024 · Updated 2 years ago
NariFan2002 / AttA-NET
ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION
☆14 · Sep 25, 2023 · Updated 2 years ago
ASolitaryMan / HFLEA
FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
☆23 · Dec 22, 2024 · Updated last year
HappyColor / SpeechFormer2
SpeechFormer++ in PyTorch
☆51 · Jul 21, 2023 · Updated 3 years ago
bagustris / ssl-ser
Repository for reproducing the results of the journal article "Self-supervised learning for Speech Emotion Recognition"
☆10 · Mar 15, 2023 · Updated 3 years ago
fotisdr / DNN-HA
DNN-based hearing aid for real-time sound processing
☆25 · May 25, 2023 · Updated 3 years ago
HappyColor / Vesper
A Compact and Effective Pretrained Model for Speech Emotion Recognition
☆55 · Apr 10, 2026 · Updated 3 months ago
ECNU-Cross-Innovation-Lab / ENT
[ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition
☆28 · Apr 11, 2024 · Updated 2 years ago
shaokai1209 / MDSA
[IEEE, TASLP, 2023] The code of the paper "Multi-Source Discriminant Subspace Alignment for Cross-Domain Speech Emotion Recognition".
☆19 · Sep 27, 2024 · Updated last year
X-LANCE / LSCodec-Inference
Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"
☆36 · Oct 23, 2025 · Updated 9 months ago
bubaimaji / cmt-mser
"MULTIMODAL EMOTION RECOGNITION BASED ON DEEP TEMPORAL FEATURES USING CROSS-MODAL TRANSFORMER AND SELF-ATTENTION" ICASSP'23
☆24 · Feb 26, 2023 · Updated 3 years ago
Sreyan88 / MMER
Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition
☆83 · Mar 12, 2024 · Updated 2 years ago
mmakiuchi / multimodal_emotion_recognition
Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in…
☆52 · Sep 14, 2021 · Updated 4 years ago
AryaAftab / LIGHT-SERNET
Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition
☆83 · May 25, 2022 · Updated 4 years ago
Emrys365 / torch_stft
PyTorch-based implementations of short-time Fourier transform
☆14 · Jul 21, 2025 · Updated last year
XingqunQi-lab / EmotionGestures
Data and PyTorch implementation of IEEE TMM "EmotionGesture: Audio-Driven Diverse Emotional Co-Speech 3D Gesture Generation"
☆31 · Mar 21, 2024 · Updated 2 years ago
AndreevP / speech_distances
Deep Speech Distances PyTorch
☆29 · Feb 21, 2022 · Updated 4 years ago
Jungjee / INTERSPEECH2023_T6
Advances in audio anti-spoofing and deepfake detection using graph neural networks and self-supervised learning
☆23 · Aug 20, 2023 · Updated 2 years ago
kaen2891 / adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…
☆19 · Dec 5, 2024 · Updated last year
vaibhavsundharam / Speech-Emotion-Analysis
Human emotions are one of the strongest ways of communication. Even if a person doesn’t understand a language, he or she can very well u…
☆25 · Jun 23, 2021 · Updated 5 years ago
ThomasFeher / oms
octave multi-channel signal processing
☆10 · May 11, 2014 · Updated 12 years ago
yahuiliu99 / PointConT
Official implementation of the paper "Point Cloud Classification Using Content-based Transformer via Clustering in Feature Space"
☆38 · Jun 1, 2023 · Updated 3 years ago
scutcsq / DWFormer
DWFormer: Dynamic Window Transformer for Speech Emotion Recognition (ICASSP 2023 Oral)
☆69 · Jul 8, 2024 · Updated 2 years ago
TheKangChen / crosstalk-cancellation
Binaural audio reproduction through loudspeakers. Also known as crosstalk cancellation.
☆12 · Sep 12, 2024 · Updated last year
PySYCL / PySYCL
PySYCL is an open-source Python interface for SYCL.
☆15 · Apr 18, 2025 · Updated last year
tomastokar / Additive-Margin-Softmax
PyTorch implementation of additive margin softmax loss
☆12 · Aug 5, 2021 · Updated 4 years ago
AnkushMalaker / pretrained-dcnn-attention-ser
TensorFlow implementation of "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"
☆10 · Dec 19, 2021 · Updated 4 years ago
AnkushMalaker / speech-emotion-recognition
Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa et al.
☆13 · Dec 18, 2021 · Updated 4 years ago
a791702141 / SSG
This project is the official implementation of "Self-Supervised Graph Neural Network for Multi-Source Domain Adaptation" in PyTorch, wh…
☆12 · Nov 4, 2022 · Updated 3 years ago
AliceOTHMANI / EmoAudioNet
Code for EmoAudioNet, a deep neural network for speech classification (published in ICPR 2020)
☆14 · Jul 13, 2020 · Updated 6 years ago
hagenw / amtoolbox
Mirror of the Auditory Modelling Toolbox http://amtoolbox.sourceforge.net/
☆11 · Jan 28, 2019 · Updated 7 years ago
egaudrain / vocoder
A versatile, easily configurable vocoder software in MATLAB, for research purposes
☆15 · Apr 9, 2021 · Updated 5 years ago
thevasudevgupta / gsoc-wav2vec2
GSoC'2021 | TensorFlow implementation of Wav2Vec2
☆91 · Jan 11, 2022 · Updated 4 years ago
trecpodcasts / podcast-audio-feature-extraction
Audio feature extraction and baseline search implementation for the Spotify Podcast Dataset.
☆12 · Sep 30, 2021 · Updated 4 years ago
aascode / Speech-Emotion-Recognition-2
Speech emotion recognition using LSTM, SVM and MLP
☆10 · Jul 1, 2019 · Updated 7 years ago
nixiieee / RAVEN
RAVEN: Recognition of Audio-Visual Emotional Nuances - a project on building a multimodal emotion recognition system
☆16 · Jun 24, 2025 · Updated last year
SubramaniKrishna / point-cloud-audio
Accompanying code for our paper "Point Cloud Audio Processing"
☆18 · Jul 1, 2021 · Updated 5 years ago