kjy7567/speech_emotion_recognition_from_log_Mel_spectrogram_using_vertically_long_patch

speech emotion recognition from log mel spectrogram

☆31

Alternatives and similar repositories for speech_emotion_recognition_from_log_Mel_spectrogram_using_vertically_long_patch

Users who are interested in speech_emotion_recognition_from_log_Mel_spectrogram_using_vertically_long_patch are comparing it to the repositories listed below.

HappyColor / Vesper
A Compact and Effective Pretrained Model for Speech Emotion Recognition
☆54 · Apr 10, 2026 · Updated 3 months ago
ZenvilleErasmus / RAVDESS-emotions-speech-audio-only
1,440 audio files (.wav), i.e. speech files, from 24 actors that are categorized into 8 separate emotions.
☆15 · Feb 11, 2019 · Updated 7 years ago
liuhuadai / ViT-TTS
PyTorch Implementation of ViT-TTS (EMNLP'23)
☆11 · Oct 20, 2023 · Updated 2 years ago
NeuroByte-Consulting / Speech-Emotion-Recognition-in-Tensorflow-Using-CNNs
Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…
☆12 · Apr 28, 2025 · Updated last year
LuluW8071 / Automatic-Speech-Recognition-with-PyTorch
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡
☆11 · Jan 23, 2025 · Updated last year
jh-cha-prml / JELLY
Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"
☆14 · Nov 5, 2024 · Updated last year
Annafavaro / PARKCELEB
☆11 · Jun 13, 2026 · Updated last month
muramasa2 / paper_summary
☆13 · Jul 10, 2021 · Updated 5 years ago
mmmnhjgh / lark-smart-meeting-assistant
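Smart meeting assistant: automatically retrieves meeting minutes, extracts action items, creates tasks, and sends meeting summaries. An entry in the Feishu (Lark) CLI Creators Contest.
☆21 · Apr 14, 2026 · Updated 3 months ago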
shui-dun / multimodal_ad
☆11 · Jul 14, 2023 · Updated 3 years ago
YuqiZhang-Buaa / Mamba2MIL
☆11 · Sep 30, 2024 · Updated last year
mmakiuchi / multimodal_emotion_recognition
Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in…
☆52 · Sep 14, 2021 · Updated 4 years ago
adesgautam / clip-search
A search engine implementation using OpenAI's CLIP model
☆10 · Jun 20, 2021 · Updated 5 years ago
SteveKGYang / SCCL
PyTorch code for the TAC accepted paper: "Cluster-Level Contrastive Learning for Emotion Recognition in Conversations"
☆26 · Apr 16, 2023 · Updated 3 years ago
exeex / vocoder_eva
Used to evaluate a WaveNet vocoder by RMSE F0, MCD, RMSE AP...
☆15 · Jan 20, 2020 · Updated 6 years ago
bagustris / ssl-ser
Repository for reproducing the results in the journal paper "Self-supervised learning for Speech Emotion Recognition"
☆10 · Mar 15, 2023 · Updated 3 years ago
LIN-SHANG / InstructERC
The official realization of InstructERC
☆151 · May 25, 2025 · Updated last year
NUS-HPC-AI-Lab / DyVM
☆18 · Apr 8, 2025 · Updated last year
kyegomez / AudioMamba
Implementation of the paper: "Audio Mamba: Bidirectional State Space Model for Audio Representation Learning" in PyTorch
☆15 · Updated this week
ristea / septr
☆29 · Sep 29, 2022 · Updated 3 years ago
audeering / w2v2-how-to
How to use our public wav2vec2 dimensional emotion model
☆555 · May 22, 2023 · Updated 3 years ago
lovasoa / seamcarving
Seam carving implemented in Rust
☆12 · Apr 19, 2020 · Updated 6 years ago
lcn-kul / madress-2023
Source code for LCN submission for ADReSS-M challenge (formerly called MADReSS).
☆14 · Jun 1, 2023 · Updated 3 years ago
Vincent-ZHQ / CA-MSER
Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information
☆163 · Nov 27, 2023 · Updated 2 years ago
SmoothKen / knn-svc
kNN-SVC: Robust Zero-Shot Singing Voice Conversion with Additive Synthesis and Concatenation Smoothness Optimization
☆16 · Nov 7, 2025 · Updated 8 months ago
ekg / pca
PCA in Rust
☆16 · Jul 30, 2023 · Updated 2 years ago
kaen2891 / bts
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classificatio…
☆25 · Jul 10, 2025 · Updated last year
rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆14 · Nov 15, 2025 · Updated 8 months ago
pablovin / FaceChannel
The FaceChannel model for facial expression recognition.
☆20 · Jun 24, 2024 · Updated 2 years ago
sarulab-speech / Coco-Nut
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
☆21 · Jun 12, 2024 · Updated 2 years ago
ibliever / Cross-modal-information-fusion-for-voice-spoofing-detection
This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"
☆13 · Jun 5, 2023 · Updated 3 years ago
LiuYuML / NT-VOT211
[ACCV 2024 (Oral, Best Application Paper)] Official Implementation of NT-VOT211: A Large-Scale Benchmark for Night-time Visual Object Tra…
☆16 · Dec 30, 2025 · Updated 6 months ago
Mwxinnn / UniAS
The official repo for "[WACV2025] Towards Accurate Unified Anomaly Segmentation"
☆15 · Apr 14, 2025 · Updated last year
takamichi-lab / speech-audio-proccessing
"Speech and audio processing", graduate school lectures, Keio University, Japan
☆20 · Dec 13, 2025 · Updated 7 months ago
ASolitaryMan / HFLEA
FRAME-LEVEL EMOTIONAL STATE ALIGNMENT METHOD FOR SPEECH EMOTION RECOGNITION
☆23 · Dec 22, 2024 · Updated last year
zlab-princeton-internal / writing-guide
Paper writing guide for Zhuang Liu Lab @ Princeton University
☆16 · Jun 24, 2026 · Updated 3 weeks ago
Yuto-Matsunaga / Prompt_Tuning_for_Audio_Deepfake_Detection
☆13 · Nov 12, 2024 · Updated last year
PhucNDA / HA-RDet
Hybrid-Anchor Rotation Detector for Oriented Object Detection (ICCV'25)
☆18 · Aug 11, 2025 · Updated 11 months ago
ZhiqiWang12-hash / text_audio_classification
Chinese BERT classification with TF 2.0 and audio classification with MFCC
☆14 · Dec 2, 2020 · Updated 5 years ago