glam-imperial / semantic_speech_emotion_recognitionView external linksLinks
This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103.02993.pdf
☆27Mar 18, 2021Updated 4 years ago
Alternatives and similar repositories for semantic_speech_emotion_recognition
Users that are interested in semantic_speech_emotion_recognition are comparing it to the libraries listed below
Sorting:
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee…☆11Oct 22, 2019Updated 6 years ago
- PyTorch implementation of the models described in the IEEE ICASSP 2022 paper "Is cross-attention preferable to self-attention for multi-m…☆63Mar 29, 2025Updated 10 months ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆46Nov 3, 2021Updated 4 years ago
- ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations☆21Sep 21, 2025Updated 4 months ago
- Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in…☆52Sep 14, 2021Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- ☆17Nov 30, 2021Updated 4 years ago
- Light-SERNet: A lightweight fully convolutional neural network for speech emotion recognition☆83May 25, 2022Updated 3 years ago
- Code for our paper "Efficient Speech Emotion Recognition Using Multi-Scale CNN and Attention" (ICASSP 2021, co-first authorship)☆28Jun 8, 2021Updated 4 years ago
- Code for Speech Emotion Recognition with Co-Attention based Multi-level Acoustic Information☆164Nov 27, 2023Updated 2 years ago
- [ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture☆12Jan 17, 2025Updated last year
- Official implementation for the paper Exploring Wav2vec 2.0 fine-tuning for improved speech emotion recognition☆153Oct 26, 2021Updated 4 years ago
- ☆41Nov 14, 2022Updated 3 years ago
- ☆18May 7, 2020Updated 5 years ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆17Aug 26, 2025Updated 5 months ago
- A CNN audio classifier via spectrogram images.☆10Jul 21, 2017Updated 8 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 7 months ago
- HGFM : A Hierarchical Grained and Feature Model for Acoustic Emotion Recgnition☆11Oct 30, 2020Updated 5 years ago
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆21Aug 2, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Feb 1, 2026Updated 2 weeks ago
- A Tensorflow implementation of Speech Emotion Recognition using Audio signals and Text Data☆12May 16, 2022Updated 3 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis☆27Mar 21, 2025Updated 10 months ago
- Code for the paper "JELLY: Joint Emotion Recognition and Context Reasoning with LLMs for Conversational Speech Synthesis"☆14Nov 5, 2024Updated last year
- Tensorflow Implementation for "Pre-trained Deep Convolution Neural Network Model With Attention for Speech Emotion Recognition"☆10Dec 19, 2021Updated 4 years ago
- Implementation of the paper: StyleBERT: Text-Audio Sentiment Analysis with Bi-directional Style Enhancement☆14Apr 10, 2023Updated 2 years ago
- Source code for paper Multi-Task Learning for Depression Detection in Dialogs (SIGDial 2022)☆12Jan 18, 2025Updated last year
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆14Jun 28, 2024Updated last year
- A simple command line tool to calculate WER for ASR.☆14Oct 14, 2024Updated last year
- End-to-end real-world polyphonic piano audio-to-score transcription with hierarchical decoding (IJCAI 2024)☆41Sep 17, 2024Updated last year
- Multimodal preprocessing on IEMOCAP dataset☆13Jun 8, 2018Updated 7 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Dec 20, 2020Updated 5 years ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- radiomixer☆14Feb 16, 2022Updated 4 years ago
- Toward Multi Modality Language Model - implementation of GPT-4o/Project Astra☆16Dec 10, 2024Updated last year
- SpeechPlus: Small LLM-Based Text-to-Speech Library 🚀☆20May 20, 2025Updated 8 months ago
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago