☆41Nov 14, 2022Updated 3 years ago
Alternatives and similar repositories for Compact_SER
Users that are interested in Compact_SER are comparing it to the libraries listed below
Sorting:
- ☆28Nov 14, 2022Updated 3 years ago
- This repository contains the code for our ICASSP paper `Speech Emotion Recognition using Semantic Information` https://arxiv.org/pdf/2103…☆27Mar 18, 2021Updated 4 years ago
- TensorFlow implementation of "Attentive Modality Hopping for Speech Emotion Recognition," ICASSP-20☆33Aug 10, 2020Updated 5 years ago
- Speech emotion recognition using LSTM, SVM and MLP | 语音情感识别☆10Jul 1, 2019Updated 6 years ago
- A real-time voice conversion model based on VITS.☆14Aug 1, 2024Updated last year
- Repository for code and paper submitted for APSIPA 2019, Lanzhou, China☆21Aug 2, 2024Updated last year
- [ICTC'24] - "Voice-Based Age and Gender Recognition: A Comparative Study of LSTM, RezoNet and Hybrid CNNs-BiLSTM Architecture" by Nhut Mi…☆10Jan 16, 2025Updated last year
- Implementation of the multi-time-scale convolution layer used in the paper Multi-Time-Scale Convolution for Emotion Recognition from Spee…☆11Oct 22, 2019Updated 6 years ago
- Logistics Regression and Support Vector Machine using PyTorch☆12Feb 11, 2019Updated 7 years ago
- It is an implementation of research paper with title 'Multimodal deep networks for text and image-based document classification'☆13Jul 31, 2021Updated 4 years ago
- Implementation of the paper "Real-Time Emotion Recognition via Attention Gated Hierarchical Memory Network" in AAAI-2020.☆31Sep 2, 2022Updated 3 years ago
- ☆112Aug 10, 2022Updated 3 years ago
- Chinese BERT classification with tf2.0 and audio classification with mfcc☆14Dec 2, 2020Updated 5 years ago
- [ICASSP 2023] Official Tensorflow implementation of "Temporal Modeling Matters: A Novel Temporal Emotional Modeling Approach for Speech E…☆187May 15, 2024Updated last year
- ☆17Nov 30, 2021Updated 4 years ago
- User Emotion Recognition and Response Generation in Dialogue Text☆42Apr 16, 2021Updated 4 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆46Nov 3, 2021Updated 4 years ago
- ☆18May 7, 2020Updated 5 years ago
- Multimodal (text, acoustic, visual) Sentiment Analysis and Emotion Recognition on CMU-MOSEI dataset.☆29Nov 8, 2020Updated 5 years ago
- Multi-modal Speech Emotion Recogniton on IEMOCAP dataset☆95Jul 6, 2023Updated 2 years ago
- [INTERSPEECH 2023] Semi-supervised Learning for Speech Emotion Recognition On Federated Learning using Multiview Pseudo-Labeling☆25Sep 17, 2022Updated 3 years ago
- [ICASSP 2024] Emotion Neural Transducer for Fine-Grained Speech Emotion Recognition☆27Apr 11, 2024Updated last year
- Waste detection using YOLOv5 presents a promising approach for automating waste identification, classification, and localization. Its rea…☆14Nov 27, 2023Updated 2 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.☆47Apr 14, 2025Updated 10 months ago
- ☆10Jul 12, 2022Updated 3 years ago
- A list of pain recognition databases that are publicly available for research☆91May 12, 2021Updated 4 years ago
- ☆35Apr 14, 2023Updated 2 years ago
- ☆41Jan 13, 2022Updated 4 years ago
- Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition☆81Mar 12, 2024Updated last year
- Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)☆442Dec 21, 2023Updated 2 years ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago
- This is my speaker recognition implementation based on the x-vector system described in "X-Vectors: Robust DNN Embeddings for Speaker Rec…☆10Nov 3, 2022Updated 3 years ago
- WaveNet auto-ancoders for ZeroSpeech challenge 2020☆37Apr 7, 2022Updated 3 years ago
- This repository shows how to implement a basic model for multimodal entailment.☆10Aug 17, 2021Updated 4 years ago
- 23206-final-pose-estimation-for-swing-improvement created by GitHub Classroom☆18Dec 15, 2023Updated 2 years ago
- ☆10Jul 24, 2019Updated 6 years ago
- Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed fr…☆11May 2, 2021Updated 4 years ago