☆12Nov 25, 2023Updated 2 years ago
Alternatives and similar repositories for SpeechEmotionAVLearning
Users that are interested in SpeechEmotionAVLearning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Feb 27, 2026Updated 3 weeks ago
- ☆10Aug 16, 2024Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- MSP-Podcast Challenge Baseline Code☆31Jun 12, 2024Updated last year
- Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning☆22Jun 14, 2018Updated 7 years ago
- Interface for Controllable Expressive Talking Machine☆40Sep 20, 2025Updated 6 months ago
- ☆112Aug 10, 2022Updated 3 years ago
- Exercises for EOPL in Haskell☆13Mar 9, 2026Updated 2 weeks ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…☆12Jun 22, 2022Updated 3 years ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Sep 29, 2023Updated 2 years ago
- Official repository for U-SAM (Interspeech 2025)☆26Jun 3, 2025Updated 9 months ago
- Speech Emotion Recognition (SER) in Tensorflow using CNNs and CRNNs Based on Mel Spectrograms and Mel Frequency Cepstral Coefficients (MF…☆12Apr 28, 2025Updated 10 months ago
- Repository for my paper: Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition☆11Oct 24, 2023Updated 2 years ago
- Blind JPEG Artifacts Removal via Enhanced Swin-Conv-UNet☆10Mar 25, 2024Updated last year
- ☆10Jul 16, 2024Updated last year
- The code for Multi-Scale Receptive Field Graph Model for Emotion Recognition in Conversations☆11Jan 17, 2023Updated 3 years ago
- ☆17Aug 15, 2024Updated last year
- Pytorch code for One Step Diffusion-based Super-Resolution with Time-Aware Distillation☆15Apr 28, 2025Updated 10 months ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago
- Repository for my paper: Evaluation of Error and Correlation-Based Loss Functions For Multitask Learning Dimensional Speech Emotion Recog…☆20Mar 13, 2024Updated 2 years ago
- Repository for my paper: Dimensional Speech Emotion Recognition Using Acoustic Features and Word Embeddings using Multitask Learning☆17Aug 2, 2024Updated last year
- Implementation of "Look, Listen and Recognise:character-aware audio-visual subtitling"☆19Nov 3, 2025Updated 4 months ago
- AI-Generated Video Detection via Perceptual Straightening (NeurIPS2025)☆31Jan 2, 2026Updated 2 months ago
- ATTENTION AGGREGATION NETWORK FOR AUDIO-VISUAL EMOTION RECOGNITION☆13Sep 25, 2023Updated 2 years ago
- How to use our public wav2vec2 dimensional emotion model☆542May 22, 2023Updated 2 years ago
- This repo contains the code to reproduce the paper: "Enriched Music Representations with Multiple Cross-modal Contrastive Learning"☆15Jun 22, 2023Updated 2 years ago
- JEPAs for audio representation learning☆19Jun 22, 2025Updated 9 months ago
- Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"☆68Dec 13, 2021Updated 4 years ago
- Dimensional Emotion Detection from Categorical Emotion Annotation☆55Sep 23, 2021Updated 4 years ago
- EMO-SUPERB submission☆51Oct 13, 2025Updated 5 months ago
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆53Jun 29, 2024Updated last year
- 【Star us and watch this project grow! 🌱⭐️】A Spring Boot-based e-commerce microservices template with comprehensive setup guides. Ideal f…☆20Jun 28, 2025Updated 8 months ago
- AgentProg: Empowering Long-Horizon GUI Agents with Program-Guided Context Management☆25Mar 17, 2026Updated last week
- An Android app that listens to conversations and determines who was speaking at any point in the conversation - a task known as speech di…☆14Apr 12, 2021Updated 4 years ago
- A python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default☆11Mar 14, 2015Updated 11 years ago
- ☆17Nov 26, 2024Updated last year
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆56Oct 19, 2025Updated 5 months ago
- ☆13Aug 21, 2022Updated 3 years ago