AndreaLombax / Speech_emotion_recognitionView external linksLinks
In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound files and stacking the resulting matrices in a one-dimensional array by taking the mean values along the time axis. Then this array is fed into a 1-D CNN model as input.
☆10Feb 27, 2022Updated 3 years ago
Alternatives and similar repositories for Speech_emotion_recognition
Users that are interested in Speech_emotion_recognition are comparing it to the libraries listed below
Sorting:
- MFCC features + SVM for speech emotion classification☆16Oct 21, 2020Updated 5 years ago
- A simple, lightweight framework for head pose estimation☆23Jan 25, 2024Updated 2 years ago
- An naive anomaly detection and data visualization tool for F1 on board telemetry data.☆15Jun 17, 2022Updated 3 years ago
- A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.☆20Dec 16, 2021Updated 4 years ago
- A comprehensive list of OpenCV algorithms and Clustering approaches made from scratch and with detailed explanations☆32Jan 23, 2024Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- XCORE-VOICE Solution☆17Jun 12, 2025Updated 8 months ago
- This project is to develop a named entity recognition (NER) model to identity medical entities such as diseases, symptoms, treatments in…☆12Oct 15, 2024Updated last year
- eCMU: An Efficient Phase-aware Framework for Music Source Separation with Conformer (IEEE RIVF23)☆10Oct 30, 2024Updated last year
- InfAnFace: Bridging the infant-adult domain gap in facial landmark estimation in the wild (ICPR2022)☆13Jan 9, 2026Updated last month
- Use `outlines` generators with Haystack.☆15Updated this week
- Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch☆42Apr 12, 2024Updated last year
- ☆10Oct 16, 2025Updated 4 months ago
- Blog of the LibreCV.org☆11May 17, 2021Updated 4 years ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2☆12Jan 30, 2020Updated 6 years ago
- Image Captioning using CNN and Transformer.☆54Nov 9, 2021Updated 4 years ago
- MergeNet-filter-ldr2hdr, detail in paper 《Reconstructing HDR Image from a Single Filtered LDR Image Base on a Deep HDR Merger Network》☆10Sep 11, 2019Updated 6 years ago
- ☆12Jan 28, 2022Updated 4 years ago
- An evaluation set for large-scale trained TTS models (Coming in Sep 2024)☆12Sep 2, 2024Updated last year
- ☆10Jul 25, 2023Updated 2 years ago
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- ICMEW:A_Generative_Compression_Framework_For_Low_Bandwidth_Video_Conference☆10Dec 7, 2021Updated 4 years ago
- ☆13Mar 7, 2023Updated 2 years ago
- ☆13Oct 29, 2021Updated 4 years ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Deep Learning Capstone Project. Live camera app that can interpret number strings in real-world images.☆14May 30, 2016Updated 9 years ago
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24)☆17Nov 20, 2023Updated 2 years ago
- A lightweight Python library for running TTS models with a unified API.☆21Feb 18, 2025Updated 11 months ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- ☆16Nov 25, 2022Updated 3 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- Diffusion Model for Voice Conversion☆69Mar 14, 2024Updated last year
- Sophia AI Assistant is a Python-based desktop AI that performs a variety of tasks, including answering questions, opening applications, b…☆28Oct 18, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and…☆17Sep 9, 2022Updated 3 years ago
- Calculate Spatial Information / Temporal Information according to ITU-T P.910☆17Dec 11, 2022Updated 3 years ago
- ☆19Jan 10, 2025Updated last year
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆18Aug 1, 2025Updated 6 months ago