In this work is proposed a speech emotion recognition model based on the extraction of four different features got from RAVDESS sound files and stacking the resulting matrices in a one-dimensional array by taking the mean values along the time axis. Then this array is fed into a 1-D CNN model as input.
☆10Feb 27, 2022Updated 4 years ago
Alternatives and similar repositories for Speech_emotion_recognition
Users that are interested in Speech_emotion_recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- MFCC features + SVM for speech emotion classification☆16Oct 21, 2020Updated 5 years ago
- A simple, lightweight framework for head pose estimation☆24Jan 25, 2024Updated 2 years ago
- A tensorflow/keras implementation of StyleGAN to generate images of new Pokemon.☆20Dec 16, 2021Updated 4 years ago
- An naive anomaly detection and data visualization tool for F1 on board telemetry data.☆15Jun 17, 2022Updated 3 years ago
- A comprehensive list of OpenCV algorithms and Clustering approaches made from scratch and with detailed explanations☆32Jan 23, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Image Captioning using CNN and Transformer.☆55Nov 9, 2021Updated 4 years ago
- A simple yet useful tool built to extract only the alphanumerical characters from a license plate☆19Jul 22, 2020Updated 5 years ago
- Transformer-based model for Speech Emotion Recognition(SER) - implemented by Pytorch☆42Apr 12, 2024Updated 2 years ago
- Blog of the LibreCV.org☆11May 17, 2021Updated 4 years ago
- ☆13Oct 29, 2021Updated 4 years ago
- Advanced algorithms and data structures for competitive programming and computational research: LA/LCA, RMQ, perfect hashing, vEB/x-fast …☆25Oct 8, 2024Updated last year
- MergeNet-filter-ldr2hdr, detail in paper 《Reconstructing HDR Image from a Single Filtered LDR Image Base on a Deep HDR Merger Network》☆10Sep 11, 2019Updated 6 years ago
- InfAnFace: Bridging the infant-adult domain gap in facial landmark estimation in the wild (ICPR2022)☆14Jan 9, 2026Updated 3 months ago
- Challenges in Video-Based Infant Action Recognition: A Critical Examination of the State of the Art (WACVW'24)☆16Nov 20, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33May 18, 2022Updated 3 years ago
- Detects the presence of texts on your UIImage, slices the words in different exportable images together with the string detected (using T…☆29Jun 29, 2018Updated 7 years ago
- Sequence alignement methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- ICMEW:A_Generative_Compression_Framework_For_Low_Bandwidth_Video_Conference☆10Dec 7, 2021Updated 4 years ago
- ☆17Oct 25, 2022Updated 3 years ago
- ☆18Oct 22, 2021Updated 4 years ago
- Build a recommendation engine with Spark and Watson Machine Learning☆46Feb 18, 2020Updated 6 years ago
- ☆16Nov 25, 2022Updated 3 years ago
- Detect and remove or lower the volume of breathing in speech recordings.☆14May 14, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Jul 24, 2022Updated 3 years ago
- Official Code Release for "Adapting Pre-trained Vision Transformers from 2D to 3D through Weight Inflation Improves Medical Image Segment…☆52Jun 8, 2023Updated 2 years ago
- Experiments with Neural Ordinary Differential Equations on image and text classification tasks☆33Mar 24, 2019Updated 7 years ago
- Calculate Spatial Information / Temporal Information according to ITU-T P.910☆17Dec 11, 2022Updated 3 years ago
- Codes for ACMMM 2021 paper "Fully Quantized Image Super-Resolution Networks".☆19Jul 25, 2021Updated 4 years ago
- ☆12Mar 23, 2026Updated 3 weeks ago
- XCORE-VOICE Solution☆19Apr 8, 2026Updated last week
- Use `outlines` generators with Haystack.☆15Updated this week
- ☆25Sep 27, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Automatically convert 2D medical images (DICOM) to 3D using VTK and python☆55Aug 30, 2022Updated 3 years ago
- Code for the paper: MACE: Leveraging Audio for Evaluating Audio Captioning Systems☆13Jan 16, 2025Updated last year
- This is the implementation our Interspeech 2022 paper " Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Sep 18, 2023Updated 2 years ago
- Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…☆21Sep 4, 2020Updated 5 years ago
- ☆20Dec 6, 2020Updated 5 years ago
- A Docker Wrapper to make the machine easily learn any language on top of INRIA OSCAR dataset using GPT2☆12Jan 30, 2020Updated 6 years ago
- ☆12Jan 28, 2022Updated 4 years ago