A collection of dataset consists of a total of 8 English speech datasets for SER
☆32Jan 8, 2025Updated last year
Alternatives and similar repositories for Combined_Dataset_for_Speech_Emotion_Recognition
Users that are interested in Combined_Dataset_for_Speech_Emotion_Recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.☆17Jun 12, 2024Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆11Oct 20, 2023Updated 2 years ago
- brainless concatenative text to speech☆14May 11, 2021Updated 4 years ago
- [INTERSPEECH 2024] EmoBox: Multilingual Multi-corpus Speech Emotion Recognition Toolkit and Benchmark☆316Mar 18, 2026Updated last month
- OpenAI Whisper demo on Axera☆15Jan 15, 2026Updated 3 months ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Source code for paper "Breaking Security-Critical Voice Authentication".☆13Jul 10, 2023Updated 2 years ago
- ☆13Jul 10, 2021Updated 4 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆411Sep 30, 2024Updated last year
- ☆14Aug 19, 2024Updated last year
- Tools for the automatic detection of speech-related inhalation events and characterisation of the speech respiratory cycle.☆11Feb 17, 2024Updated 2 years ago
- Pytorch implementation of MoLA☆22Jun 9, 2025Updated 10 months ago
- Respiratory Disorder Classification Based on Lung Auscultation sounds☆13Oct 22, 2024Updated last year
- End-To-End SpeechSynthesis system with knowledge distillation☆18Jul 16, 2022Updated 3 years ago
- A versatile, easily configurable vocoder software in MATLAB, for research purposes☆14Apr 9, 2021Updated 5 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Human age estimation using deep neural networks (Keras)☆14Aug 10, 2023Updated 2 years ago
- used to evaluate wavenet vocoder by rmse f0, MCD, rmse ap...☆15Jan 20, 2020Updated 6 years ago
- Python interface to Optotune focus-tunable lenses☆13Feb 4, 2020Updated 6 years ago
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆21Nov 19, 2024Updated last year
- Utility to mass-download a Twitch streamer's clips. Allows both local storage, as well as directly upload to Google Drive☆12Dec 20, 2023Updated 2 years ago
- Vocal Tract Modelling by Murphy, Shelley and Ternström☆17Nov 13, 2022Updated 3 years ago
- A fully and partially fake speech dataset for evaluation☆15Nov 11, 2025Updated 5 months ago
- This repo summarizes the courses and materials for speech signal processing. You are kindly invited to pull requests.☆99Jul 20, 2020Updated 5 years ago
- CLASP: Contrastive Language-Speech Pretraining for Multilingual Multimodal Information Retrieval☆13Jun 27, 2025Updated 9 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- English to IPA with syllable correspondence☆13Aug 23, 2022Updated 3 years ago
- Interoperability for Grasshopper and Revit☆22Aug 18, 2017Updated 8 years ago
- This model is designed Using GMM and MFCC and tested with Hindi/English audio samples with a good resultant accuracy.☆15Jun 10, 2020Updated 5 years ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆185Feb 28, 2026Updated last month
- ⚡️Official Image-charts Python library☆12Apr 2, 2026Updated 2 weeks ago
- (INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classificatio…☆25Jul 10, 2025Updated 9 months ago
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆20Nov 15, 2022Updated 3 years ago
- A three-dimensional vocal tract acoustic model using the finite-difference time-domain (FDTD) numerical scheme.☆18Sep 25, 2022Updated 3 years ago
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Reference-aware automatic speech evaluation toolkit☆181Dec 5, 2024Updated last year
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆20May 12, 2023Updated 2 years ago
- A Python project for generating concatenative synthesis driven representations of audio files based on audio database analysis.☆18Feb 13, 2018Updated 8 years ago
- A rough and ready Python utility which splits audio files based on silence and desired min/max chunk duration.☆16Jun 22, 2022Updated 3 years ago
- ☆15Jun 15, 2022Updated 3 years ago
- Repo for hosting tutorial code associated with the "AssemblyAI and Python in 5 Minutes" blog by AssemblyAI☆12Jul 29, 2023Updated 2 years ago
- ☆12Oct 17, 2024Updated last year