talhanai / speech-nlp-datasetsLinks
Contains links to publicly available datasets for modeling health outcomes using speech and language.
☆125Updated last year
Alternatives and similar repositories for speech-nlp-datasets
Users that are interested in speech-nlp-datasets are comparing it to the libraries listed below
Sorting:
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆133Updated 3 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆137Updated 9 months ago
- Data repository of Project Coswara☆193Updated 2 years ago
- A collection of datasets for the purpose of emotion recognition/detection in speech.☆381Updated last year
- feature extraction from speech signals☆382Updated 4 months ago
- Simple, straight-forward extraction of acoustic and prosodic features from sound waves based on Praat and Parselmouth.☆27Updated 6 years ago
- ☆138Updated last year
- Time series course Fall 2019 project☆53Updated 5 years ago
- ☆110Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆28Updated 3 years ago
- The code for our INTERSPEECH 2020 paper - Jointly Fine-Tuning "BERT-like'" Self Supervised Models to Improve Multimodal Speech Emotion R…☆120Updated 4 years ago
- Python package for openSMILE☆296Updated this week
- ☆30Updated 3 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Updated 2 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆267Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 5 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆90Updated last year
- Multilingual datasets with raw audio for speech emotion recognition☆29Updated 4 years ago
- The official repository for Audio ALBERT☆67Updated 3 years ago
- openXBOW - the Passau Open-Source Crossmodal Bag-of-Words Toolkit☆82Updated 4 years ago
- A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.☆136Updated 5 years ago
- A collection of Audio and Speech pre-trained models.☆193Updated 5 years ago
- Spot the conversation: speaker diarisation in the wild☆151Updated 3 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆66Updated 4 years ago
- Urdu Language Speech Emotional Corpus☆46Updated 6 years ago
- A Python library for measuring the acoustic features of speech (simultaneous speech, high entropy) compared to ones of native speech.☆263Updated 2 years ago
- This repository contains PyTorch implementation of 4 different models for classification of emotions of the speech.☆205Updated 2 years ago
- Implementation of the paper "Improved End-to-End Speech Emotion Recognition Using Self Attention Mechanism and Multitask Learning" From I…☆57Updated 4 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 5 years ago
- ☆52Updated 4 years ago