Contains links to publicly available datasets for modeling health outcomes using speech and language.
☆129Jun 10, 2024Updated last year
Alternatives and similar repositories for speech-nlp-datasets
Users that are interested in speech-nlp-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Templates I use for Competitive Programming (CP).☆11Nov 1, 2020Updated 5 years ago
- scripts to model depression in speech and text☆74Jan 22, 2020Updated 6 years ago
- Matlab tools for pathological voice analysis☆14May 12, 2023Updated 2 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Sep 6, 2024Updated last year
- Official implementation for AVGN☆40Mar 24, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- feature extraction from speech signals☆395Jun 15, 2025Updated 9 months ago
- ☆18Nov 15, 2021Updated 4 years ago
- A database of clean and noisy speech for audio research☆10Jan 26, 2018Updated 8 years ago
- Analyzes signal, finds fundamental frequency, HNR etc☆15Aug 23, 2017Updated 8 years ago
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- This project explores using machine learning methods for detection of Parkinson's disease using an individual's speech.☆15Nov 18, 2019Updated 6 years ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- ☆88Dec 21, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆31Feb 28, 2025Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- A longitudinal spontaneous speech (machine learning audio) dataset for dementia diagnosis.☆31Aug 15, 2022Updated 3 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- ☆21Jan 13, 2020Updated 6 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- ☆32Nov 24, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- ☆28Nov 7, 2023Updated 2 years ago
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 9 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆82Jun 7, 2024Updated last year
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Praat script for creation of pictures (waveform, spectrogram, pitch contour, aligned with a textgrid). It creates figures in PNG PDF wm…☆25Mar 9, 2026Updated 2 weeks ago
- Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …☆17Feb 15, 2022Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆18Dec 1, 2024Updated last year
- EMO-SUPERB submission☆51Oct 13, 2025Updated 5 months ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,141Jun 6, 2024Updated last year
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated 2 years ago