Contains links to publicly available datasets for modeling health outcomes using speech and language.
☆129Jun 10, 2024Updated 2 years ago
Alternatives and similar repositories for speech-nlp-datasets
Users that are interested in speech-nlp-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Templates I use for Competitive Programming (CP).☆11Nov 1, 2020Updated 5 years ago
- scripts to model depression in speech and text☆74Jan 22, 2020Updated 6 years ago
- Matlab tools for pathological voice analysis☆14May 12, 2023Updated 3 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆15Sep 6, 2024Updated last year
- Official implementation for AVGN☆41Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- feature extraction from speech signals☆395Jun 10, 2026Updated last week
- ☆18Nov 15, 2021Updated 4 years ago
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- ☆18Mar 13, 2024Updated 2 years ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 6 years ago
- ☆88Dec 21, 2022Updated 3 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆38Dec 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- A longitudinal spontaneous speech (machine learning audio) dataset for dementia diagnosis.☆32Aug 15, 2022Updated 3 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆32Feb 28, 2025Updated last year
- Novoic's linguistic feature extraction library☆37Jan 21, 2022Updated 4 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- ☆32Nov 24, 2024Updated last year
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- Repository of the IJCV'26 & WACV'24 paper☆34Apr 27, 2026Updated last month
- Openfst mirror with some fixes☆15Aug 23, 2024Updated last year
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 9 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- ☆28Nov 7, 2023Updated 2 years ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆83Jun 7, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- A Praat script for creation of pictures (waveform, spectrogram, pitch contour, aligned with a textgrid). It creates figures in PNG PDF wm…☆25Mar 9, 2026Updated 3 months ago
- Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …☆19Feb 15, 2022Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆19Dec 1, 2024Updated last year
- EMO-SUPERB submission☆51Oct 13, 2025Updated 8 months ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,201Jun 6, 2024Updated 2 years ago