Contains links to publicly available datasets for modeling health outcomes using speech and language.
☆129Jun 10, 2024Updated last year
Alternatives and similar repositories for speech-nlp-datasets
Users that are interested in speech-nlp-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Templates I use for Competitive Programming (CP).☆11Nov 1, 2020Updated 5 years ago
- scripts to model depression in speech and text☆74Jan 22, 2020Updated 6 years ago
- Matlab tools for pathological voice analysis☆14May 12, 2023Updated 2 years ago
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Sep 6, 2024Updated last year
- Official implementation for AVGN☆41Mar 24, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- feature extraction from speech signals☆393Jun 15, 2025Updated 10 months ago
- ☆18Nov 15, 2021Updated 4 years ago
- A database of clean and noisy speech for audio research☆10Jan 26, 2018Updated 8 years ago
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- This project explores using machine learning methods for detection of Parkinson's disease using an individual's speech.☆15Nov 18, 2019Updated 6 years ago
- ☆18Mar 13, 2024Updated 2 years ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- ☆88Dec 21, 2022Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆32Feb 28, 2025Updated last year
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆19Jul 16, 2024Updated last year
- A longitudinal spontaneous speech (machine learning audio) dataset for dementia diagnosis.☆32Aug 15, 2022Updated 3 years ago
- Novoic's linguistic feature extraction library☆37Jan 21, 2022Updated 4 years ago
- ☆21Jan 13, 2020Updated 6 years ago
- A Python toolbox for speech features extraction☆165Feb 8, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆32Nov 24, 2024Updated last year
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- ☆28Nov 7, 2023Updated 2 years ago
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆34Feb 21, 2025Updated last year
- Openfst mirror with some fixes☆15Aug 23, 2024Updated last year
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…☆82Jun 7, 2024Updated last year
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- A Praat script for creation of pictures (waveform, spectrogram, pitch contour, aligned with a textgrid). It creates figures in PNG PDF wm…☆25Mar 9, 2026Updated last month
- Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …☆18Feb 15, 2022Updated 4 years ago
- Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'☆140Jan 6, 2025Updated last year
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆19Dec 1, 2024Updated last year