talhanai/speech-nlp-datasets

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/talhanai/speech-nlp-datasets)

talhanai / speech-nlp-datasets

Contains links to publicly available datasets for modeling health outcomes using speech and language.

☆129

Alternatives and similar repositories for speech-nlp-datasets

Users that are interested in speech-nlp-datasets are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tusharnankani / cp-templates
View on GitHub
Templates I use for Competitive Programming (CP).
☆11Nov 1, 2020Updated 5 years ago
Mak-Sim / Troparion
View on GitHub
Matlab tools for pathological voice analysis
☆14May 12, 2023Updated 3 years ago
cogmhear / Intelligibility-Oriented-Audio-Visual-Speech-Enhancement
View on GitHub
Towards Intelligibility-Oriented Audio-Visual Speech Enhancement
☆15Sep 6, 2024Updated last year
mdhk / awesome-speech-interpretability
View on GitHub
A curated list of interpretability work for speech processing models
☆15Aug 25, 2025Updated 10 months ago
ffxiong / uaspeech
View on GitHub
Baseline kaldi script for UA-SPEECH corpus
☆32Oct 16, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
kamperh / globalphone_awe
View on GitHub
Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.
☆11Nov 3, 2020Updated 5 years ago
stoneMo / AVGN
View on GitHub
Official implementation for AVGN
☆41Mar 24, 2023Updated 3 years ago
jcvasquezc / DisVoice
View on GitHub
feature extraction from speech signals
☆394Updated this week
aeesha-T / parkinsons_prediction_using_speech
View on GitHub
☆18Nov 15, 2021Updated 4 years ago
adachille / parkinsons-detector
View on GitHub
This project explores using machine learning methods for detection of Parkinson's disease using an individual's speech.
☆15Nov 18, 2019Updated 6 years ago
brookemosby / Speech_Analysis
View on GitHub
Analyzes signal, finds fundamental frequency, HNR etc
☆15Aug 23, 2017Updated 8 years ago
bootphon / sustained-phonation-features
View on GitHub
Python package for the extraction of speech features for sustained phonation
☆12Aug 10, 2020Updated 5 years ago
tuanct1997 / Federated-Learning-ASR-based-on-wav2vec-2.0
View on GitHub
☆18Mar 13, 2024Updated 2 years ago
DTaoo / DMC
View on GitHub
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)
☆15May 27, 2020Updated 6 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
megseekosh / dsp_tutorials
View on GitHub
I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …
☆12Feb 5, 2024Updated 2 years ago
JoungheeKim / K-wav2vec
View on GitHub
☆87Dec 21, 2022Updated 3 years ago
WangHelin1997 / DuTa-VC
View on GitHub
Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…
☆38Dec 5, 2023Updated 2 years ago
novoic / blabla
View on GitHub
Novoic's linguistic feature extraction library
☆38Jan 21, 2022Updated 4 years ago
Sreyan88 / LipGER
View on GitHub
Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
☆19Jul 16, 2024Updated 2 years ago
MingjieChen / VoiceConversionGANs
View on GitHub
GAN series for voice conversion on VCC2018 dataset
☆17Aug 27, 2020Updated 5 years ago
dcleres / Parkinson_Disease_ML
View on GitHub
A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …
☆15Dec 8, 2022Updated 3 years ago
athena-team / athena-transform
View on GitHub
☆21Jan 13, 2020Updated 6 years ago
ArezoShakeri / MultiConAD
View on GitHub
This repo contains the code for generating the multilingual dataset introduced in the paper "MultiConAD: A Unified Multilingual Conversat…
☆20Jul 9, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
soumimaiti / speechlmscore_tool
View on GitHub
☆34Nov 24, 2024Updated last year
bootphon / shennong
View on GitHub
A Python toolbox for speech features extraction
☆166Feb 8, 2023Updated 3 years ago
Speech-Lab-IITM / CCC-wav2vec-2.0
View on GitHub
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23Mar 18, 2024Updated 2 years ago
TUIlmenauAMS / FilterBanks_PythonKerasNeuralNetworkImplemention
View on GitHub
Filter Bank Implementaion as Convolutional Neural Network using Python Keras
☆17Dec 18, 2024Updated last year
TaoRuijie / SEANet
View on GitHub
Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)
☆32Feb 28, 2025Updated last year
iamyuanchung / Autoregressive-Predictive-Coding
View on GitHub
Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning
☆191Jan 29, 2020Updated 6 years ago
suhaspillai / Speech-Recognition-Impaired-Speech
View on GitHub
Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…
☆23Mar 26, 2017Updated 9 years ago
swimmiing / ACL-SSL
View on GitHub
Repository of the IJCV'26 & WACV'24 paper
☆34Apr 27, 2026Updated 2 months ago
ayanglab / SwinGANMR
View on GitHub
Official implementation of SwinGANMR
☆17Sep 5, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
rgzn-aiyun / melgan-cpu
View on GitHub
Real-time melgan based on cpu ！！！
☆13Dec 3, 2019Updated 6 years ago
WangHelin1997 / SpeechTasks
View on GitHub
This is a list of speech tasks and datasets, which can provide training data for Generative AI, AIGC, AI model training, intelligent spee…
☆83Jun 7, 2024Updated 2 years ago
wendyelviragarcia / create_pictures
View on GitHub
A Praat script for creation of pictures (waveform, spectrogram, pitch contour, aligned with a textgrid). It creates figures in PNG PDF wm…
☆25Mar 9, 2026Updated 4 months ago
habla-liaa / ser-with-w2v2
View on GitHub
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
☆140Jan 6, 2025Updated last year
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
GPUPhobia / vocal-mask
View on GitHub
☆12May 1, 2019Updated 7 years ago
YoshikiMas / madeon-asr
View on GitHub
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆19Dec 1, 2024Updated last year