talhanai / speech-nlp-datasetsView external linksLinks
Contains links to publicly available datasets for modeling health outcomes using speech and language.
☆126Jun 10, 2024Updated last year
Alternatives and similar repositories for speech-nlp-datasets
Users that are interested in speech-nlp-datasets are comparing it to the libraries listed below
Sorting:
- Towards Intelligibility-Oriented Audio-Visual Speech Enhancement☆14Sep 6, 2024Updated last year
- Matlab tools for pathological voice analysis☆13May 12, 2023Updated 2 years ago
- Source code and demo for INTERPSEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Dec 5, 2023Updated 2 years ago
- Official implementation for AVGN☆40Mar 24, 2023Updated 2 years ago
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- ☆28Nov 7, 2023Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆11Nov 3, 2020Updated 5 years ago
- Python package for the extraction of speech features for sustained phonation☆12Aug 10, 2020Updated 5 years ago
- This project explores using machine learning methods for detection of Parkinson's disease using an individual's speech.☆15Nov 18, 2019Updated 6 years ago
- Real-time melgan based on cpu !!!☆13Dec 3, 2019Updated 6 years ago
- I wanted guided tutorials on digital signal processing so I decided to create them. The result is this ebook: "Digital Signal Processing …☆12Feb 5, 2024Updated 2 years ago
- scripts to model depression in speech and text☆74Jan 22, 2020Updated 6 years ago
- feature extraction from speech signals☆390Jun 15, 2025Updated 8 months ago
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆12Dec 2, 2024Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Repository of the WACV'24 paper "Can CLIP Help Sound Source Localization?"☆34Feb 21, 2025Updated 11 months ago
- ☆14Apr 2, 2023Updated 2 years ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆18May 17, 2023Updated 2 years ago
- This repository contains materials for the paper: Towards generating ambisonics using audio-visual cue for virtual reality☆13Jul 2, 2019Updated 6 years ago
- ☆17Oct 18, 2023Updated 2 years ago
- ☆12May 1, 2019Updated 6 years ago
- Analyzes signal, finds fundamental frequency, HNR etc☆15Aug 23, 2017Updated 8 years ago
- ☆17Nov 15, 2021Updated 4 years ago
- ☆32Nov 24, 2024Updated last year
- ☆87Dec 21, 2022Updated 3 years ago
- Praat-based tools for EGG analysis☆18Sep 21, 2023Updated 2 years ago
- Code for InterSpeech 2024 Paper: LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition☆18Jul 16, 2024Updated last year
- Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)☆15May 27, 2020Updated 5 years ago
- Official implementation of SwinGANMR☆16Sep 5, 2022Updated 3 years ago
- Official Implementation of our Interspeech 2021 paper "An Empirical Study on Channel Effects for Synthetic Voice Spoofing Countermeasure …☆16Feb 15, 2022Updated 4 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- Audio-Visual Corruption Modeling of our paper "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling an…☆35Jun 20, 2023Updated 2 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Sep 24, 2021Updated 4 years ago
- ☆15Jun 15, 2022Updated 3 years ago
- A comparative analysis of speech signal processing algorithms for Parkinson’s disease classification and the use of the tunable Q-factor …☆15Dec 8, 2022Updated 3 years ago
- Implementation of few-shot baseline for MedFMC☆17Nov 27, 2023Updated 2 years ago
- Cantonese Selfish Project 廣東話自肥企劃 at PYCON HK 2021☆15Feb 16, 2022Updated 4 years ago
- Accepted by TMM 2022☆19Aug 18, 2022Updated 3 years ago
- ☆18Mar 13, 2024Updated last year