A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech
☆17Sep 22, 2023Updated 2 years ago
Alternatives and similar repositories for torgo_asr
Users that are interested in torgo_asr are comparing it to the libraries listed below.
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Dec 9, 2015Updated 10 years ago
- Script to perform statistical significance test between ASR hypotheses.☆23Aug 13, 2017Updated 8 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 6 years ago
- Toolkit to assess speech impairments in patients with neurological disorders☆59May 25, 2018Updated 7 years ago
- ☆34May 25, 2020Updated 5 years ago
- In this project, we wish to identify psychiatric disorders through patients' speech☆12Jun 6, 2021Updated 4 years ago
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- Machine learning speaker characteristics☆43Mar 19, 2026Updated last week
- Blind Source Separation: Independent Component Analysis for EEG data with python-MNE package and SSVEP☆12Nov 26, 2018Updated 7 years ago
- An application that uses deep learning to help people with dysarthria improve their pronunciation☆10Dec 29, 2020Updated 5 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated last year
- VoxAngeles Corpus☆14Aug 23, 2025Updated 7 months ago
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆36Apr 29, 2025Updated 10 months ago
- (ICLR 2025) Multi-Task Corrupted Prediction for Learning Robust Audio-Visual Speech Representation☆15Apr 29, 2025Updated 10 months ago
- Raw waveform adaptation with SincNet☆12Mar 19, 2024Updated 2 years ago
- This repository can be used to perform Speech to Text Conversion in multiple Languages, e.g., It can convert whatever you are speaking in…☆11Oct 6, 2020Updated 5 years ago
- Supervised Speech Representation Learning for Parkinson's Disease Classification☆17Oct 26, 2021Updated 4 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)☆10Jun 2, 2021Updated 4 years ago
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆13Nov 28, 2024Updated last year
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Jun 17, 2022Updated 3 years ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Oct 8, 2020Updated 5 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆15Jun 11, 2024Updated last year
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…☆11Feb 14, 2024Updated 2 years ago
- Tutorial on Kaldi for Brandeis ASR course☆75Jan 21, 2020Updated 6 years ago
- ☆17Jan 1, 2024Updated 2 years ago
- Attention-based LSTM model with the Aspect information to solve financial opinion mining problem (WWW 2018 shared task1)☆16Feb 26, 2019Updated 7 years ago
- VCCA Pytorch Implementation on MNIST dataset☆16Apr 10, 2018Updated 7 years ago
- ☆17Mar 20, 2026Updated last week
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆43Sep 24, 2025Updated 6 months ago
- Use DEMUCS to split songs into multiple sources☆20Apr 11, 2022Updated 3 years ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated 3 weeks ago
- Pytorch-Kaldi implementation of SNN-based ASR systems☆18Feb 1, 2020Updated 6 years ago
- Publish Android to Google Play with Travis-CI☆18Oct 21, 2016Updated 9 years ago
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 9 years ago
- A living document for all things Common Voice.☆14Jun 24, 2024Updated last year