idiap / torgo_asrView external linksLinks
A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech
☆17Sep 22, 2023Updated 2 years ago
Alternatives and similar repositories for torgo_asr
Users that are interested in torgo_asr are comparing it to the libraries listed below
Sorting:
- ASR for dysarthric speakers with Kaldi☆13Jan 14, 2017Updated 9 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- Correspondence and autoencoder neural network training for speech using Pylearn2.☆14Dec 9, 2015Updated 10 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Jul 25, 2024Updated last year
- Script to perform statistical significance test between ASR hypotheses.☆22Aug 13, 2017Updated 8 years ago
- Toolkit to asses speech impairments in patients with neurological disorders☆58May 25, 2018Updated 7 years ago
- ☆37Sep 21, 2025Updated 4 months ago
- ☆34May 25, 2020Updated 5 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 6 years ago
- Machine learning speaker characteristics☆43Updated this week
- Creates CMM script that can directly executed on Kaggle from easy merge script☆13Jan 12, 2026Updated last month
- [Interspeech 2025] Official implementation of "Training-Free Voice Conversion with Factorized Optimal Transport"☆43Sep 24, 2025Updated 4 months ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- Using large language models to maintain AI_CHANGELOG.md☆14Jul 15, 2024Updated last year
- [Interspeech 2024] Enhancing Dysarthric Speech Recognition for Unseen Speakers via Prototype-Based Adaptation☆12Nov 28, 2024Updated last year
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal☆11Jul 27, 2020Updated 5 years ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆17Dec 7, 2025Updated 2 months ago
- ☆11Aug 11, 2023Updated 2 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 10 months ago
- Spin up any(almost) llm locally!☆14Dec 4, 2023Updated 2 years ago
- This is application for dysarthria to improve their pronunciation by using deep learning☆10Dec 29, 2020Updated 5 years ago
- ☆16Feb 18, 2024Updated last year
- msglm makes it a little easier to create messages for language models like Claude and OpenAI GPTs.☆14Jan 29, 2026Updated 2 weeks ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)☆10Jun 2, 2021Updated 4 years ago
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 10 months ago
- A chat implementation for FastHTML☆11Sep 14, 2025Updated 5 months ago
- Datasets of audio adversarial examples for deep speech recognition systems and Python code of a detection system☆12May 6, 2023Updated 2 years ago
- Open Source Speech Inferencing Libary for Indic Languages☆13Apr 11, 2022Updated 3 years ago
- ☆10Apr 8, 2024Updated last year
- This is not remotely close to a finished product, and does not intend to nor does this claim to be working fine-tuning code for MaskGCT. …☆13Dec 4, 2024Updated last year
- Evaluation metrics and submission file creation scripts the Action Recognition challenge☆14Updated this week
- Activity Grammars for Temporal Action Segmentation (NeurIPS 2023)☆14Jun 14, 2024Updated last year
- eSNN - Learning similarity measure from data☆12Nov 28, 2019Updated 6 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- An example AWS SAM app showing how to deploy a fastai app using Lambda Container feature☆13Dec 6, 2020Updated 5 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- OpenCV Sample Projects in Rust☆12Nov 27, 2021Updated 4 years ago
- Depression-Detection represents a machine learning algorithm to classify audio using acoustic features in human speech, thus detecting de…☆14Jul 10, 2020Updated 5 years ago
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year