ASR for dysarthric speakers with Kaldi
☆13Jan 14, 2017Updated 9 years ago
Alternatives and similar repositories for ASRdys
Users that are interested in ASRdys are comparing it to the libraries listed below
Sorting:
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- Toolkit to asses speech impairments in patients with neurological disorders☆59May 25, 2018Updated 7 years ago
- ☆34May 25, 2020Updated 5 years ago
- ☆15Jun 15, 2022Updated 3 years ago
- Attention-based model for keywords spotting☆19Aug 9, 2021Updated 4 years ago
- Speech Recognition for speakers with speech disorders due to diseases like Cerebral Palsy, Parkinson or Amyotrophic Lateral Sclerosis ALS…☆23Mar 26, 2017Updated 8 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 6 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32May 16, 2019Updated 6 years ago
- Official PyTorch implementation of SGEM: Test-Time Adaptation for Automatic Speech Recognition via Sequential-Level Generalized Entropy M…☆37Aug 27, 2024Updated last year
- Supporting code for instrumentation courses at Universidade Nova de Lisboa - Faculdade de Ciência de Lisboa☆16Oct 7, 2022Updated 3 years ago
- Introduction to version control with RStudio☆10Jul 7, 2020Updated 5 years ago
- A lightweight library to read/write wave audio files to/from lists of native Python types.☆12Jun 10, 2024Updated last year
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- Repository for fine-tuning BEATs and using BEATs as feature extractor in a prototypical network. This repository has been used to complet…☆34Dec 28, 2025Updated 2 months ago
- Tool for slot extraction from text☆15Oct 23, 2022Updated 3 years ago
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 26, 2026Updated last week
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 10 months ago
- Pre-trained Wav2vec2.0 for Mandarin☆43Oct 30, 2022Updated 3 years ago
- VoxAngeles Corpus☆13Aug 23, 2025Updated 6 months ago
- ☆12Aug 5, 2022Updated 3 years ago
- In this project, we wish to identify psychiatric disorders through patient's speech☆12Jun 6, 2021Updated 4 years ago
- Official PyTorch code for "Vector Quantization Prompting for Continual Learning (NeurIPS2024)".☆10Oct 16, 2024Updated last year
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- Dynamically build a chain of DSP with poly~ objects inside poly~ objects☆10Aug 1, 2019Updated 6 years ago
- Python Tools for the POP Metrics☆13Feb 16, 2022Updated 4 years ago
- Depression-Detection represents a machine learning algorithm to classify audio using acoustic features in human speech, thus detecting de…☆14Jul 10, 2020Updated 5 years ago
- Command line client for figshare☆19Jul 3, 2014Updated 11 years ago
- A self contained example demonstrating how to use MediaPipe Object Detection with Max's jweb☆12Jun 26, 2023Updated 2 years ago
- Functions for creating speech features in MATLAB.☆14Jul 7, 2020Updated 5 years ago
- Blind Monaural Source Separation on Heart and Lung Sounds Based on Periodic-Coded Deep Autoencoder☆12Apr 8, 2021Updated 4 years ago
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- Code examples for Smaller C, O'Reilly☆14Mar 22, 2021Updated 4 years ago
- Clustering algorithms (Mean shift and K-Means) from scratch in NumPy, PyTorch, TensorFlow, and JAX☆11Oct 3, 2022Updated 3 years ago
- ☆13May 21, 2024Updated last year
- Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.☆12Mar 15, 2025Updated 11 months ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated 2 years ago
- Datasets of audio adversarial examples for deep speech recognition systems and Python code of a detection system☆12May 6, 2023Updated 2 years ago