cristinae / ASRdysView external linksLinks
ASR for dysarthric speakers with Kaldi
☆13Jan 14, 2017Updated 9 years ago
Alternatives and similar repositories for ASRdys
Users that are interested in ASRdys are comparing it to the libraries listed below
Sorting:
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Oct 16, 2024Updated last year
- ☆34May 25, 2020Updated 5 years ago
- Attention-based model for keywords spotting☆19Aug 9, 2021Updated 4 years ago
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…☆12Apr 1, 2019Updated 6 years ago
- Python implementation of the paper " Dynamic Temporal Alignment of Speech to Lips"☆32May 16, 2019Updated 6 years ago
- Supporting code for instrumentation courses at Universidade Nova de Lisboa - Faculdade de Ciência de Lisboa☆16Oct 7, 2022Updated 3 years ago
- This repository is the implementation of the HiPAMA architecture, introduced in the paper, Hierarchical Pronunciation Assessment with Mul…☆38Apr 29, 2024Updated last year
- Repository for fine-tuning BEATs and using BEATs as feature extractor in a prototypical network. This repository has been used to complet…☆34Dec 28, 2025Updated last month
- A codebase for data crawling and preprocessing for TTS and ASR systems training.☆22Feb 5, 2026Updated last week
- Tool for slot extraction from text☆15Oct 23, 2022Updated 3 years ago
- The implementation for "Large Language Model Can Transcribe Speech in Multi-Talker Scenarios with Versatile Instructions"☆50Apr 7, 2025Updated 10 months ago
- [CVPR 2024] "Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition"☆12Feb 27, 2024Updated last year
- [CVPR 2024] Code and datasets for 'Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos'☆13Jun 16, 2024Updated last year
- A self contained example demonstrating how to use MediaPipe Object Detection with Max's jweb☆12Jun 26, 2023Updated 2 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023☆12Mar 17, 2024Updated last year
- Dynamically build a chain of DSP with poly~ objects inside poly~ objects☆10Aug 1, 2019Updated 6 years ago
- Code examples for Smaller C, O'Reilly☆14Mar 22, 2021Updated 4 years ago
- Code for paper Audio Visual Speaker Localization from EgoCentric Views☆11Jul 3, 2024Updated last year
- Clustering algorithms (Mean shift and K-Means) from scratch in NumPy, PyTorch, TensorFlow, and JAX☆11Oct 3, 2022Updated 3 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…☆10Dec 25, 2019Updated 6 years ago
- ☆14Jul 1, 2024Updated last year
- vectorized decimal parsing☆13Dec 17, 2022Updated 3 years ago
- Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…☆11Feb 14, 2024Updated 2 years ago
- Qualifying Exam Preparing☆16May 7, 2025Updated 9 months ago
- c++ libraries required for all of my projects☆16Jan 6, 2026Updated last month
- ☆11Jul 6, 2022Updated 3 years ago
- A repository that is a workflow of Anthony Reis's "Writing Interpreters and Compilers for the Raspberry Pi Using Python"☆10Apr 1, 2021Updated 4 years ago
- Node.js module to capture PCM data with ALSA.☆12Feb 4, 2023Updated 3 years ago
- [CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization☆12Jul 9, 2024Updated last year
- ☆11Nov 8, 2021Updated 4 years ago
- Audio WAV file tools for C# read and write, 8 and 16 bits, mono and stereo.☆11Oct 20, 2015Updated 10 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- Code for "ATTA: Anomaly-aware Test-Time Adaptation for Out-of-Distribution Detection in Segmentation" (NeurIPS 23)☆14Apr 12, 2024Updated last year
- A query by humming system based on locality sensitive hashing indexes☆12May 8, 2014Updated 11 years ago
- Introduction to Digital SIgnal Processing and pd☆13Nov 27, 2025Updated 2 months ago
- Annotations and scripts for use with University of Wisconsin X-Ray Microbeam Speech Production Database (1994)☆13Oct 8, 2020Updated 5 years ago
- Quizzes and assignment submissions for DeepLearning.AI TensorFlow Developer Professional Certificate program☆12Nov 6, 2023Updated 2 years ago
- CounterGeDi is a pipeline that aims at controlling the counter speech generated to make it emotional, polite and detoxified. Paper accept…☆11Jul 19, 2022Updated 3 years ago