cristinae / ASRdys
ASR for dysarthric speakers with Kaldi
☆13Updated 8 years ago
Alternatives and similar repositories for ASRdys:
Users that are interested in ASRdys are comparing it to the libraries listed below
- Baseline kaldi script for UA-SPEECH corpus☆29Updated 4 months ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆16Updated last year
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆41Updated 3 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Updated 3 years ago
- ☆45Updated 2 months ago
- ☆15Updated 2 years ago
- ☆32Updated 3 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆22Updated 3 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆47Updated last month
- A list of papers for child ASR☆37Updated 4 months ago
- ☆30Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- implementation of Monaural Speech Enhancement with Recursive Learning in the Time Domain☆45Updated 4 years ago
- End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks (TASLP 2018)☆18Updated 5 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆61Updated 3 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 5 years ago
- Typing to Listen at the Cocktail Party: Text-Guided Target Speaker Extraction (LLM-TSE)☆39Updated last year
- ☆51Updated 8 months ago
- Balanced Error Rate for Speaker Diarization☆29Updated last year
- Speech (audio) subjective evaluation system☆37Updated 4 years ago
- PyTorch implementation of WASE described in our ICASSP 2021: "Wase: Learning When to Attend for Speaker Extraction in Cocktail Party Envi…☆24Updated 3 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- Components loss for neural networks in mask-based speech enhancement☆33Updated 4 years ago
- ☆29Updated 2 years ago
- Official repo for the STRFNet system appeared in INTERSPEECH2020☆12Updated 3 years ago
- wsj0-{2, 3, 4, 5} mix generation scripts, in Python.☆55Updated 3 years ago