cristinae / ASRdys
ASR for dysarthric speakers with Kaldi
☆13Updated 8 years ago
Alternatives and similar repositories for ASRdys
Users who are interested in ASRdys are comparing it to the libraries listed below.
- Baseline kaldi script for UA-SPEECH corpus☆31Updated 11 months ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Updated last year
- Speech (audio) subjective evaluation system☆41Updated 5 years ago
- A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK☆60Updated 3 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆48Updated 8 months ago
- Contains code for Deep Self Supervised Hierarchical Clustering for Speaker Diarization☆17Updated 3 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆45Updated 6 years ago
- This is the code for controllable EVC framework for seen and unseen emotion generation.☆44Updated 3 years ago
- Making Espnet easier to use☆56Updated 4 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆61Updated 4 years ago
- End-To-End Speaker Verification based on X-vector and Neural PLDA - A PyTorch implementation☆22Updated 3 years ago
- Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.☆119Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆48Updated 3 years ago
- Neural network based similarity scoring for diarization (pytorch implementation of "LSTM based Similarity Measurement with Spectral Clust…☆44Updated 4 years ago
- MultiSV: scripts for data preparation☆27Updated 8 months ago
- Discriminative Condition-Aware PLDA☆44Updated last year
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆42Updated 2 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆50Updated 6 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆31Updated 2 years ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆45Updated 5 years ago
- Attention-based model for keywords spotting☆19Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆42Updated 2 years ago
- End-to-end waveform utterance enhancement for direct evaluation metrics optimization by fully convolutional neural networks (TASLP 2018)☆18Updated 6 years ago