weimeng23 / speech-recognition-learning-resources
A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
☆56 Updated 11 months ago
Alternatives and similar repositories for speech-recognition-learning-resources:
Users who are interested in speech-recognition-learning-resources are comparing it to the repositories listed below:
- Finetune Wav2vec 2.0 For Speech Recognition ☆129 Updated 2 months ago
- Example code for a neural transducer model. ☆61 Updated last year
- Introduction to Speech Processing ☆86 Updated 2 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆80 Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters! ☆37 Updated last year
- ☆29 Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆81 Updated last year
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper ☆12 Updated 3 years ago
- A unified dataset of multilingual emotional human utterances ☆25 Updated 3 years ago
- ☆66 Updated 4 months ago
- ☆17 Updated 2 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch. ☆43 Updated last year
- ☆46 Updated 2 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce… ☆22 Updated last month
- A list of papers for child ASR ☆39 Updated 6 months ago
- This repository surveys papers focusing on Prompting and Adapters for Speech Processing. ☆108 Updated last year
- Various speech datasets made available to the public ☆116 Updated 4 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 Updated 3 years ago
- ☆11 Updated last year
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023) ☆27 Updated last year
- Clustering-based methods for overlapping diarization ☆81 Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆52 Updated 2 months ago
- Wav2vec 2.0 Self-Supervised Pretraining ☆43 Updated 2 months ago
- A curated list of speech datasets (110+ datasets, 75+ easy to download) ☆130 Updated 2 years ago
- Whisper finetuned on VinBigdata-VLSP2020-100h + KenLM ☆39 Updated last year
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ☆17 Updated last year
- Script to perform statistical significance test between ASR hypotheses. ☆22 Updated 7 years ago
- Balanced Error Rate for Speaker Diarization ☆30 Updated 2 years ago
- Analyzing and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ☆15 Updated 4 years ago
- Code for the method proposed in the paper: ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres… ☆21 Updated last year