weimeng23 / speech-recognition-learning-resources
A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
☆50Updated 9 months ago
Alternatives and similar repositories for speech-recognition-learning-resources:
Users that are interested in speech-recognition-learning-resources are comparing it to the libraries listed below
- Example code for a neural transducer model.☆61Updated last year
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆11Updated 2 years ago
- A list of papers for child ASR☆37Updated 4 months ago
- ☆28Updated 2 years ago
- Introduction to Speech Processing☆82Updated 4 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆77Updated 10 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- A curated list of awesome papers on contextualizing E2E ASR outputs☆76Updated last year
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆34Updated last year
- A unified dataset of multilingual emotional human utterances☆24Updated 3 years ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆122Updated 2 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆74Updated 3 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆136Updated 2 years ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆107Updated last year
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Script to perform statistical significance test between ASR hypotheses.☆21Updated 7 years ago
- Wav2vec 2.0 Self-Supervised Pretraining☆39Updated 2 weeks ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆100Updated 4 months ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆17Updated last month
- ☆43Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆57Updated this week
- Various speech datasets made available to the public☆113Updated 2 months ago
- ☆11Updated this week
- Clustering-based methods for overlapping diarization☆75Updated last year
- ☆43Updated 2 years ago
- An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…☆13Updated 3 years ago
- ☆16Updated 2 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆124Updated 2 weeks ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆102Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 9 months ago