weimeng23 / speech-recognition-learning-resources
A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
☆51Updated 10 months ago
Alternatives and similar repositories for speech-recognition-learning-resources:
Users that are interested in speech-recognition-learning-resources are comparing it to the libraries listed below
- Introduction to Speech Processing☆85Updated last month
- Example code for a neural transducer model.☆62Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆78Updated 11 months ago
- This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.☆108Updated last year
- A list of papers for child ASR☆38Updated 5 months ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆18Updated 2 months ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆12Updated 3 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- ☆28Updated 2 years ago
- Predicts the level of noise and reverberation on your audiofiles☆148Updated 10 months ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆77Updated last year
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆139Updated 2 years ago
- Clustering-based methods for overlapping diarization☆78Updated last year
- ☆66Updated 3 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆126Updated 3 weeks ago
- Spot the conversation: speaker diarisation in the wild☆137Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆51Updated last month
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.☆42Updated last year
- The official repository of Dynamic-SUPERB.☆176Updated 2 weeks ago
- ☆79Updated 7 months ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆64Updated 2 years ago
- ☆54Updated last year
- Various speech datasets made available to the public☆114Updated 3 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆102Updated 5 months ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- Script to perform statistical significance test between ASR hypotheses.☆21Updated 7 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆126Updated last month
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆93Updated 9 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆76Updated 10 months ago