weimeng23 / speech-recognition-learning-resources
A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.
☆57Updated last year
Alternatives and similar repositories for speech-recognition-learning-resources
Users who are interested in speech-recognition-learning-resources are comparing it to the repositories listed below
- ☆67Updated 5 months ago
- Example code for a neural transducer model.☆61Updated last year
- Clustering-based methods for overlapping diarization☆81Updated last year
- Finetune Wav2vec 2.0 For Speech Recognition☆131Updated 4 months ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆141Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆140Updated 2 years ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- ☆30Updated 2 years ago
- A collection of 8 English speech datasets for SER☆22Updated 4 months ago
- PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper☆12Updated 3 years ago
- A curated list of awesome papers on contextualizing E2E ASR outputs☆77Updated 2 years ago
- Introduction to Speech Processing☆95Updated 3 months ago
- Predicts the level of noise and reverberation on your audiofiles☆151Updated last year
- Awesome Automatic Speech Recognition (ASR) paper collection☆19Updated 4 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆53Updated 3 months ago
- ☆40Updated last year
- Transformer implementation specialized in speech recognition tasks using PyTorch.☆64Updated 3 years ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆132Updated 2 years ago
- A list of papers for child ASR☆42Updated 7 months ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch.☆46Updated 2 years ago
- This repository surveys papers focusing on Prompting and Adapters for Speech Processing.☆108Updated last year
- ☆17Updated 2 years ago
- ☆31Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- An effort to track benchmarking results over widely-used datasets for ASR.☆46Updated 3 years ago
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. This model shows state-of-the-art results on Speech Commands dataset V1 and V2.☆107Updated 2 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆83Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 3 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆20Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆80Updated 4 years ago