weimeng23 / speech-recognition-learning-resourcesLinks

A list of speech recognition learning resources including courses, books, tutorials, papers and toolkits.

☆58

Alternatives and similar repositories for speech-recognition-learning-resources

Users that are interested in speech-recognition-learning-resources are comparing it to the libraries listed below

Sorting:

khanld / ASR-Wav2vec-Finetune
Finetune Wa2vec 2.0 For Speech Recognition
☆141Updated 5 months ago
joonson / voxconverse
Spot the conversation: speaker diarisation in the wild
☆141Updated 2 years ago
lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆64Updated last year
espnet / notebook
☆69Updated last month
luferrer / ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
☆86Updated last year
Speech-Interaction-Technology-Aalto-U / itsp
Introduction to Speech Processing
☆99Updated last week
RevoSpeechTech / speech-datasets-collection
a curated list of speech datasets (110+ datasets, 75+ easy to download)
☆140Updated 2 years ago
revdotcom / speech-datasets
Various speech datasets made available to the public
☆123Updated 7 months ago
HLasse / multidiagnosis-speech
☆11Updated 2 years ago
stevenhillis / awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
☆79Updated 2 years ago
khanld / Wav2vec2-Pretraining
Wav2vec 2.0 Self-Supervised Pretraining
☆48Updated 5 months ago
dobby-seo / Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
☆108Updated 2 years ago
mtkresearch / clairaudience
Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)
☆27Updated last year
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
☆153Updated last month
skit-ai / SpeechLLM
This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…
☆115Updated last year
pyyush / SpecAugment
SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆83Updated 4 years ago
standing-o / Combined_Dataset_for_Speech_Emotion_Recognition
A collection of dataset consists of a total of 8 English speech datasets for SER
☆25Updated 6 months ago
Diamondfan / Child-ASR-Paper
A list of papers for child ASR
☆43Updated 9 months ago
Lhx94As / Awesome-Spoken-Language-Identification
An awesome spoken LID repository. (Working in progress
☆105Updated last year
ga642381 / Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
☆110Updated last year
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81Updated last year
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆88Updated last year
halsay / ASR-TTS-paper-daily
Update ASR paper everyday
☆259Updated this week
tuanio / noisy-student-training-asr
Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem
☆96Updated last month
iiscleap / NISP-Dataset
☆30Updated 2 years ago
gemengtju / Tutorial_Speech_Signal_Processing
This repo summarizes the courses and materials for speech signal processing. You are kindly invited to pull requests.
☆95Updated 4 years ago
upskyy / Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)
☆142Updated 2 years ago
ekapolc / ASR_course
ASR course at Chula 2018
☆62Updated 7 years ago
sasv-challenge / SASVC2022_Baseline
Baseline for the Spoofing-aware Speaker Verification Challenge 2022
☆65Updated 3 years ago