jim-schwoebel / awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆13 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for awesome-diarization
- Various speech datasets made available to the public ☆99 · Updated last month
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆71 · Updated 3 years ago
- This repository describes our reproducible framework for assessing self-supervised representation learning from speech ☆51 · Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition ☆27 · Updated 4 years ago
- Phonetically-Oriented Word Error Rate ☆33 · Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 · Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast. ☆65 · Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆57 · Updated last year
- Example code for a neural transducer model. ☆60 · Updated 9 months ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models. ☆13 · Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆100 · Updated last year
- A lightweight library to compute Diarization Error Rate (DER). ☆59 · Updated last year
- asr2k ☆48 · Updated 5 months ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020) ☆103 · Updated last year
- Articulatory features estimation using Listen Attend and Spell architecture. ☆32 · Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆67 · Updated last year
- SHAS: Approaching optimal Segmentation for End-to-End Speech Translation ☆37 · Updated last year
- Pronunciation-assisted Subword Modeling ☆29 · Updated 5 years ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 · Updated 4 years ago
- Dataset of ICASSP 2021 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION ☆37 · Updated last year
- An adaptation of Fairseq to (End-to-end) speech translation. ☆22 · Updated 2 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages. ☆20 · Updated 3 years ago
- Feature extractor for DL speech processing. ☆65 · Updated 2 years ago
- Code for AccentDB. ☆19 · Updated 3 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model ☆34 · Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 · Updated last year