DongKeon / Awesome-Speaker-DiarizationLinks
Some comprehensive papers about speaker diarization
☆302Updated 3 months ago
Alternatives and similar repositories for Awesome-Speaker-Diarization
Users that are interested in Awesome-Speaker-Diarization are comparing it to the libraries listed below
Sorting:
- Variational Bayes HMM over x-vectors diarization☆275Updated last year
- End-to-End Neural Diarization☆406Updated 4 years ago
- Diarization scoring tools.☆256Updated 2 years ago
- Update ASR paper everyday☆303Updated this week
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆175Updated 9 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆140Updated this week
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆108Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆144Updated 3 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆427Updated 3 weeks ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆680Updated 8 months ago
- UT-Sarulab MOS prediction system using SSL models☆256Updated last year
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆374Updated last year
- Speaker embedding (d-vector) trained with GE2E loss☆284Updated last year
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆416Updated 3 months ago
- UniSpeech - Large Scale Self-Supervised Learning for Speech☆468Updated last year
- Target Speaker Extraction Toolkit☆192Updated last month
- Official repository of SepReformer for speech separation☆216Updated 7 months ago
- Easy-to-Use Speech MOS predictors☆311Updated last year
- see README☆356Updated last month
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆88Updated last year
- Conformer-based Metric GAN for speech enhancement☆376Updated last year
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆148Updated 2 years ago
- ☆86Updated 4 months ago
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆91Updated 5 months ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …