FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79Apr 8, 2022Updated 3 years ago
Alternatives and similar repositories for DNC
Users who are interested in DNC compare it to the repositories listed below
- ☆21Sep 24, 2018Updated 7 years ago
- Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.☆545Sep 25, 2024Updated last year
- End-to-End Neural Diarization☆421Aug 30, 2021Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Apr 15, 2020Updated 5 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- PyTorch implementation of RPNSD☆60Jun 17, 2024Updated last year
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Apr 13, 2022Updated 3 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆498Jul 1, 2021Updated 4 years ago
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- Diarization scoring tools.☆263Mar 28, 2023Updated 2 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Jul 14, 2020Updated 5 years ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Feb 20, 2020Updated 5 years ago
- ☆38May 16, 2022Updated 3 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆109Jun 19, 2023Updated 2 years ago
- Temporary anonymous version☆22Mar 20, 2024Updated last year
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- ☆33Nov 27, 2021Updated 4 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- Google's TPGST reimplementation.☆34Dec 11, 2019Updated 6 years ago
- Anonymous ICLR Submission☆14Sep 25, 2019Updated 6 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,844Jul 22, 2025Updated 6 months ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Feb 25, 2019Updated 6 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆46Feb 6, 2025Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- ☆16Mar 7, 2019Updated 6 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.☆595Jan 20, 2022Updated 4 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Oct 10, 2019Updated 6 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Mar 21, 2021Updated 4 years ago
- ☆22Jun 30, 2021Updated 4 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Jan 18, 2023Updated 3 years ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆125Apr 8, 2022Updated 3 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆242Dec 16, 2025Updated last month
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language☆43Feb 28, 2018Updated 7 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆96Jul 6, 2023Updated 2 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Source code of paper <End-to-End Language Diarization for Bilingual Code-switching Speech>☆19Jan 23, 2022Updated 4 years ago
- Tools for Speech Enhancement integrated with Kaldi☆427Jul 6, 2023Updated 2 years ago