philipperemy / speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
☆65 · Dec 19, 2018 · Updated 7 years ago
Alternatives and similar repositories for speaker-change-detection
Users that are interested in speaker-change-detection are comparing it to the libraries listed below
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks ☆65 · Jul 14, 2020 · Updated 5 years ago
- Online streaming speaker change detection model in Pytorch ☆44 · Apr 14, 2023 · Updated 2 years ago
- ☆11 · Nov 5, 2021 · Updated 4 years ago
- steps to perform text-based speaker diarization with kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition ☆498 · Jul 1, 2021 · Updated 4 years ago
- SNAIL Attention Block for Keras. ☆17 · Mar 30, 2020 · Updated 5 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 · Feb 25, 2019 · Updated 6 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆56 · May 26, 2025 · Updated 8 months ago
- ☆16 · Mar 7, 2019 · Updated 6 years ago
- Keras implementation of SDE-Net (ICML 2020). ☆16 · Sep 11, 2020 · Updated 5 years ago
- VoxSRC2022 workshop development kit ☆19 · Jul 21, 2022 · Updated 3 years ago
- wake word spotting with kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation ☆79 · Apr 8, 2022 · Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆55 · Dec 8, 2022 · Updated 3 years ago
- Overlapped Speech detection in Multi-party Conversations ☆22 · Feb 20, 2018 · Updated 7 years ago
- Speaker diarization scripts, based on AaltoASR ☆191 · Jan 3, 2019 · Updated 7 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi. ☆22 · Jul 12, 2019 · Updated 6 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma… ☆21 · Jan 24, 2022 · Updated 4 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆62 · Apr 15, 2020 · Updated 5 years ago
- End-to-End Neural Diarization ☆421 · Aug 30, 2021 · Updated 4 years ago
- ☆21 · Sep 24, 2018 · Updated 7 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. ☆1,844 · Jul 22, 2025 · Updated 6 months ago
- ☆13 · Oct 3, 2025 · Updated 4 months ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 4 years ago
- [ICASSP 2023] Tempo vs. Pitch: understanding self-supervised tempo estimation ☆13 · Aug 2, 2023 · Updated 2 years ago
- Code for "Error-driven Fixed-Budget ASR Personalization for Accented Speakers" in ICASSP 2021 ☆11 · Jun 13, 2021 · Updated 4 years ago
- Unsupervised speech activity detection system. ☆11 · Jul 2, 2018 · Updated 7 years ago
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition" ☆11 · Dec 15, 2022 · Updated 3 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat… ☆64 · Jan 8, 2021 · Updated 5 years ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi. ☆175 · Aug 9, 2023 · Updated 2 years ago
- Score calibration for speaker verification ☆26 · Dec 13, 2019 · Updated 6 years ago
- File repository for the course [Advanced Deep Learning with Keras]. Packt Publishing. ☆29 · Feb 26, 2018 · Updated 7 years ago
- Constrained Permutation Invariant Training, Speech Separation ☆52 · Jan 24, 2021 · Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge ☆15 · Mar 26, 2022 · Updated 3 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 · Feb 13, 2021 · Updated 5 years ago
- A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code… ☆10 · Dec 25, 2019 · Updated 6 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset. ☆28 · Apr 16, 2024 · Updated last year
- Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers. ☆545 · Sep 25, 2024 · Updated last year
- CMU multilingual speech repository ☆30 · Apr 15, 2022 · Updated 3 years ago