philipperemy/speaker-change-detection

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/philipperemy/speaker-change-detection)

philipperemy / speaker-change-detection

Paper: https://arxiv.org/abs/1702.02285

☆64

Alternatives and similar repositories for speaker-change-detection

Users that are interested in speaker-change-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yinruiqing / change_detection
View on GitHub
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
☆67Jul 14, 2020Updated 6 years ago
alumae / online_speaker_change_detector
View on GitHub
Online streaming speaker change detection model in Pytorch
☆44Apr 14, 2023Updated 3 years ago
taylorlu / Speaker-Diarization
View on GitHub
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
☆501Jul 1, 2021Updated 5 years ago
HHousen / speaker-change-detection
View on GitHub
Speaker change detection using SincNet and an LSTM/Transformer
☆57May 26, 2025Updated last year
yinruiqing / diarization_with_neural_approach
View on GitHub
☆14Aug 9, 2018Updated 7 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
DonkeyShot21 / uis-rnn-sml
View on GitHub
A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)
☆61Apr 15, 2020Updated 6 years ago
JaesungHuh / VoxSRC2022
View on GitHub
VoxSRC2022 workshop development kit
☆19Jul 21, 2022Updated 4 years ago
wq2012 / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,886Jul 7, 2026Updated 3 weeks ago
philipperemy / keras-snail-attention
View on GitHub
SNAIL Attention Block for Keras.
☆17Mar 30, 2020Updated 6 years ago
aalto-speech / speaker-diarization
View on GitHub
Speaker diarization scripts, based on AaltoASR
☆191Jan 3, 2019Updated 7 years ago
nonday / awesome-voiceprint
View on GitHub
A curated list of awesome Voiceprint Recognition papers
☆19Jul 9, 2021Updated 5 years ago
hitachi-speech / EEND
View on GitHub
End-to-End Neural Diarization
☆435Aug 30, 2021Updated 4 years ago
FlorianKrey / DNC
View on GitHub
Discriminative Neural Clustering for Speaker Diarisation
☆79Apr 8, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
wq2012 / SpectralCluster
View on GitHub
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
☆553Sep 25, 2024Updated last year
cadia-lvl / punctuation-prediction
View on GitHub
Support tools for punctuation and boundary detection for ASR output.
☆55Dec 8, 2022Updated 3 years ago
philipperemy / advanced-deep-learning-keras
View on GitHub
File repository for the course [Advanced Deep Learning with Keras]. Packt Publishing.
☆29Feb 26, 2018Updated 8 years ago
RaviSoji / plda
View on GitHub
Probabilistic Linear Discriminant Analysis & classification, written in Python.
☆129Mar 28, 2022Updated 4 years ago
iiscleap / DIHARD-2019-baseline
View on GitHub
☆16Mar 7, 2019Updated 7 years ago
Akella17 / speaker-embedding
View on GitHub
A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack
☆10Feb 19, 2018Updated 8 years ago
Jamiroquai88 / VBDiarization
View on GitHub
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
☆95Jul 6, 2023Updated 3 years ago
BUTSpeechFIT / AMI-diarization-setup
View on GitHub
☆54Oct 17, 2023Updated 2 years ago
philipperemy / keras-sde-net
View on GitHub
Keras implementation of SDE-Net (ICML 2020).
☆16Sep 11, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
ondrejklejch / acoustic_punctuation
View on GitHub
NMT based punctuation prediction system using lexical and acoustic features .
☆14Mar 30, 2020Updated 6 years ago
phanxuanphucnd / wav2kws
View on GitHub
Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.
☆13Jun 11, 2021Updated 5 years ago
SnowMasaya / Emotion_Voice_Recognition_Chainer-
View on GitHub
Emotion_Voice_Recognition_Chainer
☆30Jan 26, 2016Updated 10 years ago
BornInWater / Overlap-Detection
View on GitHub
Overlapped Speech detection in Multi-party Conversations
☆22Feb 20, 2018Updated 8 years ago
funcwj / uPIT-for-speech-separation
View on GitHub
Speech separation with utterance-level PIT experiments
☆106Jul 12, 2018Updated 8 years ago
vishalshar / SpeakerDiarization_RNN_CNN_LSTM
View on GitHub
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…
☆64Jan 8, 2021Updated 5 years ago
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
kamperh / recipe_swbd_wordembeds
View on GitHub
☆22Mar 22, 2017Updated 9 years ago
EMRAI / emrai-synthetic-diarization-corpus
View on GitHub
☆22Sep 24, 2018Updated 7 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
google / speaker-id
View on GitHub
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆453Aug 12, 2025Updated 11 months ago
sarangzambare / hey-siri
View on GitHub
This repository is for wake-word detection in speech using recurrent neural networks
☆18Feb 25, 2019Updated 7 years ago
google / uis-rnn
View on GitHub
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…
☆1,588Sep 25, 2024Updated last year
xuchenglin28 / speech_separation
View on GitHub
Constrained Permutation Invariant Training, Speech Separation
☆52Jan 24, 2021Updated 5 years ago
wavlab-speech / cmu_multilingual_speech
View on GitHub
CMU multilingual speech repository
☆30Apr 15, 2022Updated 4 years ago
daanzu / kaldi_ag_training
View on GitHub
Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…
☆21Jan 24, 2022Updated 4 years ago
dodohow1011 / TS-VAD
View on GitHub
☆55Jan 15, 2021Updated 5 years ago