Paper: https://arxiv.org/abs/1702.02285
☆65Dec 19, 2018Updated 7 years ago
Alternatives and similar repositories for speaker-change-detection
Users that are interested in speaker-change-detection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks☆65Jul 14, 2020Updated 5 years ago
- Online streaming speaker change detection model in Pytorch☆43Apr 14, 2023Updated 2 years ago
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆498Jul 1, 2021Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆58May 26, 2025Updated 10 months ago
- ☆14Aug 9, 2018Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Nov 5, 2021Updated 4 years ago
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Apr 15, 2020Updated 5 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- SNAIL Attention Block for Keras.☆17Mar 30, 2020Updated 5 years ago
- Speaker diarization scripts, based on AaltoASR☆191Jan 3, 2019Updated 7 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,854Jul 22, 2025Updated 8 months ago
- A curated list of awesome Voiceprint Recognition papers☆19Jul 9, 2021Updated 4 years ago
- End-to-End Neural Diarization☆423Aug 30, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.☆546Sep 25, 2024Updated last year
- Discriminative Neural Clustering for Speaker Diarisation☆79Apr 8, 2022Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- Probabilistic Linear Discriminant Analysis & classification, written in Python.☆130Mar 28, 2022Updated 4 years ago
- File repository for the course [Advanced Deep Learning with Keras]. Packt Publishing.☆29Feb 26, 2018Updated 8 years ago
- ☆16Mar 7, 2019Updated 7 years ago
- ☆52Oct 17, 2023Updated 2 years ago
- A deep neural network for finding text-independent speaker embedding written in tensorflow and tensorpack☆10Feb 19, 2018Updated 8 years ago
- Keras implementation of SDE-Net (ICML 2020).☆16Sep 11, 2020Updated 5 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data☆95Jul 6, 2023Updated 2 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 6 years ago
- Emotion_Voice_Recognition_Chainer☆31Jan 26, 2016Updated 10 years ago
- Wav2kws is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Google Speech Commands datasets V1 and V2.☆13Jun 11, 2021Updated 4 years ago
- Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…☆64Jan 8, 2021Updated 5 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 8 years ago
- ☆16Feb 19, 2026Updated last month
- Speech separation with utterance-level PIT experiments☆106Jul 12, 2018Updated 7 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆445Aug 12, 2025Updated 7 months ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆21Sep 24, 2018Updated 7 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Feb 25, 2019Updated 7 years ago
- ☆22Mar 22, 2017Updated 9 years ago
- This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…☆1,588Sep 25, 2024Updated last year
- CMU multilingual speech repository☆30Apr 15, 2022Updated 3 years ago
- Constrained Permutation Invariant Training, Speech Separation☆52Jan 24, 2021Updated 5 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆21Jan 24, 2022Updated 4 years ago