yinruiqing / change_detectionView external linksLinks
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
☆65Jul 14, 2020Updated 5 years ago
Alternatives and similar repositories for change_detection
Users that are interested in change_detection are comparing it to the libraries listed below
Sorting:
- Paper: https://arxiv.org/abs/1702.02285☆65Dec 19, 2018Updated 7 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆242Dec 16, 2025Updated last month
- Discriminative Neural Clustering for Speaker Diarisation☆79Apr 8, 2022Updated 3 years ago
- Android Application to perform Speaker Diarization☆24Mar 28, 2021Updated 4 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Feb 25, 2019Updated 6 years ago
- Speaker diarization scripts, based on AaltoASR☆191Jan 3, 2019Updated 7 years ago
- Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.☆545Sep 25, 2024Updated last year
- speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition☆498Jul 1, 2021Updated 4 years ago
- FEERCI: A Package for Fast non-parametric confidence intervals for Equal Error Rates☆12Mar 13, 2024Updated last year
- Punctuation generation for speech transcripts using lexical and prosodic features☆42Mar 5, 2019Updated 6 years ago
- PyTorch implementation of RPNSD☆60Jun 17, 2024Updated last year
- ☆14Aug 9, 2018Updated 7 years ago
- Diarization scoring tools.☆263Mar 28, 2023Updated 2 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output.☆55Dec 8, 2022Updated 3 years ago
- Speaker diarization via transfer learning☆27Mar 27, 2019Updated 6 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding☆122Jul 6, 2017Updated 8 years ago
- Online streaming speaker change detection model in Pytorch☆44Apr 14, 2023Updated 2 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- An implementation of DTW for spoken term detection. Including non-constrained, segmental DTW, slope-constrained versions. For more detail…☆15Jun 2, 2019Updated 6 years ago
- End-to-End Neural Diarization☆421Aug 30, 2021Updated 4 years ago
- Barista is an open-source framework for concurrent speech processing.☆36Mar 19, 2014Updated 11 years ago
- Audio-visual diarization pipeline used for creating VoxConverse dataset☆21Jun 6, 2025Updated 8 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- ☆22Jun 30, 2021Updated 4 years ago
- wake word spotting with kaldi☆19Dec 3, 2020Updated 5 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,844Jul 22, 2025Updated 6 months ago
- Scripts for training general-purpose large vocabulary German acoustic models for ASR with Kaldi.☆175Aug 9, 2023Updated 2 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- A real-time document recommendation system for speech streams☆19Jul 11, 2018Updated 7 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Apr 15, 2020Updated 5 years ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- An Open Source Tools for Speaker Recognition☆634Aug 5, 2024Updated last year
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago