bbc / bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
☆27Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for bbc-speech-segmenter
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated 2 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- ☆102Updated 3 years ago
- Clustering-based methods for overlapping diarization☆70Updated 10 months ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆53Updated last year
- Constrained Permutation Invariant Training, Speech Separation☆43Updated 3 years ago
- ☆55Updated 3 years ago
- Multi-Task Audio Source Separation, Two-Stage Model, Complex Domain.☆89Updated last year
- A list of papers for child ASR☆26Updated last month
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆46Updated 6 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆44Updated 4 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆58Updated 2 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆44Updated last week
- Unofficial implementation of miipher☆112Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆71Updated last year
- ☆25Updated 3 months ago
- Multistream CNN for Robust Acoustic Modeling☆39Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 4 months ago
- neural network based speaker embedder☆25Updated last year
- streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆58Updated 2 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model☆34Updated 4 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆99Updated last year
- A sequence-to-sequence voice conversion toolkit.☆86Updated 4 months ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- ☆17Updated 3 months ago
- A simple package for Guided source separation (GSS)☆107Updated 6 months ago
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆25Updated 3 years ago
- ☆40Updated 2 years ago
- An effort to track benchmarking results over widely-used datasets for ASR.☆44Updated 2 years ago