bbc / bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
☆28Updated 9 months ago
Alternatives and similar repositories for bbc-speech-segmenter:
Users that are interested in bbc-speech-segmenter are comparing it to the libraries listed below
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- ☆52Updated last year
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆68Updated 6 months ago
- Multistream CNN for Robust Acoustic Modeling☆40Updated 3 years ago
- ☆22Updated 3 years ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 8 months ago
- Clustering-based methods for overlapping diarization☆77Updated last year
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- ☆25Updated last month
- An unofficial implementation of the Personal VAD speaker-conditioned voice activity detection method. Bachelor's thesis project.☆65Updated 2 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆66Updated 5 years ago
- video cut powered by AI☆25Updated 2 years ago
- A simple package for Guided source separation (GSS)☆117Updated 10 months ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- ☆61Updated last year
- Transcribing Speech with Multinomial Diffusion, training code and models.☆76Updated last year
- Python package for combining diarization system outputs.☆87Updated last year
- ☆19Updated last year
- Colab notebooks for Next-gen Kaldi☆26Updated last month
- ☆31Updated 11 months ago
- ☆33Updated 3 years ago
- ☆56Updated 2 years ago
- Flask-based web framework for visualisation and explorative listening of audio.☆53Updated last year
- ☆35Updated 2 weeks ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆70Updated 2 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago