bbc / bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
☆27Updated 8 months ago
Alternatives and similar repositories for bbc-speech-segmenter:
Users that are interested in bbc-speech-segmenter are comparing it to the libraries listed below
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- Constrained Permutation Invariant Training, Speech Separation☆46Updated 4 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- A curated list of awesome Speech Enhancement papers, libraries, datasets, and other resources.☆65Updated 5 years ago
- This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.☆82Updated 3 months ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 3 years ago
- Text frontend for ESPnet tts recipes☆31Updated 3 years ago
- a MUSHRA compliant web audio API based experiment software☆10Updated 3 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆73Updated 4 years ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated 8 months ago
- ☆33Updated 3 years ago
- ☆110Updated 3 years ago
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Support for Clarity Enhancement and Prediction Challenges (obsolete - see README)☆48Updated 2 years ago
- This is a curated list of awesome Speech Bandwidth Extension tutorials, papers, libraries, datasets, tools, scripts and results. The purp…☆64Updated 4 years ago
- ☆19Updated last year
- Clustering-based methods for overlapping diarization☆75Updated last year
- ☆22Updated 3 years ago
- Evaluation and Benchmarking of Speech Super-resolution Methods☆146Updated 2 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated last year
- TFGAN: Time and Frequency Domain Based Generative Adversarial Network for High-fidelity Speech Synthesis☆87Updated 3 years ago
- This code is to run the WARP-Q speech quality metric.☆34Updated 4 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆54Updated last year
- A simple package for Guided source separation (GSS)☆114Updated 9 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆49Updated 9 months ago
- ☆52Updated last year
- ☆17Updated 5 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Objective measures of speech quality SNR☆18Updated 5 years ago