BornInWater/Overlap-Detection

Overlapped Speech detection in Multi-party Conversations

☆22

Alternatives and similar repositories for Overlap-Detection

Users who are interested in Overlap-Detection are comparing it to the repositories listed below.

yinruiqing / diarization_with_neural_approach
☆14 · Aug 9, 2018 · Updated 7 years ago
iiscleap / DIHARD-2019-baseline
☆16 · Mar 7, 2019 · Updated 7 years ago
kjw11 / Speaker-Aware-CTC
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22 · May 26, 2025 · Updated last year
fgnt / pb_chime5
Speech enhancement system for the CHiME-5 dinner party scenario
☆110 · Feb 6, 2025 · Updated last year
staplesinLA / denoising_DIHARD18
☆60 · Sep 26, 2020 · Updated 5 years ago
lars76 / fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆18 · Aug 16, 2024 · Updated last year
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
uthree / ddsp-vocoder
☆12 · Nov 7, 2024 · Updated last year
iiscleap / DIHARD_2019_baseline_alltracks
☆38 · May 16, 2022 · Updated 4 years ago
popcornell / OSDC
☆18 · Jan 26, 2021 · Updated 5 years ago
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago
TuZehai / Sheffield_Clarity_CEC1_Entry
Implementation of Sheffield entry for Clarity enhancement challenge.
☆18 · Apr 19, 2022 · Updated 4 years ago
ybayle / ISM2017
Reproducible research code for the experiments presented in our article "Kara1k: a karaoke dataset for cover song identification and sing…
☆10 · Jan 9, 2018 · Updated 8 years ago
WangHelin1997 / Automatic_Speech_Annotator
Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…
☆33 · Jun 14, 2024 · Updated 2 years ago
Refefer / word2vec-scala
Scala port of the word2vec toolkit.
☆11 · Aug 15, 2016 · Updated 9 years ago
tuanio / nextformer
PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"
☆10 · Dec 15, 2022 · Updated 3 years ago
misskaseyann / acoustic-event-detection
Acoustic event detection using recurrent neural networks.
☆11 · Sep 4, 2018 · Updated 7 years ago
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆33 · Mar 16, 2023 · Updated 3 years ago
Searcher408 / DNN-Speech-Enhancement-Task
An Experimental Study on Speech Enhancement based on DNN.
☆14 · Aug 11, 2018 · Updated 7 years ago
iiscleap / NeuralPlda
Implementation of Neural PLDA (NPLDA) model (A discriminative backend for Speaker Verification)
☆99 · Apr 20, 2020 · Updated 6 years ago
PecholaL / MAIN-VC
Lightweight Speech Representation Learning for One-Shot Voice Conversion
☆23 · Dec 12, 2024 · Updated last year
idnavid / speech_activity_detection
Unsupervised speech activity detection system.
☆11 · Jul 2, 2018 · Updated 8 years ago
mavceleb / mavceleb_baseline
☆11 · Nov 5, 2025 · Updated 8 months ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for the GramVaani Hindi ASR challenge
☆16 · Mar 26, 2022 · Updated 4 years ago
kjw11 / CSEnet-ASR
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12 · Mar 14, 2025 · Updated last year
EMRAI / emrai-synthetic-diarization-corpus
☆22 · Sep 24, 2018 · Updated 7 years ago
7Xin / DPI-TTS
☆13 · Sep 12, 2024 · Updated last year
maxrmorrison / pypar
Phoneme alignment representation compatible with multiple forced aligners
☆22 · Apr 12, 2024 · Updated 2 years ago
WingZLeung / TTDS
Text-to-dysarthric speech (TTDS) synthesis. An implementation using the Grad-TTS model with the TORGO database.
☆13 · Mar 15, 2025 · Updated last year
david-ryan-snyder / kaldi
This is now the official location of the Kaldi project.
☆10 · Aug 22, 2019 · Updated 6 years ago
XiaoyuBIE1994 / SDCodec
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆48 · May 16, 2025 · Updated last year
prairie-schooner / wav2vec-vc
☆10 · Mar 22, 2023 · Updated 3 years ago
noajshu / scotus-speech
Corpus of oral arguments (recorded speech + official transcripts) of the United States Supreme Court
☆22 · Dec 8, 2022 · Updated 3 years ago
ifnspaml / Perceptual-Weighting-Filter-Loss
A perceptual weighting filter loss for DNN training in speech enhancement
☆24 · Apr 30, 2022 · Updated 4 years ago
bootphon / learnable-strf
Learnable STRF, from Riad et al. 2021 JASA
☆13 · Aug 21, 2021 · Updated 4 years ago
hbredin / TristouNet
TristouNet: Triplet Loss for Speaker Turn Embedding
☆121 · Jul 6, 2017 · Updated 9 years ago
shtoshni / g2p
Code for SLT 2016 paper on Grapheme-to-Phoneme conversion using attention based encoder-decoder models
☆15 · Feb 20, 2019 · Updated 7 years ago
kamperh / speech_dtw
Dynamic time warping (DTW) functions specifically for speech alignment.
☆30 · May 6, 2024 · Updated 2 years ago
3loi / NaturalVoices
☆61 · Oct 22, 2025 · Updated 8 months ago