dr-pato/SSGD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dr-pato/SSGD)

dr-pato / SSGD

Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"

☆15

Alternatives and similar repositories for SSGD

Users that are interested in SSGD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

yangyi0818 / DPARNet
View on GitHub
Dual-Path Attention and Recurrent Network for speech separation
☆22Sep 12, 2024Updated last year
Okrio / FSPEN
View on GitHub
☆21Apr 27, 2024Updated 2 years ago
vkothapally / JAECBF
View on GitHub
☆62Apr 11, 2022Updated 4 years ago
Le-Xiaohuai-speech / SKIP-DPCRN
View on GitHub
☆51Jun 14, 2022Updated 4 years ago
RusselZHANG / Microphone-Array-Generalization-for-Multichannel-Narrowband-Deep-Speech-Enhancement
View on GitHub
This is the microphone array generalization investigation based on previous Narrow Band Deep Filtering methods.
☆38Mar 12, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yucongzh / online_speaker_diarization
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
desh2608 / gss
View on GitHub
A simple package for Guided source separation (GSS)
☆134May 20, 2024Updated 2 years ago
ZhongshuHou / MHA-DPCRN
View on GitHub
We design a spectral compression mapping (SCM) for full-band speech enhancement, and propose a two-stage stream named MHA-DPCRN
☆24Jul 4, 2022Updated 4 years ago
Mo-yun / DSDPRNN
View on GitHub
Implementation of Dual-Stream DPRNN (paper: Nonlinear Residual Echo Suppression Based on Dual-Stream DPRNN)
☆21Jul 15, 2021Updated 5 years ago
yinruiqing / fsmn
View on GitHub
Feedforward Sequential Memory Networks
☆18Aug 2, 2022Updated 3 years ago
BUTSpeechFIT / cgmm_mvdr_online
View on GitHub
Implementation of CGMM-MVDR beamforming used for Clarity challenge
☆14Jan 14, 2022Updated 4 years ago
Andong-Li-speech / TaylorBeamformer
View on GitHub
The implementation of TaylorBeamformer, which is in submission to Interspeech2022
☆49Jun 10, 2022Updated 4 years ago
haoxiangsnr / spiking-fullsubnet
View on GitHub
Official repository of Spiking-FullSubNet, the Intel N-DNS Challenge Algorithmic Track Winner.
☆141Jan 28, 2026Updated 5 months ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
TuZehai / Sheffield_Clarity_CEC1_Entry
View on GitHub
Implementation of Sheffield entry for Clarity enhancement challenge.
☆18Apr 19, 2022Updated 4 years ago
jwr1995 / DTCN
View on GitHub
☆19Oct 26, 2023Updated 2 years ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
liyunlongaaa / NSD-MS2S
View on GitHub
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…
☆88Jun 17, 2025Updated last year
hmartelb / avlit
View on GitHub
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20Sep 1, 2023Updated 2 years ago
zaocan666 / DyViSE
View on GitHub
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 4 years ago
xmos / lib_agc
View on GitHub
Automatic gain control library
☆15Jul 13, 2024Updated 2 years ago
KunHanKH / GE2E_Speaker_Verification
View on GitHub
Most Complete Pytorch Imeplementation "GENERALIZED END-TO-END LOSS FOR SPEAKER VERIFICATION"
☆10Mar 11, 2020Updated 6 years ago
Maokui-He / NSD-MA-MSE
View on GitHub
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
☆62Sep 19, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Sreyan88 / RECAP
View on GitHub
Code for ICASSP 2024 Paper: RECAP: Retrieval-Augmented Audio Captioning
☆16Jun 23, 2024Updated 2 years ago
BUTSpeechFIT / EEND_dataprep
View on GitHub
☆59Mar 28, 2025Updated last year
xiaoxiaomiao323 / MSA
View on GitHub
☆16Feb 19, 2026Updated 5 months ago
qiuqiangkong / mini_llm
View on GitHub
☆29Jul 4, 2025Updated last year
lucacoma / NeuralBeamspaceDomainFilter
View on GitHub
Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…
☆19Oct 21, 2022Updated 3 years ago
spkgyk / RTFS-Net
View on GitHub
Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024
☆51Oct 14, 2025Updated 9 months ago
Andong-Li-speech / TaEr
View on GitHub
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14Nov 25, 2022Updated 3 years ago
joonaskalda / PixIT
View on GitHub
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…
☆105Jan 10, 2025Updated last year
mchijmma / modeling-nonlinear
View on GitHub
Modeling of nonlinear audio effects with end-to-end deep neural networks - website:
☆17May 11, 2020Updated 6 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
SpeechClub / CDER_Metric
View on GitHub
CDER (Conversational Diarization Error Rate) Scoring Tool
☆22Sep 13, 2022Updated 3 years ago
microsoft / NOTSOFAR1-Challenge
View on GitHub
NOTSOFAR-1 Challenge: Distant Diarization and ASR
☆65Feb 12, 2025Updated last year
dihardchallenge / dihard3_baseline
View on GitHub
☆30Jul 21, 2022Updated 4 years ago
HaoFengyuan / EEND-IAAE
View on GitHub
The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura…
☆11Aug 27, 2023Updated 2 years ago
Audio-WestlakeU / FS-EEND
View on GitHub
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183May 7, 2026Updated 2 months ago
Andong-Li-speech / G2Net
View on GitHub
The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP
☆19Apr 27, 2022Updated 4 years ago
NARUTO-2024 / WavBench
View on GitHub
WavBench: Benchmarking Reasoning, Colloquialism, and Paralinguistics for End-to-End Spoken Dialogue Models
☆34Feb 13, 2026Updated 5 months ago