liutaocode/AwesomeDiarizationDataset

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/liutaocode/AwesomeDiarizationDataset)

liutaocode / AwesomeDiarizationDataset

Both audio-only and audio-visual speaker diarization datasets are listed here.

☆16

Alternatives and similar repositories for AwesomeDiarizationDataset

Users that are interested in AwesomeDiarizationDataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

liutaocode / DiarizationVisualization
View on GitHub
Visualization tools for audio-only and multi-modal speaker diarization dataset
☆13Oct 27, 2023Updated 2 years ago
SSTC-Challenge / SSTC2024_baseline_system
View on GitHub
☆12Jun 14, 2024Updated 2 years ago
Audio-WestlakeU / Mel-McNet
View on GitHub
The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]
☆26May 14, 2026Updated 2 months ago
Maokui-He / NSD-MA-MSE
View on GitHub
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
☆62Sep 19, 2024Updated last year
Audio-WestlakeU / UMA-ASR
View on GitHub
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
☆35Dec 17, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
lavendery / UUG
View on GitHub
☆21Sep 14, 2025Updated 10 months ago
yucongzh / online_speaker_diarization
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
yangdongchao / Omni-AutoThink
View on GitHub
Adaptive Multimodal Reasoning via Reinforcement Learning
☆23Jan 11, 2026Updated 6 months ago
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
Audio-WestlakeU / SAR-SSL
View on GitHub
A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…
☆40Oct 11, 2024Updated last year
BUTSpeechFIT / diacorrect
View on GitHub
Error correction back-end for speaker diarization
☆18Sep 26, 2023Updated 2 years ago
HHousen / speaker-change-detection
View on GitHub
Speaker change detection using SincNet and an LSTM/Transformer
☆57May 26, 2025Updated last year
Xflick / EEND_PyTorch
View on GitHub
A PyTorch implementation of End-to-End Neural Diarization
☆110Jun 19, 2023Updated 3 years ago
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
gxu82 / MVDR-Speech-Enhancement
View on GitHub
☆16Jul 14, 2020Updated 6 years ago
itsnotacie / AAAI-26_SepPrune
View on GitHub
SepPrune: Structured Pruning for Efficient Deep Speech Separation-AAAI'26
☆15May 31, 2025Updated last year
dodohow1011 / TS-VAD
View on GitHub
☆55Jan 15, 2021Updated 5 years ago
cpystan / PSM
View on GitHub
Exploring Unsupervised Cell Recognition with Prior Self-activation Maps (MICCAI 2023)
☆13Oct 27, 2023Updated 2 years ago
BUTSpeechFIT / DVBx
View on GitHub
Discriminative Training of VBx Diarization
☆28Sep 23, 2024Updated last year
audiosae / audio-sae
View on GitHub
Demo for AudioSAE paper
☆15Apr 26, 2026Updated 2 months ago
kjw11 / Speaker-Aware-CTC
View on GitHub
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22May 26, 2025Updated last year
ASLP-lab / Smart-Glass-Challenge
View on GitHub
☆17Jun 16, 2026Updated last month
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
IiuZiKai / Evo_TSE
View on GitHub
☆17Apr 9, 2026Updated 3 months ago
VoxBlink / ScriptsForVoxBlink
View on GitHub
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆30Apr 16, 2024Updated 2 years ago
AudenAI / Auden
View on GitHub
☆71Apr 2, 2026Updated 3 months ago
yufan-aslp / AliMeeting
View on GitHub
The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…
☆142Jun 10, 2022Updated 4 years ago
nryant / dscore
View on GitHub
Diarization scoring tools.
☆267Apr 8, 2026Updated 3 months ago
liutaocode / LivePortrait-Train
View on GitHub
Unoffical LivePortrait Training Script [ 🚧 Under Construction]
☆40Jan 28, 2025Updated last year
LuluW8071 / Automatic-Speech-Recognition-with-PyTorch
View on GitHub
Real-Time ASR with CNN-BiLSTM: End-to-End Live Streaming Using PyTorch Lightning⚡
☆11Jan 23, 2025Updated last year
idiap / bert-text-diarization-atc
View on GitHub
This is a repository for a paper accepted at the 2022 IEEE Spoken Language Technology Workshop (SLT 2022)
☆17Dec 1, 2022Updated 3 years ago
FrenchKrab / datasets-pyannote
View on GitHub
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
☆15Oct 22, 2025Updated 9 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
BUTSpeechFIT / EEND
View on GitHub
☆95Apr 24, 2025Updated last year
malradhi / PACodec
View on GitHub
[ICASSP 2026]Official code for "Prosody-Guided Harmonic Attention for Phase-Coherent Neural Vocoding in the Complex Spectrum"
☆27Jan 22, 2026Updated 6 months ago
Hunterhuan / sphereface2_speaker_verification
View on GitHub
Exploring Binary Classification Loss for Speaker Verification
☆18Jul 18, 2023Updated 3 years ago
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
Audio-WestlakeU / FN-SSL
View on GitHub
The Official PyTorch Implementation of FN-SSL & IPDnet for Sound Source Localization [INTERSPEECH2023 & TASLP2024]
☆159Mar 10, 2026Updated 4 months ago
laitselec / MuFun
View on GitHub
☆37Aug 31, 2025Updated 10 months ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
View on GitHub
☆12Jun 14, 2022Updated 4 years ago