meronym/speaker-diarization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/meronym/speaker-diarization)

meronym / speaker-diarization

Speaker diarization model

☆31

Alternatives and similar repositories for speaker-diarization

Users that are interested in speaker-diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

meronym / speaker-transcription
View on GitHub
Transcription with speaker diarization pipeline
☆101Apr 27, 2023Updated 3 years ago
MuSAELab / AUDDT
View on GitHub
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
☆35May 22, 2026Updated 2 months ago
RemiRigal / snreval-python
View on GitHub
This repository provides a small Python wrapper for the Matlab tool SNR Eval provided by Labrosa: https://labrosa.ee.columbia.edu/project…
☆12Jun 22, 2022Updated 4 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
backspacetg / distilXLSR
View on GitHub
Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model
☆13Mar 30, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
isjwdu / DFADD
View on GitHub
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16Apr 7, 2025Updated last year
Yaselley / SSL_Layerwise_Deepfake
View on GitHub
SSL Layerwise analysis for speech deepfake detection
☆36Aug 5, 2025Updated 11 months ago
chaufanglin / Normal2Whisper
View on GitHub
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14Oct 31, 2024Updated last year
resemble-ai / normalise
View on GitHub
A module for normalising text.
☆10Nov 6, 2019Updated 6 years ago
srvk / srvk_education
View on GitHub
Educational tutorials for speech and language processing classes
☆12Jan 8, 2019Updated 7 years ago
ETZET / SpeechEmotionAVLearning
View on GitHub
☆13Nov 25, 2023Updated 2 years ago
tbdsux / koyo
View on GitHub
Website screenshot service api on Deta Space
☆13Jun 6, 2023Updated 3 years ago
REAL-Lab-NU / Awesome-OpenClaw-Papers
View on GitHub
Official companion repository for our survey "A Survey of the OpenClaw Ecosystem: From Platform Extensibility to Constraint Design" — a c…
☆19May 31, 2026Updated last month
WangHelin1997 / AT-GCN
View on GitHub
Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network
☆15Sep 18, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hbwu-ntu / EmoCtrlTTS-Eval
View on GitHub
☆19Aug 23, 2024Updated last year
hieuthi / LlamaPartialSpoof
View on GitHub
A fully and partially fake speech dataset for evaluation
☆15Nov 11, 2025Updated 8 months ago
oscarknagg / raw-audio-gender-classification
View on GitHub
Machine learning experiment to perform gender classification from raw audio.
☆23Sep 1, 2018Updated 7 years ago
myguidingstar-zz / vie-hts
View on GitHub
Vietnamese Human-based Text-to-Speech
☆13Sep 9, 2012Updated 13 years ago
stuicey / wsProxy
View on GitHub
A websocket to tcp proxy, written in node.js, ment for roBrowser users, but can be used for other purposes.
☆15Jun 8, 2017Updated 9 years ago
GeorgeDavila / Renify
View on GitHub
☆12Jul 6, 2021Updated 5 years ago
RimoChan / arxiv-translate-fix
View on GitHub
arxiv翻译修复器！
☆22Nov 13, 2024Updated last year
falabrasil / gitlab-resources
View on GitHub
This is a legacy repo. Dev occurs now on GitHub.
☆11Mar 28, 2021Updated 5 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
heraclex12 / vietpunc
View on GitHub
Vietnamese Punctuation Prediction using Pretrained Language Models
☆14May 8, 2022Updated 4 years ago
winkjs / showcase-wiz
View on GitHub
🧙🏽‍♂️ Visualize wink-nlp's features
☆11Dec 9, 2020Updated 5 years ago
yoyolicoris / eva
View on GitHub
A screaming vocal samples dataset.
☆13Apr 14, 2023Updated 3 years ago
InbalRim / A-Study-On-Data-Augmentation-In-Voice-Anti-Spoofing
View on GitHub
☆10Jul 27, 2021Updated 5 years ago
shangwei5 / STGTN
View on GitHub
Aggregating Long-term Sharp Features via Hybrid Transformers for Video Deblurring
☆13Feb 14, 2025Updated last year
cuichenrui2000 / barry_speech_tools
View on GitHub
This repository documents Barry's journey in learning deep learning for speech processing. Here, you'll find scripts and code snippets re…
☆13Oct 8, 2025Updated 9 months ago
lucataco / cog-ip_adapter-sdxl-face
View on GitHub
Attempt at cog wrapper for IP_Adapter-face for SDXL
☆15Nov 25, 2024Updated last year
fofr / audio-to-waveform
View on GitHub
Convert an audio file to a waveform video
☆11Nov 10, 2023Updated 2 years ago
HudZah / HowSafeIsSF
View on GitHub
How Safe is SF?
☆10Aug 20, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
stathwang / POS-Taggers
View on GitHub
Part-of-Speech Tagging Models in Python
☆16Oct 7, 2019Updated 6 years ago
SlumberDemon / EasyCdn
View on GitHub
🚀 Host your own cdn in seconds
☆11Feb 28, 2023Updated 3 years ago
Sreyan88 / CompA
View on GitHub
Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
☆23Jul 10, 2024Updated 2 years ago
sol-prog / opencv-video-editing
View on GitHub
Code for "OpenCV video editing tutorial"
☆14Apr 21, 2018Updated 8 years ago
CircuitMess / Chatter-Firmware
View on GitHub
☆12Aug 4, 2025Updated 11 months ago
kuco23 / pokerlib
View on GitHub
Python poker library
☆14Sep 9, 2023Updated 2 years ago
jnwnlee / selva
View on GitHub
[CVPR 2026] Official PyTorch implementation of SelVA "Hear What Matters! Text-conditioned Selective Video-to-Audio Generation"
☆15Mar 27, 2026Updated 4 months ago