mirix/approaches-to-diarisation

A testing repo to share code and thoughts on diarisation

☆58

Alternatives and similar repositories for approaches-to-diarisation

Users who are interested in approaches-to-diarisation are comparing it to the repositories listed below.

Jose-Sabater / whisper-pyannote
Whisper from OpenAi and diarization with Pyannote
☆52 · Jan 7, 2024 · Updated 2 years ago
apple / ml-acn-embed
Acoustic Neighbor Embeddings
☆33 · Jul 13, 2025 · Updated last year
NavodPeiris / speechlib
Speechlib is a library that unifies speaker diarization, transcription and speaker recognition in a single pipeline to create transcripts…
☆266 · Apr 19, 2026 · Updated 3 months ago
WhissleAI / PromptingNemo
All-in-one Speech Transcription
☆11 · Jun 5, 2026 · Updated last month
bagustris / s3prl-ser
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15 · Feb 28, 2026 · Updated 5 months ago
MuSAELab / AUDDT
A toolkit for benchmarking on a wide variety of audio deepfake datasets.
☆36 · May 22, 2026 · Updated 2 months ago
hedrergudene / asr-sd-pipeline
Speech recognition & diarisation solution with text alignment, deployed in AML pipelines
☆102 · May 7, 2024 · Updated 2 years ago
jianfch / stable-ts
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
☆2,282 · May 30, 2026 · Updated last month
fabianoluzbr / neural-g2p-portuguese
Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es…
☆19 · Jun 14, 2021 · Updated 5 years ago
zacharyhorvitz / ParaGuide
Code for "ParaGuide: Guided Diffusion Paraphrasers for Plug-and-Play Textual Style Transfer"
☆16 · Jul 17, 2024 · Updated 2 years ago
ghcli / gh-commit
Artfully create commit messages that reflect the essence of your code changes. Craftsmanship for your commits.
☆21 · Jun 8, 2025 · Updated last year
hbwu-ntu / EmoCtrlTTS-Eval
☆19 · Aug 23, 2024 · Updated last year
v-nhandt21 / MusicVoiceConversion
Sing any popular song with your voice
☆11 · Jul 10, 2022 · Updated 4 years ago
ikegami-yukino / sengiri
Yet another sentence-level tokenizer for the Japanese text
☆24 · Nov 27, 2025 · Updated 8 months ago
cvqluu / simple_diarizer
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆158 · May 2, 2024 · Updated 2 years ago
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18 · May 17, 2024 · Updated 2 years ago
pengzhendong / asr-decoder
CTC decoder with hotwords for ASR.
☆38 · Jun 15, 2026 · Updated last month
qurator-spk / sbb_ocr_postcorrection
Two-Step Approach to OCR Post-Correction
☆14 · May 24, 2024 · Updated 2 years ago
alefiury / SE-R-2022-SER-Track
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16 · Mar 28, 2023 · Updated 3 years ago
v-nhandt21 / ViMFA
Montreal Forced Aligner for Vietnamese
☆15 · Oct 23, 2023 · Updated 2 years ago
Wikidepia / g2p-id
Indonesian Grapheme-to-Phoneme (IPA notation)
☆42 · Feb 21, 2026 · Updated 5 months ago
RomanKlimov / faster-whisper-acceleration
Accelerating faster-whisper single file processing by multiprocessing through parallelization
☆57 · Apr 18, 2023 · Updated 3 years ago
geekodour / wscribe-editor
web based editor for subtitles and transcripts
☆147 · Aug 16, 2024 · Updated last year
daanzu / py-silero-vad-lite
Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies
☆17 · Nov 25, 2024 · Updated last year
astaileyyoung / CineFace
☆30 · Jul 10, 2026 · Updated 2 weeks ago
nhattruongpham / mmser
SERVER: Multi-modal Speech Emotion Recognition using Transformer-based and Vision-based Embeddings
☆15 · Jan 23, 2024 · Updated 2 years ago
sholokhovalexey / online-speaker-clustering
[ICASSP'23] Online speaker clustering
☆18 · Feb 22, 2026 · Updated 5 months ago
JaesungHuh / SimpleDiarization
Simple diarization model
☆53 · Jun 13, 2025 · Updated last year
LBBNetwork / libsandwich
☆15 · Aug 11, 2012 · Updated 13 years ago
HydroXai / Enhancing-Safety-in-Large-Language-Models
Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…
☆12 · Nov 26, 2024 · Updated last year
MahmoudAshraf97 / whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,614 · Feb 23, 2026 · Updated 5 months ago
eXascaleInfolab / fashion_nlp_v2
FashionBrain D2.1: Named Entity Recognition and Linking Methods
☆11 · Jun 26, 2019 · Updated 7 years ago
shreyas253 / SylNet
SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech
☆27 · May 25, 2023 · Updated 3 years ago
r-dh / dutch-vl-tts
Free Dutch voice dataset
☆13 · Jan 28, 2021 · Updated 5 years ago
leriel / automated-ipa-decrypt
Fully automated ipa decrypt (requires mac and connected jailbroken ios device)
☆13 · Apr 23, 2022 · Updated 4 years ago
MisterCapi / auto_dataset_tts
A simple script to prepare dataset for training with TTS Tortoise model via https://git.ecker.tech/mrq/ai-voice-cloning
☆12 · Jan 12, 2024 · Updated 2 years ago
philgzl / brever
Speech enhancement in noisy and reverberant environments using deep neural networks
☆23 · Oct 10, 2025 · Updated 9 months ago
GuldenizBektas / paper-abstract-classifier
☆10 · Jul 20, 2023 · Updated 3 years ago
guesswh0 / face_engine
Facial recognition engine
☆10 · Jul 12, 2026 · Updated 2 weeks ago