pyannote/pyannote-pipeline

Tunable pipelines

☆41

Alternatives and similar repositories for pyannote-pipeline

Users who are interested in pyannote-pipeline are comparing it to the libraries listed below.

pyannote / pyannote-core
Advanced data structures for handling temporal segments with attached labels.
☆124 · Sep 16, 2025 · Updated 10 months ago
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆96 · Oct 18, 2023 · Updated 2 years ago
Mu-Y / DiariST
☆18 · Sep 19, 2023 · Updated 2 years ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆119 · Mar 1, 2026 · Updated 4 months ago
Podcastindex-org / podping
A global message bus for podcast feed events.
☆18 · Jan 11, 2023 · Updated 3 years ago
FrenchKrab / datasets-pyannote
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
☆15 · Oct 22, 2025 · Updated 8 months ago
VoxBlink / ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆30 · Apr 16, 2024 · Updated 2 years ago
hbredin / pyannotebook
🎹 pyannote + 🗒 notebook = pyannotebook
☆27 · Jun 12, 2023 · Updated 3 years ago
pkufool / simple-wer
A simple command line tool to calculate WER for ASR.
☆14 · Oct 14, 2024 · Updated last year
OpenLLM-France / Lit-Claire
Continual pretraining of foundation LLM using ⚡ Lightning Fabric
☆37 · Nov 27, 2024 · Updated last year
fireredchat-submodules / livekit-plugins-fireredchat-pvad
FireRedChat pVAD plugin for LiveKit Agents
☆22 · Sep 16, 2025 · Updated 10 months ago
popcornell / FastMSS
☆32 · May 18, 2026 · Updated 2 months ago
pyannote / hf-speaker-diarization-3.1
Mirror of hf.co/pyannote/speaker-diarization-3.1
☆33 · Jan 7, 2024 · Updated 2 years ago
pyannote / pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
☆252 · May 19, 2026 · Updated 2 months ago
fgnt / mms_msg
Multipurpose Multi Speaker Mixture Signal Generator
☆46 · Feb 6, 2025 · Updated last year
isjwdu / DFADD
Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset
☆16 · Apr 7, 2025 · Updated last year
yucongzh / online_speaker_diarization
☆15 · Jul 11, 2022 · Updated 4 years ago
tango4j / Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆125 · Apr 8, 2022 · Updated 4 years ago
BUTSpeechFIT / EEND
☆95 · Apr 24, 2025 · Updated last year
Speech-Lab-IITM / CCC-wav2vec-2.0
Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…
☆23 · Mar 18, 2024 · Updated 2 years ago
juanmc2005 / rttm-viewer
Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
☆48 · Apr 19, 2023 · Updated 3 years ago
alefiury / SE-R-2022-SER-Track
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16 · Mar 28, 2023 · Updated 3 years ago
shashikg / X-Vector-Based-Speaker-Diarization
Course project for EE698R (2020-21 Sem 2). An X-Vector Based Speaker Diarization System with AutoEncoder based clustering method. Also su…
☆16 · Jun 2, 2021 · Updated 5 years ago
Podcastindex-org / podping-hivewatcher
A watcher script for the hive backed podping network.
☆17 · Jun 18, 2023 · Updated 3 years ago
the-astrosky-ecosystem / astronomy-feeds
Repo of the Astronomy feeds on Bluesky.
☆18 · Jul 3, 2026 · Updated 2 weeks ago
ahmedshah1494 / speech_robust_bench
☆18 · Apr 24, 2025 · Updated last year
BUTSpeechFIT / EEND_dataprep
☆59 · Mar 28, 2025 · Updated last year
qiuqiangkong / materials_for_students
☆16 · Aug 10, 2025 · Updated 11 months ago
desh2608 / dover-lap
Python package for combining diarization system outputs.
☆94 · Oct 12, 2023 · Updated 2 years ago
andimarafioti / nano-parakeet
Pure-PyTorch Parakeet TDT inference
☆49 · Mar 10, 2026 · Updated 4 months ago
BUTSpeechFIT / DeCRED
☆18 · Aug 13, 2025 · Updated 11 months ago
Deep-unlearning / Finetune-Parakeet
☆25 · Oct 22, 2025 · Updated 8 months ago
pengzhendong / audio-pipeline
☆23 · Oct 17, 2024 · Updated last year
kohei0209 / self-remixing
Official implementation of Self-Remixing
☆18 · Feb 3, 2024 · Updated 2 years ago
fgnt / graph_pit
☆42 · Oct 14, 2022 · Updated 3 years ago
cpdu / vallt
☆36 · Mar 14, 2025 · Updated last year
ddlBoJack / MT4SSL
[INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…
☆45 · Mar 25, 2024 · Updated 2 years ago
DongKeon / Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
☆367 · Mar 24, 2026 · Updated 3 months ago
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆42 · Aug 11, 2023 · Updated 2 years ago