Simple diarization model
☆53 · Jun 13, 2025 · Updated 9 months ago
Alternatives and similar repositories for SimpleDiarization
Users who are interested in SimpleDiarization are comparing it to the libraries listed below
- Evaluation script for VoxMovies dataset in PyTorch ☆23 · Jan 12, 2024 · Updated 2 years ago
- [CVPR'23 Highlight] AutoAD: Movie Description in Context. ☆103 · Nov 6, 2024 · Updated last year
- ☆59 · Mar 28, 2025 · Updated 11 months ago
- VoxSRC2022 workshop development kit ☆19 · Jul 21, 2022 · Updated 3 years ago
- A neural speech codec based on discrete WavLM representations ☆25 · Aug 28, 2024 · Updated last year
- Learnable STRF, from Riad et al. 2021 JASA ☆13 · Aug 21, 2021 · Updated 4 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆71 · Mar 5, 2026 · Updated 2 weeks ago
- [ACCV 2024] Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, M… ☆29 · Jan 28, 2025 · Updated last year
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆36 · Sep 9, 2025 · Updated 6 months ago
- Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio) ☆15 · Oct 22, 2025 · Updated 4 months ago
- ONNX-compatible StyleTTS2 code ☆17 · Feb 28, 2026 · Updated 2 weeks ago
- ☆42 · Jan 22, 2024 · Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆105 · Jan 10, 2025 · Updated last year
- Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns ☆17 · Nov 15, 2022 · Updated 3 years ago
- Openfst mirror with some fixes ☆15 · Aug 23, 2024 · Updated last year
- ☆13 · Sep 12, 2024 · Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆93 · Oct 18, 2023 · Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 · Jul 6, 2022 · Updated 3 years ago
- Python package for combining diarization system outputs. ☆92 · Oct 12, 2023 · Updated 2 years ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model" ☆15 · Jun 28, 2024 · Updated last year
- StyleTTS2 + Vocos as a Decoder ☆13 · Mar 24, 2025 · Updated 11 months ago
- ☆30 · Jan 22, 2026 · Updated last month
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- ☆94 · Aug 2, 2023 · Updated 2 years ago
- ☆12 · Nov 7, 2024 · Updated last year
- The implementation of "End-to-End Neural Speaker Diarization with an Iterative Adaptive Attractor Estimation", which is accepted by Neura… ☆11 · Aug 27, 2023 · Updated 2 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook ☆26 · Jun 12, 2023 · Updated 2 years ago
- ☆52 · Oct 17, 2023 · Updated 2 years ago
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat… ☆33 · Jun 14, 2024 · Updated last year
- Behavioral probing of language acquisition models at the lexical and syntactic level ☆18 · Jul 17, 2023 · Updated 2 years ago
- A simple command line tool to calculate WER for ASR. ☆14 · Oct 14, 2024 · Updated last year
- Predicts the level of noise and reverberation on your audiofiles ☆179 · Jun 17, 2025 · Updated 9 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆78 · Oct 18, 2022 · Updated 3 years ago
- Implementation of Transfer Learning from Speaker Verification to Multi-speaker Text-To-Speech Synthesis (SV2TTS) in Persian language. ☆13 · Oct 2, 2025 · Updated 5 months ago
- ☆21 · Jul 15, 2024 · Updated last year
- ☆21 · Mar 7, 2025 · Updated last year
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one … ☆40 · Mar 13, 2024 · Updated 2 years ago
- Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion ☆25 · Mar 16, 2023 · Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆155 · May 2, 2024 · Updated last year