pyannote / pyannote-pipeline
Tunable pipelines
☆31Updated 2 weeks ago
Alternatives and similar repositories for pyannote-pipeline:
Users that are interested in pyannote-pipeline are comparing it to the libraries listed below
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆78Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database☆94Updated 2 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- ☆65Updated 2 months ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆109Updated last week
- ☆21Updated 5 months ago
- ☆57Updated 11 months ago
- Clustering-based methods for overlapping diarization☆74Updated last year
- ☆56Updated 2 years ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆74Updated 2 weeks ago
- ☆19Updated last year
- Audio tokenization, in the fastest way possible!☆46Updated 5 months ago
- ☆31Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆90Updated 3 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆36Updated last year
- MeetEval - A meeting transcription evaluation toolkit☆84Updated last month
- ☆33Updated 3 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆47Updated this week
- Putting flows on top of neural transducers for better TTS☆63Updated this week
- Advanced data structures for handling temporal segments with attached labels.☆105Updated 2 weeks ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023)☆27Updated last year
- A toolkit for processing speech data and creating speech datasets☆104Updated this week
- Predicts the level of noise and reverberation on your audiofiles☆144Updated 8 months ago
- ☆38Updated last year
- AudioBench: A Universal Benchmark for Audio Large Language Models☆116Updated last week
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆115Updated 2 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆60Updated 2 years ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆79Updated 7 months ago
- ☆33Updated 3 weeks ago
- Standalone implementation of the CUDA-accelerated WFST Decoder available in Riva☆83Updated last month