pyannote / pyannote-pipeline
Tunable pipelines
☆34 Updated 3 months ago
Alternatives and similar repositories for pyannote-pipeline
Users who are interested in pyannote-pipeline are comparing it to the libraries listed below.
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆83 Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆100 Updated 3 months ago
- ☆40 Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆51 Updated last week
- ☆103 Updated last week
- ☆54 Updated last year
- Advanced data structures for handling temporal segments with attached labels. ☆113 Updated 3 months ago
- A curated list of awesome voice activity detection ☆54 Updated 6 months ago
- Clustering-based methods for overlapping diarization ☆81 Updated last year
- ☆33 Updated 3 years ago
- ☆26 Updated 4 months ago
- ☆19 Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- ☆38 Updated 3 years ago
- ☆56 Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX. Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den… ☆73 Updated 9 months ago
- Zero-shot Domain-sensitive Speech Recognition with Prompt-conditioning Fine-tuning (ASRU2023) ☆27 Updated last year
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆136 Updated 3 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆71 Updated 9 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆98 Updated 7 months ago
- ☆22 Updated 3 years ago
- Putting flows on top of neural transducers for better TTS ☆62 Updated last week
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆90 Updated 4 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way ☆43 Updated 2 years ago
- VoiceBox neural network implementation ☆108 Updated 10 months ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap" ☆119 Updated 3 years ago
- Various speech datasets made available to the public ☆118 Updated 5 months ago
- Implementation of Google's USM speech model in Pytorch ☆31 Updated last month
- Predicts the level of noise and reverberation on your audiofiles ☆151 Updated last year
- a lightweight voice conversion ☆82 Updated 9 months ago