pyannote / pyannote-pipeline
Tunable pipelines
☆26Updated 3 weeks ago
Related projects: ⓘ
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆64Updated 11 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆79Updated 5 months ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆41Updated 2 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆95Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆39Updated 2 months ago
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- ☆16Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- ☆48Updated last month
- ☆30Updated 7 months ago
- ☆41Updated 7 months ago
- Putting flows on top of neural transducers for better TTS☆63Updated last month
- ☆31Updated 2 weeks ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆74Updated 2 months ago
- ☆17Updated last year
- ☆56Updated last year
- Audio Large Language Models☆59Updated this week
- Online streaming speaker change detection model in Pytorch☆34Updated last year
- asr2k☆48Updated 3 months ago
- ☆22Updated 3 years ago
- 56 language, 1 model Multilingual ASR☆23Updated 3 years ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆56Updated 4 years ago
- An online speech recognition extension toolkit of Kaldi☆57Updated 3 years ago
- ☆15Updated last month
- Audio tokenization, in the fastest way possible!☆43Updated 3 weeks ago
- ONNX Inference of Pyannote Segmentation☆54Updated last week
- Transcribing Speech with Multinomial Diffusion, training code and models.☆74Updated 11 months ago
- ☆38Updated last year
- ☆57Updated 2 weeks ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 7 months ago