Picovoice / speaker-diarization-benchmark
Speaker diarization benchmark framework
☆20Updated last year
Alternatives and similar repositories for speaker-diarization-benchmark:
Users that are interested in speaker-diarization-benchmark are comparing it to the libraries listed below
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- ☆26Updated last month
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- ☆10Updated last week
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆23Updated 2 weeks ago
- Goodness of Pronunciation algorithm using PyKaldi☆15Updated 2 years ago
- Clustering-based methods for overlapping diarization☆78Updated last year
- The case study and multilingfual performance of ICASSP submission☆23Updated 2 years ago
- Code for our INTERSPEECH paper Simul-Whisper: Attention-Guided Streaming Whisper with Truncation Detection☆60Updated last month
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.☆12Updated 2 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 9 months ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- asr2k☆49Updated 9 months ago
- ☆61Updated last year
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆21Updated last year
- ☆31Updated 11 months ago
- Predicts the level of noise and reverberation on your audiofiles☆148Updated 10 months ago
- Token-Level Ensemble Distillation for Grapheme-to-Phoneme Conversion☆20Updated 5 years ago
- A SPMI Lab toolkit for language models.☆11Updated 7 years ago
- ☆17Updated 3 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆68Updated 7 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- ☆12Updated 2 months ago
- Official Code for ParrotTTS☆48Updated 5 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- A handy dataset of noises for ASR☆20Updated 5 years ago