bugbakery / pydiarLinks
simple to use, pretrained/training-less models for speaker diarization
☆21Updated last year
Alternatives and similar repositories for pydiar
Users that are interested in pydiar are comparing it to the libraries listed below
Sorting:
- Evaluation of STT models for german language☆15Updated 3 years ago
- Speaker diarization service☆23Updated 3 weeks ago
- Coqui Inference Engine☆40Updated 3 years ago
- ☆53Updated 2 years ago
- phone inventory library☆16Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 4 months ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆58Updated 7 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆107Updated last month
- ☆32Updated 3 years ago
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆25Updated 4 months ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆56Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆41Updated 2 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆17Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your language☆37Updated this week
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- ☆11Updated 10 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated 10 months ago
- Open Audio Search☆116Updated 2 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models.☆30Updated 4 months ago
- ISO 639 language codes☆45Updated 5 months ago
- ☆17Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- ☆22Updated 4 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆27Updated 2 years ago
- C++ version of pyannote audio overlapped speech detection pipeline☆13Updated last year