bugbakery / pydiar
simple to use, pretrained/training-less models for speaker diarization
☆21 Updated 2 years ago
Alternatives and similar repositories for pydiar
Users who are interested in pydiar are comparing it to the libraries listed below
- Evaluation of STT models for the German language ☆15 Updated 3 years ago
- cologne-phonetics implementation in Python ☆17 Updated last year
- Phone inventory library ☆17 Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 Updated 2 years ago
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories ☆18 Updated 8 months ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆23 Updated 4 years ago
- Unicode Standard tokenization routines and orthography profile segmentation ☆38 Updated 9 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation. ☆111 Updated 6 months ago
- Forced Alignments for Common Voice ☆31 Updated 5 years ago
- REST API for the Mozilla DeepSpeech voice recognition engine ☆20 Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ☆50 Updated last year
- Grapheme to phoneme model for PyTorch ☆40 Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆27 Updated 9 months ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning ☆40 Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 Updated 2 years ago
- Coqui Inference Engine ☆41 Updated 4 years ago
- ☆11 Updated 4 years ago
- Command line tool to create corpora for Common Voice ☆78 Updated last week
- Speaker diarization Python system based on binary key speaker modelling ☆60 Updated 3 years ago
- ☆14 Updated 10 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆56 Updated 3 years ago
- Phoneme prediction from speech mel-spectrograms using RNN. ☆15 Updated 6 years ago
- Speaker diarization service ☆25 Updated 5 months ago
- Simple WFST for Ukrainian ITN based on NVIDIA NeMo and Pynini ☆19 Updated last month
- ☆57 Updated 2 years ago
- This is a text-processing frontend that converts graphemes to phonemes and then further converts those phonemes into articulatory feature… ☆13 Updated last year
- ☆76 Updated 4 years ago
- Creation of a multi-user, audio-first annotation tool - GSoC 2021 ☆29 Updated 2 years ago
- Script to train a German n-gram language model on Wikipedia articles ☆13 Updated 7 years ago