bugbakery / pydiarLinks
simple to use, pretrained/training-less models for speaker diarization
☆21Updated 2 years ago
Alternatives and similar repositories for pydiar
Users that are interested in pydiar are comparing it to the libraries listed below
Sorting:
- Grapheme-to-phoneme tool for corpus conversion, where phonemes match Phoible inventories☆19Updated 9 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆38Updated 11 months ago
- Evaluation of STT models for german language☆15Updated 4 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆16Updated 2 years ago
- phone inventory library☆17Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆41Updated 3 years ago
- Coqui Inference Engine☆40Updated 4 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 4 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆23Updated 4 years ago
- ☆22Updated 3 years ago
- Grapheme to phoneme model for PyTorch☆42Updated 3 years ago
- REST api for mozilla deepspeech voice recognition engine☆20Updated 4 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated last year
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 4 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆15Updated 6 years ago
- ☆22Updated 4 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆34Updated 5 years ago
- A corpus of diacritized Hebrew texts (טקסט מנוקד)☆11Updated 3 years ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆59Updated last year
- ☆14Updated 10 years ago
- Open Audio Search☆118Updated 2 years ago
- ☆76Updated 4 years ago
- A family of efficient speech models for multilingual phone recognition☆37Updated 3 months ago
- A collection of utilities for handling IPA phones.☆26Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆112Updated 3 weeks ago
- Python wrapper for phonetisaurus grapheme to phoneme tool☆12Updated 4 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.☆39Updated last week
- Support tools for punctuation and boundary detection for ASR output.☆55Updated 3 years ago