bugbakery / pydiar
simple to use, pretrained/training-less models for speaker diarization
☆21 · Updated last year
Alternatives and similar repositories for pydiar:
Users who are interested in pydiar are comparing it to the libraries listed below.
- Evaluation of STT models for the German language ☆15 · Updated 3 years ago
- Coqui Inference Engine ☆38 · Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech. ☆13 · Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆14 · Updated 2 years ago
- Explore different ways to mix speech models (wav2vec2, HuBERT) and NLP models (BART, T5, GPT) together ☆47 · Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation ☆35 · Updated last month
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆24 · Updated 3 years ago
- 🐍 Coqui's machine learning job scheduler ☆32 · Updated 3 years ago
- ☆75 · Updated 3 years ago
- Survey of available speech datasets for Polish ASR development ☆13 · Updated 3 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆35 · Updated 3 years ago
- phone inventory library ☆16 · Updated last year
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… ☆32 · Updated 4 years ago
- Speaker diarization service ☆21 · Updated last month
- A handy dataset of noises for ASR ☆20 · Updated 5 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 · Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding ☆75 · Updated 3 years ago
- A corpus of speech from the Joe Rogan Experience podcast, consisting of 8.43 million words. It includes aligned TextGrids with phonetic a… ☆17 · Updated 5 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition. ☆51 · Updated 2 years ago
- Official Implementation of Mockingjay in PyTorch ☆54 · Updated last year
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ☆27 · Updated 3 years ago
- A collection of utilities for handling IPA phones. ☆25 · Updated last year
- ☆22 · Updated 3 years ago
- ☆32 · Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler ☆23 · Updated 4 years ago
- Prosodic Speech Segmentation with Transformers ☆25 · Updated last year
- ☆56 · Updated 2 years ago
- Suite for phonetic word embeddings, especially their evaluation and baseline models. ☆27 · Updated last month
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 · Updated last year
- Linguistic processing for Common Voice ☆55 · Updated last year