bugbakery / pydiarLinks
simple to use, pretrained/training-less models for speaker diarization
☆21Updated last year
Alternatives and similar repositories for pydiar
Users that are interested in pydiar are comparing it to the libraries listed below
Sorting:
- cologne-phonetics implementation in python☆17Updated last year
- Evaluation of STT models for german language☆15Updated 3 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆34Updated this week
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆15Updated 2 years ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆56Updated 6 months ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated 8 months ago
- Creation of a multi user audio first annotation tool - GSoC 2021☆29Updated 2 years ago
- Python library for handling audio datasets.☆138Updated last year
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆47Updated 2 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆107Updated last week
- Simple audio recorder that sends WAV from browser to server in Python (Flask).☆31Updated 2 years ago
- A semi-supervised sequence-to-sequence ASR☆10Updated 2 years ago
- ☆11Updated last week
- Speaker diarization service☆23Updated last month
- Coqui Inference Engine☆40Updated 3 years ago
- REST api for mozilla deepspeech voice recognition engine☆20Updated 3 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆55Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text☆36Updated 4 years ago
- phone inventory library☆16Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.☆12Updated 2 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆29Updated 2 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆14Updated 4 months ago
- Fast and accurate natural language detection. Detector written in Python. Nito-ELD, ELD.☆17Updated last year
- ☆24Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 3 months ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆35Updated 3 years ago