bugbakery / pydiar
simple to use, pretrained/training-less models for speaker diarization
☆21Updated last year
Alternatives and similar repositories for pydiar
Users that are interested in pydiar are comparing it to the libraries listed below
Sorting:
- Creation of a multi user audio first annotation tool - GSoC 2021☆29Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆106Updated 2 weeks ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆14Updated 2 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- phone inventory library☆16Updated 2 years ago
- ☆22Updated 3 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- ☆32Updated 3 years ago
- SpectroMap is a peak detection algorithm that computes the constellation map for a given signal☆31Updated 10 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language☆33Updated this week
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 2 months ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆13Updated 3 months ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆39Updated 2 years ago
- Coqui Inference Engine☆40Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorch☆21Updated 2 years ago
- Open Source AI Benchmarking toolkit for benchmarking speech to text services☆55Updated last year
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆16Updated last year
- A pipeline to isolate and transcribe one language in mixed-language speech☆18Updated 2 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.☆13Updated 5 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 5 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆49Updated 8 months ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification.☆56Updated 5 months ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆22Updated last year
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆26Updated last year
- Script to train a German n-gram Language Model on articles of Wikipedia☆13Updated 6 years ago
- A Python-based modular toolbox for building Deep Neural Network models (using PyTorch) for statistical parametric speech synthesis☆23Updated 3 years ago
- Praaline is an open-source system to manage, annotate, visualise and analyse spoken language corpora☆29Updated 2 years ago