CiscoDevNet / vo-idLinks
☆11Updated 4 years ago
Alternatives and similar repositories for vo-id
Users that are interested in vo-id are comparing it to the libraries listed below
Sorting:
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆154Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus☆184Updated last year
- Various speech datasets made available to the public☆130Updated last year
- ☆40Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345Updated last year
- A live speech recognition using Facebooks wav2vec 2.0 model.☆376Updated 2 years ago
- Diarization scoring tools.☆263Updated 2 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆439Updated 5 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x☆71Updated 3 years ago
- ☆45Updated 3 years ago
- Code for the ICASSP 2022 paper "Transformer-Based Multi-Aspect Multi-Granularity Non-native English Speaker Pronunciation Assessment".☆196Updated 2 years ago
- ☆50Updated 3 years ago
- ☆357Updated last year
- Speech Toolkit for Malaysian language, https://malaya-speech.readthedocs.io/☆279Updated 4 months ago
- A python package for deep multilingual punctuation prediction.☆156Updated last year
- A lightweight library to compute Diarization Error Rate (DER).☆62Updated 3 weeks ago
- ONNX Inference of Pyannote Segmentation☆97Updated last year
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆125Updated 3 years ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆169Updated last month
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- A non-native English corpus for pronunciation scoring task☆165Updated 3 months ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 4 years ago
- A curated list of awesome voice activity detection☆71Updated last year
- ☆94Updated last year
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆45Updated 4 years ago
- Python server for communicating with Kaldi from the browser using WebRTC☆69Updated 2 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆467Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆36Updated 5 years ago
- Deep Learning - one shot learning for speaker recognition using Filter Banks☆170Updated last year
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Updated 2 years ago