Chaanks / stkliaLinks
simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)
☆10Updated 3 years ago
Alternatives and similar repositories for stklia
Users that are interested in stklia are comparing it to the libraries listed below
Sorting:
- Segment an audio file and obtain utterance alignments. (Python package)☆340Updated last year
- Variational Bayes HMM over x-vectors diarization☆275Updated last year
- A pure python module for reading and writing kaldi ark files☆260Updated 6 months ago
- End-to-End Neural Diarization☆406Updated 4 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆459Updated 2 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆87Updated last year
- Moved to https://github.com/k2-fsa/icefall☆146Updated 2 years ago
- Versatile Evaluation of Speech and Audio☆319Updated 3 weeks ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆222Updated 6 months ago
- Spot the conversation: speaker diarisation in the wild☆144Updated 3 years ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆116Updated 10 months ago
- Diarization scoring tools.☆256Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Updated 4 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆76Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆258Updated last year
- BUT Multilingual Bottleneck Features☆15Updated 6 years ago
- Libri-CSS: dataset and evaluation pipeline☆147Updated 2 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆108Updated 2 years ago
- Script to perform statistical significance test between ASR hypotheses.☆22Updated 8 years ago
- Various speech datasets made available to the public☆128Updated 8 months ago
- ☆228Updated last year
- Example code for a neural transducer model.☆66Updated last year
- dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.☆508Updated 2 years ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆374Updated last year
- A library for speech data augmentation in time-domain☆671Updated 4 years ago
- Speaker embedding (d-vector) trained with GE2E loss☆284Updated last year
- VB Diarization with Eigenvoice and HMM Priors, refactored☆15Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆143Updated 2 years ago
- Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.☆81Updated 2 months ago
- A CRF-based ASR Toolkit☆348Updated 2 months ago