Chaanks / stklia
Simple version of our torch kaldi toolkit, developed at the LIA by two apprentices (@Chaanks & @vbrignatz).
☆10 Updated 4 years ago
Alternatives and similar repositories for stklia
Users who are interested in stklia are comparing it to the libraries listed below.
- Segment an audio file and obtain utterance alignments. (Python package) ☆345 Updated last year
- Variational Bayes HMM over x-vectors diarization ☆283 Updated 2 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition. ☆467 Updated 2 years ago
- A pure python module for reading and writing kaldi ark files ☆267 Updated 11 months ago
- End-to-End Neural Diarization ☆421 Updated 4 years ago
- Diarization scoring tools. ☆263 Updated 2 years ago
- ☆236 Updated 2 years ago
- Multilingual G2P in 100 languages ☆374 Updated 2 years ago
- A library for speech data augmentation in time-domain ☆682 Updated 4 years ago
- Moved to https://github.com/k2-fsa/icefall ☆146 Updated 3 years ago
- Versatile Evaluation of Speech and Audio ☆384 Updated 2 months ago
- Large, modern dataset for speech recognition ☆718 Updated last year
- UniSpeech - Large Scale Self-Supervised Learning for Speech ☆478 Updated last year
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion" ☆379 Updated last year
- PyTorch implementation of LF-MMI for End-to-end ASR ☆220 Updated 5 years ago
- A CRF-based ASR Toolkit ☆362 Updated last week
- Libri-CSS: dataset and evaluation pipeline ☆151 Updated 3 years ago
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at… ☆439 Updated 6 months ago
- Charsiu: A neural phonetic aligner. ☆329 Updated 3 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆95 Updated last year
- A large-scale multilingual speech corpus for representation learning, semi-supervised learning and interpretation ☆568 Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild ☆157 Updated 3 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised … ☆138 Updated 2 years ago
- Speaker embedding (d-vector) trained with GE2E loss ☆286 Updated 2 years ago
- Grapheme to phoneme conversion with deep learning. ☆420 Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models ☆294 Updated last year
- Script to perform statistical significance test between ASR hypotheses. ☆22 Updated 8 years ago
- VB Diarization with Eigenvoice and HMM Priors, refactored ☆15 Updated 4 years ago
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems ☆242 Updated last month
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences. ☆169 Updated last month