Chaanks / stklia

simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)

☆10

Alternatives and similar repositories for stklia:

Users that are interested in stklia are comparing it to the libraries listed below

NickRuiz / power-asr
Phonetically-Oriented Word Error Rate
☆34Updated 5 years ago
luferrer / ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
☆80Updated last year
IDRnD / VoxTube
The VoxTube dataset official repository
☆68Updated last year
ankitapasad / layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
☆102Updated 5 months ago
nttcslab-sp / EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆76Updated 2 years ago
zhenghuatan / rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …
☆133Updated last year
xiuwenz2 / SAPC-template
☆11Updated last week
Xflick / EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
☆106Updated last year
talhanai / wer-sigtest
Script to perform statistical significance test between ASR hypotheses.
☆22Updated 7 years ago
csukuangfj / transducer-loss-benchmarking
☆68Updated 3 years ago
audiolabs / torch-pesq
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
☆183Updated last year
bfs18 / tacotron2
Tacotron 2 - PyTorch implementation with faster-than-realtime inference
☆51Updated 5 years ago
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆141Updated last year
Voice-Privacy-Challenge / Voice-Privacy-Challenge-2024
Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software
☆52Updated 2 months ago
idiap / acoustic-simulator
Implementation of audio degradation processes
☆102Updated 9 years ago
chenzhuo1011 / libri_css
Libri-CSS: dataset and evaluation pipeline
☆143Updated 2 years ago
RF5 / simple-speaker-embedding
A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.
☆88Updated 2 weeks ago
marianne-m / brouhaha-vad
Predicts the level of noise and reverberation on your audiofiles
☆148Updated 10 months ago
CSTR-Edinburgh / ophelia
Sequence-to-sequence TTS based on Kyubyong's dc_tts
☆60Updated 2 years ago
idiap / pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
☆73Updated 2 years ago
iiscleap / NISP-Dataset
☆29Updated 2 years ago
zhenghuatan / rVADfast
This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…
☆137Updated 4 months ago
jtrmal / kaldi2020
☆27Updated 4 years ago
BUTSpeechFIT / VBx
Variational Bayes HMM over x-vectors diarization
☆268Updated last year
Takaaki-Saeki / DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
☆152Updated 4 months ago
DigitalPhonetics / speaker-anonymization
Speaker anonymization pipeline for hiding the identity of the speaker of a recording by changing the voice in it.
☆72Updated 7 months ago
nttcslab-sp / kaldiio
A pure python module for reading and writing kaldi ark files
☆256Updated last month
farisalasmary / wav2vec2-kenlm
Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding
☆75Updated 3 years ago
lorenlugosch / transducer-tutorial
Example code for a neural transducer model.
☆61Updated last year
k2-fsa / snowfall
Moved to https://github.com/k2-fsa/icefall
☆144Updated 2 years ago