revdotcom / fstalign
An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.
☆162 · Updated 3 weeks ago
Alternatives and similar repositories for fstalign:
Users who are interested in fstalign are comparing it to the libraries listed below:
- Various speech datasets made available to the public ☆113 · Updated 2 months ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised … ☆130 · Updated last year
- An online speech recognition extension toolkit of Kaldi ☆56 · Updated 3 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆328 · Updated 9 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆64 · Updated 5 months ago
- Variational Bayes HMM over x-vectors diarization ☆263 · Updated last year
- A torch implementation of a recursion which turns out to be useful for RNN-T. ☆140 · Updated last year
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆131 · Updated 2 months ago
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN) ☆59 · Updated 4 years ago
- Predicts the level of noise and reverberation on your audiofiles ☆144 · Updated 9 months ago
- Diarization scoring tools. ☆235 · Updated last year
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101 · Updated last year
- Moved to https://github.com/k2-fsa/icefall ☆144 · Updated 2 years ago
- ☆39 · Updated last year
- A lightweight library to compute Diarization Error Rate (DER). ☆59 · Updated last year
- ☆34 · Updated 5 months ago
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap" ☆116 · Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR ☆220 · Updated 4 years ago
- Implementation of audio degradation processes ☆101 · Updated 9 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆73 · Updated 4 years ago
- Charsiu: A neural phonetic aligner. ☆292 · Updated 2 years ago
- Clustering-based methods for overlapping diarization ☆75 · Updated last year
- A lightweight neural speaker embedding extractor based on Kaldi and PyTorch. ☆136 · Updated 5 years ago
- ☆71 · Updated last year
- The People’s Speech Dataset ☆101 · Updated last year
- Wav2Keyword is a keyword spotting (KWS) model based on Wav2Vec 2.0. It shows state-of-the-art results on the Speech Commands datasets V1 and V2. ☆103 · Updated 2 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆77 · Updated 10 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆96 · Updated last week
- This repository contains code to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl… ☆75 · Updated 2 years ago
- A Python toolbox for speech features extraction ☆161 · Updated 2 years ago