shanguanma / Aligners

HMM, CTC, RNN-Transducer, forward-backward algorithm

☆21

Alternatives and similar repositories for Aligners

Users who are interested in Aligners are comparing it to the libraries listed below.

k2-fsa / multi_quantization
☆44 Updated last year
HaoranMiao / streaming-attention
streaming attention networks for end-to-end automatic speech recognition
☆55 Updated 5 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31 Updated 3 years ago
thu-spmi / ST-NAS
Efficient Neural Architecture Search via Straight-Through Gradients
☆13 Updated 4 years ago
csukuangfj / kaldi-hmm-gmm
☆25 Updated 9 months ago
desh2608 / css
PyTorch implementation of Continuous Speech Separation
☆13 Updated 2 years ago
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆58 Updated last year
danpovey / quantization
Torch-based tool for quantizing high-dimensional vectors using additive codebooks
☆54 Updated 3 years ago
1ytic / warp-rna
Recurrent Neural Aligner
☆50 Updated 5 years ago
csukuangfj / optimized_transducer
Memory efficient transducer loss computation
☆68 Updated 3 years ago
Sytronik / deep-griffinlim-iteration
PyTorch implementation for the Deep Griffin-Lim Iteration paper (https://arxiv.org/abs/1903.03971)
☆39 Updated 5 years ago
luomingshuang / k2-speechbrain
In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.
☆16 Updated 3 years ago
nii-yamagishilab / Intelligibility-MetricGAN
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…
☆55 Updated 2 years ago
xinjli / alqalign
multilingual speech aligner
☆75 Updated last year
thuhcsi / NeuFA
Neural network-based forced alignment with bidirectional attention mechanism
☆77 Updated 6 months ago
hainan-xv / PASM
Pronunciation-assisted Subword Modeling
☆30 Updated 6 years ago
placebokkk / ctc-asr
PyTorch CTC implementation for ASR. Uses Eesen's FST decoder framework.
☆10 Updated 5 years ago
k2-fsa / kaldifst
Python wrapper for OpenFST and its extensions from Kaldi. Also supports reading/writing ark/scp files.
☆53 Updated 3 months ago
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23 Updated 5 years ago
athena-team / athena-decoder
☆76 Updated 3 years ago
Alexander-H-Liu / NPC
Non-Autoregressive Predictive Coding
☆51 Updated 4 years ago
pzelasko / kaldialign
Python wrappers for Kaldi's Levenshtein distance and alignment code.
☆67 Updated 2 months ago
wenet-e2e / WeSpeech-AI
Open Source Speech/Text Data on AI
☆18 Updated 2 years ago
Hertin / WavPrompt
☆36 Updated 3 years ago
rishikksh20 / PPSpeech
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35 Updated 4 years ago
desh2608 / pytorch-tdnn
PyPI-installable TDNN and TDNN-F layers for PyTorch-based acoustic model training
☆40 Updated 4 years ago
iamjanvijay / rnnt_decoder_cuda
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
☆68 Updated 4 years ago
mechanicalsea / lighthubert
LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT
☆74 Updated 2 years ago
danpovey / lilcom
Small compression utility
☆37 Updated 4 months ago
idiap / pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
☆73 Updated 3 years ago