wangyu09 / exkaldi-rtLinks

An online speech recognition extension toolkit of Kaldi

☆56

Alternatives and similar repositories for exkaldi-rt

Users that are interested in exkaldi-rt are comparing it to the libraries listed below

Sorting:

athena-team / athena-decoder
☆76Updated 3 years ago
idiap / icassp-oov-recognition
Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"
☆17Updated 4 years ago
JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆55Updated 5 years ago
YiwenShaoStephen / pychain_example
☆48Updated 4 years ago
idiap / acoustic-simulator
Implementation of audio degradation processes
☆105Updated 10 years ago
csukuangfj / kaldilm
Python wrapper for kaldi's arpa2fst
☆38Updated 3 months ago
idiap / pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
☆73Updated 3 years ago
cornerfarmer / ctc_segmentation
Segment a given audio into utterances using a trained end-to-end ASR model.
☆74Updated 5 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆62Updated 4 years ago
MarkWuNLP / SemanticMask
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39Updated 5 years ago
alumae / online_speaker_change_detector
Online streaming speaker change detection model in Pytorch
☆43Updated 2 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79Updated 3 years ago
asappresearch / multistream-cnn
Multistream CNN for Robust Acoustic Modeling
☆40Updated 4 years ago
LeBenchmark / Interspeech2021
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆51Updated 4 years ago
HaoranMiao / streaming-attention
streaming attention networks for end-to-end automatic speech recognition
☆55Updated 5 years ago
RicherMans / Datadriven-GPVAD
The codebase for Data-driven general-purpose voice activity detection.
☆94Updated 2 years ago
zhenghuatan / rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …
☆137Updated last year
gooofy / kaldi-adapt-lm
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33Updated 5 years ago
hbredin / DomainAdversarialVoiceActivityDetection
Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"
☆23Updated 5 years ago
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆81Updated last year
k2-fsa / fast_rnnt
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆146Updated 2 years ago
TeaPoly / Conformer-Athena
Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.
☆44Updated 3 years ago
hainan-xv / PASM
Pronunciation-assisted Subword Modeling
☆31Updated 6 years ago
DonkeyShot21 / uis-rnn-sml
A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)
☆62Updated 5 years ago
i3thuan5 / FaNT
Filtering and Noise Adding Tool
☆29Updated 3 years ago
Ephrem-ETH / E2E-KWS
End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM
☆42Updated 3 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆56Updated 5 years ago
desh2608 / pytorch-tdnn
Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training
☆41Updated 4 years ago
funcwj / ge2e-speaker-verification
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103Updated 6 years ago
wenet-e2e / WeTextProcessing.deprecated
☆61Updated 2 years ago