JRMeyer/multi-task-kaldi

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/JRMeyer/multi-task-kaldi)

JRMeyer / multi-task-kaldi

An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 training.

☆55

Alternatives and similar repositories for multi-task-kaldi

Users that are interested in multi-task-kaldi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JRMeyer / kaldi-tf
View on GitHub
Scripts for exporting Kaldi labeled data into TensorFlow
☆12Jul 31, 2019Updated 6 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
uhh-lt / kaldi-model-server
View on GitHub
Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone
☆35Feb 18, 2022Updated 4 years ago
irebai / SpecAugment_KALDI
View on GitHub
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15Sep 4, 2019Updated 6 years ago
srinivr / kaldi-long-audio-alignment
View on GitHub
Long audio alignment using Kaldi
☆23Apr 22, 2021Updated 5 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yh1008 / speech-to-text
View on GitHub
mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras
☆71Nov 20, 2017Updated 8 years ago
asappresearch / multistream-cnn
View on GitHub
Multistream CNN for Robust Acoustic Modeling
☆40Jun 17, 2021Updated 5 years ago
speechpro / mixup
View on GitHub
☆24Mar 13, 2020Updated 6 years ago
dspavankumar / keras-kaldi
View on GitHub
Keras Interface for Kaldi ASR
☆122Sep 27, 2017Updated 8 years ago
idiap / pkwrap
View on GitHub
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
☆73Jun 8, 2022Updated 4 years ago
danpovey / pocolm
View on GitHub
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92Aug 6, 2022Updated 3 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
dense-analysis / vim-speech
View on GitHub
Vim Speech Recognition Experiments
☆20May 30, 2025Updated last year
JRMeyer / speakerID-challenge
View on GitHub
A recipe for creating a Speaker Identification system built on Kaldi.
☆15Jan 2, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
RuABraun / texterrors
View on GitHub
☆37Jun 9, 2026Updated last month
JRMeyer / easy-kaldi
View on GitHub
Use your data to create a speech recognition system in Kaldi. Fast.
☆65Jan 2, 2020Updated 6 years ago
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
gooofy / kaldi-adapt-lm
View on GitHub
Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model
☆33Jan 26, 2020Updated 6 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
dogancan / expected-edit-distance
View on GitHub
Expected edit distance implementation using OpenFst tools
☆11May 13, 2015Updated 11 years ago
srvk / lm_build
View on GitHub
Adapting your own Language Model for Kaldi
☆63Jan 8, 2019Updated 7 years ago
KarelVesely84 / kaldi-io-for-python
View on GitHub
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
☆378Jun 16, 2023Updated 3 years ago
kaituoxu / kaldi-ktnet1
View on GitHub
Kaldi extended by Kaituo XU with new features in nnet1.
☆12Dec 16, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
oxinabox / Kaldi-Notes
View on GitHub
Some notes on Kaldi
☆32Feb 20, 2015Updated 11 years ago
bioidiap / bob.bio.spear
View on GitHub
Run speaker recognition algorithms - Mirrored from https://gitlab.idiap.ch/bob/bob.bio.spear
☆19Jun 24, 2023Updated 3 years ago
XiaoMi / kaldi-onnx
View on GitHub
Kaldi model converter to ONNX
☆248Jan 27, 2023Updated 3 years ago
wangyu09 / exkaldi-rt
View on GitHub
An online speech recognition extension toolkit of Kaldi
☆55Jun 23, 2021Updated 5 years ago
google / asr-recipes
View on GitHub
☆17Jul 29, 2018Updated 7 years ago
datemoon / ASR-decoder
View on GitHub
it's ASR decoder and make graph project
☆33May 26, 2022Updated 4 years ago
csukuangfj / kaldilm
View on GitHub
Python wrapper for kaldi's arpa2fst
☆38Aug 27, 2025Updated 10 months ago
jzlianglu / pykaldi2
View on GitHub
Yet another speech toolkit based on Kaldi and PyTorch
☆173Jul 1, 2020Updated 6 years ago
freewym / espresso
View on GitHub
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939Sep 4, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
KrishnaDN / BERTphone
View on GitHub
Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"
☆17Dec 10, 2020Updated 5 years ago
isca-sig-rosp / ISCA-SIG-RoSP
View on GitHub
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11Dec 4, 2023Updated 2 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
desh2608 / dover-lap
View on GitHub
Python package for combining diarization system outputs.
☆94Oct 12, 2023Updated 2 years ago
moisesveleta / GOP-LSTM
View on GitHub
Improving the Goodness of Pronunciation with DNNs and RNNs
☆32Sep 26, 2018Updated 7 years ago
LeBenchmark / Interspeech2021
View on GitHub
This repository describes our reproducible framework for assessing self-supervised representation learning from speech
☆52Oct 8, 2021Updated 4 years ago
i3thuan5 / FaNT
View on GitHub
Filtering and Noise Adding Tool
☆29May 27, 2022Updated 4 years ago