LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22 · Jul 12, 2019 · Updated 6 years ago
Alternatives and similar repositories for LanMIT
Users who are interested in LanMIT are comparing it to the libraries listed below.
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone ☆35 · Feb 18, 2022 · Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias ☆20 · Jul 21, 2020 · Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Mar 30, 2021 · Updated 4 years ago
- Multistream CNN for Robust Acoustic Modeling ☆40 · Jun 17, 2021 · Updated 4 years ago
- Wake word spotting with Kaldi ☆19 · Dec 3, 2020 · Updated 5 years ago
- Adapting your own Language Model for Kaldi ☆63 · Jan 8, 2019 · Updated 7 years ago
- A handy dataset of noises for ASR ☆22 · May 29, 2019 · Updated 6 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 · Jun 4, 2020 · Updated 5 years ago
- Perform the forced decoding with target transcription ☆11 · Sep 12, 2018 · Updated 7 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆12 · Nov 2, 2018 · Updated 7 years ago
- PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant ☆10 · Aug 12, 2019 · Updated 6 years ago
- Simple Kaldi recipe for forced alignment ☆11 · Jul 16, 2023 · Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z… ☆32 · Apr 8, 2022 · Updated 3 years ago
- ☆13 · Oct 27, 2021 · Updated 4 years ago
- Denoising autoencoders for speaker identification on MCE 2018 challenge ☆12 · Nov 8, 2018 · Updated 7 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP) ☆11 · Dec 4, 2023 · Updated 2 years ago
- Transfer learning approach to pronunciation scoring ☆11 · Jan 17, 2024 · Updated 2 years ago
- ☆21 · Sep 24, 2018 · Updated 7 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model ☆33 · Jan 26, 2020 · Updated 6 years ago
- Convert words to numbers ☆21 · Apr 13, 2022 · Updated 3 years ago
- ☆25 · Jun 14, 2022 · Updated 3 years ago
- ☆14 · Jun 12, 2015 · Updated 10 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y… ☆25 · May 6, 2019 · Updated 6 years ago
- Goodness of Pronunciation algorithm using PyKaldi ☆18 · Jun 12, 2022 · Updated 3 years ago
- ☆17 · Nov 25, 2019 · Updated 6 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Mar 6, 2023 · Updated 2 years ago
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language ☆43 · Feb 28, 2018 · Updated 8 years ago
- Pronunciation-assisted Subword Modeling ☆31 · May 30, 2019 · Updated 6 years ago
- A GPU language model, based on btree backed tries. ☆29 · Mar 6, 2018 · Updated 7 years ago
- ☆17 · Apr 14, 2023 · Updated 2 years ago
- From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa… ☆17 · May 15, 2015 · Updated 10 years ago
- In this repository, I try to combine k2 with SpeechBrain to decode well and quickly. ☆16 · Jun 17, 2022 · Updated 3 years ago
- MirasVoice is a data set consisting of speech samples from bilinguals to train a neural network for optimization of speaker verification algor… ☆19 · Mar 15, 2020 · Updated 5 years ago
- A KWS demo on Android ☆40 · May 28, 2024 · Updated last year
- Detect emotion from audio ☆13 · Nov 20, 2018 · Updated 7 years ago
- Google's TPGST reimplementation. ☆34 · Dec 11, 2019 · Updated 6 years ago
- Properly handle position-dependent phones in a subword lexicon FST ☆31 · Oct 26, 2020 · Updated 5 years ago
- Won't it be cool to build a speech assistant like Alexa or Siri yourself without voice API and network connection? ☆34 · May 5, 2018 · Updated 7 years ago
- Tools for working with the CMU Pronunciation Dictionary ☆36 · Sep 5, 2017 · Updated 8 years ago