Decoders from Kaldi using OpenFst
☆34Jan 29, 2026Updated last month
Alternatives and similar repositories for kaldi-decoder
Users that are interested in kaldi-decoder are comparing it to the libraries listed below
Sorting:
- ☆28Oct 7, 2025Updated 4 months ago
- Python wrapper for kaldi's arpa2fst☆38Aug 27, 2025Updated 6 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆55Sep 1, 2025Updated 6 months ago
- Memory efficient transducer loss computation☆69Jun 10, 2022Updated 3 years ago
- ☆24Mar 13, 2020Updated 5 years ago
- ☆17Nov 25, 2019Updated 6 years ago
- Colab notebooks for Next-gen Kaldi☆29Oct 12, 2025Updated 4 months ago
- c# wrapper for kaldi-native-fbank,used to extract audio features in speech recognition (ASR) task☆10Jul 26, 2025Updated 7 months ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆213Aug 7, 2025Updated 6 months ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Feb 4, 2020Updated 6 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Jun 30, 2025Updated 8 months ago
- A fast parallel implementation of RNN Transducer.☆12Apr 8, 2025Updated 10 months ago
- Efficient Neural Architecture Search via Straight-Through Gradients☆13Nov 12, 2020Updated 5 years ago
- MagicData-RAMC Dataset and Baseline☆57Sep 13, 2022Updated 3 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆58Jun 1, 2018Updated 7 years ago
- Moved to https://github.com/k2-fsa/icefall☆146Oct 13, 2022Updated 3 years ago
- Libriheavy: a 50,000 hours ASR corpus with punctuation casing and context☆214Sep 10, 2024Updated last year
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago
- A native-PyTorch library for large scale M-LLM (text/audio) training with tp/cp/dp.☆225Aug 6, 2025Updated 6 months ago
- This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …☆40Mar 13, 2024Updated last year
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆19May 12, 2023Updated 2 years ago
- 5Hz Deep-Compression Speech VAE for AR-Diffusion and CALMs☆57Nov 19, 2025Updated 3 months ago
- ☆16Jun 13, 2022Updated 3 years ago
- Multilingual and code-switching ASR challenges for low resource Indian languages.☆21Jul 26, 2021Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆150Aug 25, 2023Updated 2 years ago
- Torch Audio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆62Sep 5, 2025Updated 5 months ago
- Kaldi-compatible online fbank extractor without external dependencies☆141Oct 9, 2025Updated 4 months ago
- An efficient OpenFST-based tool for calculating WER and aligning two transcript sequences.☆170Jan 7, 2026Updated last month
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Dart plugin wrapping the Sherpa-ONNX runtime. Contains example for speech recognition with Flutter☆22Jan 3, 2025Updated last year
- KittenTTS is an ultra-lightweight, CPU-friendly text-to-speech model with 15M params for real-time, high-quality voices. Open source, fas…☆23Updated this week
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Pronunciation lexicon covering both English and Chinese languages for Automatic Speech Recognition.☆262Oct 11, 2019Updated 6 years ago
- An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.☆67Jan 7, 2026Updated last month
- My solution to course E6870 (Speech Recognition) of Columbia University.☆37May 13, 2018Updated 7 years ago