neubig / kylm
The Kyoto Language Modeling Toolkit
☆27Nov 27, 2014Updated 11 years ago
Alternatives and similar repositories for kylm
Users who are interested in kylm are comparing it to the libraries listed below.
- steps to perform text-based speaker diarization with kaldi toolkit☆12Nov 2, 2018Updated 7 years ago
- Grapheme to phoneme toolkit using joint-modelling + CRFs in java☆14Jul 14, 2018Updated 7 years ago
- Software to apply unsupervised word segmentation on lattices or text sequences using a nested hierarchical Pitman Yor language model☆17Nov 24, 2016Updated 9 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Convert words to numbers☆21Apr 13, 2022Updated 3 years ago
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- Simple LSTM language modelling toolkit☆10Oct 21, 2022Updated 3 years ago
- Simple Kaldi recipe for forced alignment☆11Jul 16, 2023Updated 2 years ago
- Filtering and Noise Adding Tool☆29May 27, 2022Updated 3 years ago
- An Efficient Language Model Using Double-Array Structures☆17Aug 10, 2020Updated 5 years ago
- Build OpenFst using ndk-build☆11Nov 22, 2018Updated 7 years ago
- An English lexical database from the Big 🍎, let's go Mets baby love da Mets☆18Dec 12, 2025Updated 2 months ago
- This is a mirror of https://gitlab.com/tiro-is/tiro-speech-core☆15Jun 19, 2023Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Jun 12, 2022Updated 3 years ago
- Phonetically-Oriented Word Error Rate☆36May 4, 2019Updated 6 years ago
- A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning me…☆42Sep 6, 2025Updated 5 months ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.☆15May 19, 2020Updated 5 years ago
- Software for unsupervised word segmentation and language model learning using lattices☆45Aug 17, 2016Updated 9 years ago
- Automatically exported from code.google.com/p/m2m-aligner☆42Apr 12, 2016Updated 9 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone …☆43Aug 3, 2022Updated 3 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and quickly.☆16Jun 17, 2022Updated 3 years ago
- language models toolkits with hierarchical softmax setting☆17Mar 23, 2018Updated 7 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Mar 6, 2023Updated 2 years ago
- A handy dataset of noises for ASR☆22May 29, 2019Updated 6 years ago
- BurrMill core☆22Nov 2, 2021Updated 4 years ago
- A baseline Automatic Speech Recognition system for Polish based on Kaldi.☆18Dec 21, 2021Updated 4 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- All you need to get started for the Zero Speech Challenge 2017☆47Apr 23, 2019Updated 6 years ago
- Punctuation generation for speech transcripts using lexical and prosodic features☆42Mar 5, 2019Updated 6 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- 📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.☆22Jul 12, 2019Updated 6 years ago
- Analytic signal-based source information analysis for YANGstraight and real-time interactive tools☆34Aug 20, 2019Updated 6 years ago
- Namelti : The automatic transcription generation library for person name in Katakana☆21Jul 10, 2023Updated 2 years ago
- Implements Rocchio Query Expansion - similar to "related searches:" found at popular search engines but based on relevant documents selec…☆20Sep 12, 2016Updated 9 years ago
- Java interfaces and tools for Kaldi speech recognition.☆20Oct 2, 2016Updated 9 years ago
- (semi) Grapheme-to-Phoneme (G2P) - seq2seq model using PyTorch for Korean☆23Dec 17, 2017Updated 8 years ago
- Long audio alignment using Kaldi☆23Apr 22, 2021Updated 4 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year