vsiivola / variKNView external linksLinks
A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning methods.
☆42Sep 6, 2025Updated 5 months ago
Alternatives and similar repositories for variKN
Users that are interested in variKN are comparing it to the libraries listed below
Sorting:
- Properly handle position-dependent phones in a subword lexicon FST☆31Oct 26, 2020Updated 5 years ago
- The Kyoyo Language Modeling Toolkit☆27Nov 27, 2014Updated 11 years ago
- A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)☆21Sep 27, 2017Updated 8 years ago
- Tacotron2 with BERT examples☆10Jul 8, 2019Updated 6 years ago
- Semantic dependency relationship extractor untuk bahasa Indonesia... termasuk bahasa gaul dan alay ;) (terinspirasi oleh OpenCog RelEx)☆10Oct 2, 2015Updated 10 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Dec 19, 2023Updated 2 years ago
- how to generate the full-contextual labels from un-seen text for the application of HMM-based speech synthesis (HTS)☆12Nov 22, 2019Updated 6 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- Generalized Language Modeling toolkit☆51Jun 21, 2022Updated 3 years ago
- project trying to replicate http://arxiv.org/pdf/1412.5567v2.pdf☆12Mar 22, 2015Updated 10 years ago
- Tensorflow and kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"☆11Mar 24, 2023Updated 2 years ago
- A system for disambiguating toponyms (placenames) given textual context and creating visualizations of the locations referenced in a give…☆19Jul 24, 2013Updated 12 years ago
- OpusFilter - Parallel corpus processing toolkit☆115Updated this week
- MIT Language Modeling Toolkit☆118Nov 30, 2019Updated 6 years ago
- Document context language models☆22Nov 13, 2015Updated 10 years ago
- Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)☆11Dec 4, 2023Updated 2 years ago
- Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)☆12Aug 5, 2018Updated 7 years ago
- Speech recognition in Python made easy and flexible☆11Sep 12, 2015Updated 10 years ago
- A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.☆166May 12, 2021Updated 4 years ago
- The SRL-based Open IE extractor. A principal component of Open IE 4.0.☆19Oct 31, 2017Updated 8 years ago
- CS224S Course Project☆14Jun 9, 2014Updated 11 years ago
- ☆16May 7, 2018Updated 7 years ago
- Script for converting kaldi GMM/HMM models to HTK format☆11Jul 18, 2024Updated last year
- Links parts of input text to Wikipedia articles☆16Sep 9, 2012Updated 13 years ago
- ☆14Jun 12, 2015Updated 10 years ago
- An implementation of a HMM Ngram language model.☆11Mar 12, 2015Updated 10 years ago
- Recurrent Neural Network language modeling toolkit☆38Jan 23, 2014Updated 12 years ago
- An implementation of gibbs sampling for Latent Dirichlet Allocation☆30Aug 3, 2011Updated 14 years ago
- Word Error Rate Estimation☆16Aug 25, 2020Updated 5 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- Python клиент API распознавания и синтеза речи Облака ЦРТ☆11Dec 26, 2022Updated 3 years ago
- A semantic analysis tool to generate synonym.txt files for Solr. [RETIRED]☆25Sep 14, 2016Updated 9 years ago
- Grapheme to phoneme toolkit using joint-modelling + CRFs in java