vsiivola/variKN

A toolkit for producing n-gram language models. The highlights are the implementation of Kneser-Ney growing and revised Kneser pruning methods.

☆42

Alternatives and similar repositories for variKN

Users who are interested in variKN are comparing it to the libraries listed below.

aalto-speech / subword-kaldi
Properly handle position-dependent phones in a subword lexicon FST
☆31 · Oct 26, 2020 · Updated 5 years ago
idiap / inv-tn
A bunch of scripts exploiting several tools to perform inverse text normalization (ITN)
☆21 · Sep 27, 2017 · Updated 8 years ago
ymoslem / MT-Tools
Collection of Common Machine Translation Tools
☆11 · Jul 26, 2022 · Updated 4 years ago
freerussianasr / recipes
☆16 · May 7, 2018 · Updated 8 years ago
Helsinki-NLP / OPUS-MT-testsets
Benchmarks for evaluating MT models
☆11 · Jun 26, 2024 · Updated 2 years ago
Helsinki-NLP / OpusFilter
OpusFilter - Parallel corpus processing toolkit
☆115 · Jul 1, 2026 · Updated 3 weeks ago
mitlm / mitlm
MIT Language Modeling Toolkit
☆120 · Nov 30, 2019 · Updated 6 years ago
dansoutner / kaldi2htk
Script for converting Kaldi GMM/HMM models to HTK format
☆11 · Jul 18, 2024 · Updated 2 years ago
TomEversdijk / Git-Guide
A basic git guide
☆12 · Jul 18, 2022 · Updated 4 years ago
kan-bayashi / Taco2withBERT
Tacotron2 with BERT examples
☆10 · Jul 8, 2019 · Updated 7 years ago
speechpro / cloud-python
Python client for the speech recognition and synthesis API of Облако ЦРТ (STC Cloud)
☆11 · Dec 26, 2022 · Updated 3 years ago
google-research-datasets / TextNormalizationCoveringGrammars
Covering grammars for English and Russian text normalization
☆61 · Sep 15, 2019 · Updated 6 years ago
jiyfeng / dclm
Document context language models
☆21 · Nov 13, 2015 · Updated 10 years ago
qcri / e-wer
Word Error Rate Estimation
☆16 · Aug 25, 2020 · Updated 5 years ago
lmc2179 / ngram-language-model
An implementation of an HMM n-gram language model.
☆10 · Mar 12, 2015 · Updated 11 years ago
uds-lsv / TF-NNLM-TK
A toolkit for neural language modeling using TensorFlow, including basic models like RNNs and LSTMs as well as more advanced models.
☆21 · Jan 31, 2019 · Updated 7 years ago
GuillaumeDD / dialign
Automatic and generic measures of verbal alignment in dyadic dialogue based on sequential pattern mining at the level of surface of text …
☆13 · May 11, 2025 · Updated last year
BramVanroy / spacy_download
Download and load spaCy models on-the-fly
☆15 · Feb 9, 2023 · Updated 3 years ago
burrmill / burrmill
BurrMill core
☆22 · Nov 2, 2021 · Updated 4 years ago
moses-smt / mgiza
A word alignment tool based on the famous GIZA++, extended to support multi-threading, resumed training, and incremental training.
☆167 · May 12, 2021 · Updated 5 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
☆36 · Oct 4, 2019 · Updated 6 years ago
isca-sig-rosp / ISCA-SIG-RoSP
Web page for ISCA Special Interest Group: Robust Speech Processing (RoSP)
☆11 · Dec 4, 2023 · Updated 2 years ago
danpovey / pocolm
Small language toolkit for creation, interpolation and pruning of ARPA language models
☆92 · Aug 6, 2022 · Updated 3 years ago
KWTsou1220 / mann-for-speech-separation
Neural Turing machine for source separation in TensorFlow
☆18 · Aug 16, 2017 · Updated 8 years ago
neubig / kylm
The Kyoto Language Modeling Toolkit
☆27 · Nov 27, 2014 · Updated 11 years ago
vadimkantorov / inferspeech
PyTorch speech2text inference script for the NVIDIA OpenSeq2Seq wav2letter model variant
☆10 · Aug 12, 2019 · Updated 6 years ago
CSLT-THU / IS2019-VAE
TensorFlow and Kaldi implementation of our paper "VAE-based regularization for deep speaker embedding"
☆11 · Mar 24, 2023 · Updated 3 years ago
vectominist / MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53 · Dec 6, 2022 · Updated 3 years ago
bajibabu / make_full_labels
How to generate full-context labels from unseen text for HMM-based speech synthesis (HTS)
☆12 · Nov 22, 2019 · Updated 6 years ago
knub / sentence-boundary-detection-nn
Sentence Boundary Detection using Deep Neural Networks.
☆20 · Oct 24, 2016 · Updated 9 years ago
jpuigcerver / kaldi-decoders
Custom decoders for Kaldi
☆81 · Jun 10, 2019 · Updated 7 years ago
robertostling / eflomal
Efficient Low-Memory Aligner
☆148 · Jan 15, 2025 · Updated last year
AI-Guru / SincNet
Keras implementation of SincNet (https://github.com/mravanelli/SincNet, https://arxiv.org/abs/1808.00158)
☆12 · Aug 5, 2018 · Updated 7 years ago
jzlianglu / pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
☆173 · Jul 1, 2020 · Updated 6 years ago
Avmb / clweadv
Towards cross-lingual distributed representations without parallel text trained with adversarial autoencoders
☆22 · Aug 11, 2016 · Updated 9 years ago
lolpa1n / digital-peter-ocrv
1st place (public LB) solution of AIJ2020 Sberbank competition (Digital Peter)
☆18 · Nov 22, 2020 · Updated 5 years ago
anyks / alm
Smart Language Model
☆45 · Dec 21, 2022 · Updated 3 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32 · Apr 8, 2022 · Updated 4 years ago
krylm / whisper-event-tuning
Final training script from the Hugging Face Whisper fine-tuning event, for getting the best results from a fine-tuned model.
☆12 · Dec 24, 2022 · Updated 3 years ago