jermp / tongrams_estimationLinks
A C++ library implementing fast language models estimation using the 1-Sort algorithm.
☆17Updated 2 years ago
Alternatives and similar repositories for tongrams_estimation
Users that are interested in tongrams_estimation are comparing it to the libraries listed below
Sorting:
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Updated 7 years ago
- A C++ library providing fast language model queries in compressed space.☆132Updated 2 years ago
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆41Updated 2 years ago
- Deep learning model of machine translation using attentional and structural biases☆13Updated 8 years ago
- UniParse: A universal graph-based parsing toolkit☆10Updated 5 years ago
- A database of number names for 186 languages, locales, and scripts☆67Updated 2 years ago
- An Efficient Language Model Using Double-Array Structures☆17Updated 5 years ago
- Attentional Neural Network that translates text to phones.☆11Updated 7 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Updated 2 years ago
- Efficient and effective query auto-completion in C++.☆55Updated last year
- BERT models for many languages created from Wikipedia texts☆33Updated 5 years ago
- Fast stand-alone C++ decoder for RNN-based NMT models☆28Updated 4 years ago
- MozoLM: A language model (LM) serving library☆45Updated last week
- Library and command line utility to do approximate string matching of a source against a bitext index and get matched source and target.☆52Updated 4 months ago
- ☆28Updated 4 years ago
- Fast SymSpell written in c++ and exposes to python via pybind11☆44Updated 6 months ago
- Converter from UD-trees to BART representation☆36Updated last year
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Updated 3 years ago
- zero-vocab or low-vocab embeddings☆18Updated 3 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 4 years ago
- Compute the most likely permutation of a lattice given an LM☆10Updated 12 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 4 years ago
- ☆16Updated 6 years ago
- Multi-lingual Text Processing☆96Updated 6 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- ASR transcription and SLU annotation web interface for call logs collected at UFAL-DSG.☆11Updated 10 years ago
- Read-only unofficial mirror of OpenFst☆44Updated 3 years ago
- Library for fast text representation and classification.☆31Updated last year
- CS224S Course Project☆14Updated 11 years ago
- A framework for graph-based dependency parsing.☆17Updated 3 years ago