jermp / tongrams_estimation
A C++ library implementing fast language model estimation using the 1-Sort algorithm.
☆17 · Updated last year
Alternatives and similar repositories for tongrams_estimation:
Users who are interested in tongrams_estimation are comparing it to the libraries listed below.
- Utilities for manipulating finite state transducers with the OpenFst library. ☆31 · Updated 7 years ago
- 🌳 A compressed rank/select dictionary exploiting approximate linearity and repetitiveness. ☆11 · Updated 2 years ago
- Implementation of QuadSketch algorithm ☆11 · Updated 2 years ago
- Implementation of many similarity join algorithms. ☆15 · Updated 11 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights ☆19 · Updated 2 years ago
- Efficient and effective query auto-completion in C++. ☆53 · Updated last year
- An Efficient Language Model Using Double-Array Structures ☆17 · Updated 4 years ago
- An efficient algorithm for k-bounded (Damerau-)Levenshtein distance ☆16 · Updated 6 years ago
- A fast high dimensional near neighbor search algorithm based on group testing and locality sensitive hashing ☆21 · Updated last year
- Risk Minimization Algorithms in Structured Prediction (JMLR 2016) ☆13 · Updated 8 years ago
- MlpIndex - Extremely fast ordered index via memory level parallelism ☆12 · Updated 6 years ago
- Short Text Similarity as described in https://dl.acm.org/citation.cfm?id=2806475 ☆16 · Updated 6 years ago
- A Space-Optimal Grammar Compression ☆10 · Updated 4 years ago
- Playing with arithmetic coding and RNNs ☆22 · Updated 8 years ago
- Simplifying parsing of large jsonline files in NLP Workflows ☆12 · Updated 3 years ago
- Deep learning model of machine translation using attentional and structural biases ☆13 · Updated 7 years ago
- Question Dependent Recurrent Entity Network ☆13 · Updated 7 years ago
- Implementation of generative semantic grammar. ☆18 · Updated 2 years ago
- Universe-sliced indexes in C++. ☆18 · Updated 2 years ago
- Deep learning spelling patterns with a recurrent neural network ☆12 · Updated 7 years ago
- A neural network for filtering target speaker's voice from audio written in tensorflow ☆21 · Updated 6 years ago
- A C++ library providing fast language model queries in compressed space. ☆129 · Updated 2 years ago
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic… ☆11 · Updated 4 years ago
- Fast stand-alone C++ decoder for RNN-based NMT models ☆25 · Updated 4 years ago
- Highly specialized crate to parse and use `google/sentencepiece`'s precompiled_charsmap in `tokenizers` ☆18 · Updated 2 years ago
- LEMON: Explainable Entity Matching ☆18 · Updated 2 years ago
- Robust Cross-lingual Embeddings from Parallel Sentences ☆22 · Updated 4 years ago
- zero-vocab or low-vocab embeddings ☆18 · Updated 2 years ago
- NUANCED is a user-centric conversational recommendation dataset that contains 5.1k annotated dialogues and 26k high-quality user turns. ☆18 · Updated 3 years ago
- Content Addressable Memory using dimensionality reduction ☆12 · Updated 7 years ago