A C++ package for character or word n-gram analysis. It uses a ternary search tree instead of a hash table for faster n-gram frequency counting. Words are converted to unique IDs and encoded as more compact base-256 integers. It is a partial implementation of Dr. Vlado Keselj's Text-Ngrams 1.6, a very flexible n-gram package written in Perl.
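The two ideas in the description, counting n-grams in a ternary search tree and packing word IDs into base-256 bytes, can be illustrated with a minimal C++17 sketch. This is not the package's actual code: the names `TernaryCounter`, `count_char_ngrams`, and `encode_base256` are hypothetical, the byte order is an arbitrary choice, and how the real package delimits consecutive IDs inside a word n-gram is not stated in the description.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <string>
#include <string_view>

// Hypothetical ternary search tree that maps byte strings to counts.
// Each node holds one character and three links: lo (smaller character),
// eq (next character of the key), hi (larger character).
class TernaryCounter {
public:
    // Increment the count stored for `key`; the key must be non-empty.
    void add(std::string_view key) {
        assert(!key.empty());
        std::unique_ptr<Node>* link = &root_;
        std::size_t i = 0;
        while (true) {
            if (!*link) *link = std::make_unique<Node>(key[i]);
            Node* node = link->get();
            if (key[i] < node->ch) {
                link = &node->lo;
            } else if (key[i] > node->ch) {
                link = &node->hi;
            } else if (i + 1 == key.size()) {
                ++node->count;  // last character matched: key ends here
                return;
            } else {
                link = &node->eq;  // character matched: advance in the key
                ++i;
            }
        }
    }

    // Return how many times `key` was added (0 if never seen).
    std::size_t count(std::string_view key) const {
        if (key.empty()) return 0;
        const Node* node = root_.get();
        std::size_t i = 0;
        while (node) {
            if (key[i] < node->ch) node = node->lo.get();
            else if (key[i] > node->ch) node = node->hi.get();
            else if (i + 1 == key.size()) return node->count;
            else { node = node->eq.get(); ++i; }
        }
        return 0;
    }

private:
    struct Node {
        explicit Node(char c) : ch(c) {}
        char ch;
        std::size_t count = 0;
        std::unique_ptr<Node> lo, eq, hi;
    };
    std::unique_ptr<Node> root_;
};

// Slide a window of n characters over `text` and count every n-gram.
void count_char_ngrams(std::string_view text, std::size_t n, TernaryCounter& out) {
    assert(n >= 1);
    for (std::size_t i = 0; i + n <= text.size(); ++i) {
        out.add(text.substr(i, n));
    }
}

// Encode a word ID as base-256 digits, least significant byte first
// (byte order is an assumption of this sketch). IDs below 256 take one byte.
std::string encode_base256(std::uint32_t id) {
    std::string bytes;
    do {
        bytes.push_back(static_cast<char>(id & 0xFF));
        id >>= 8;
    } while (id != 0);
    return bytes;
}
```

As a usage example, calling `count_char_ngrams("banana", 2, counter)` on an empty `TernaryCounter` leaves `counter.count("an")` and `counter.count("na")` at 2 and `counter.count("ba")` at 1. A word-level variant would map each word to an ID, encode it with something like `encode_base256`, and insert the concatenated bytes as the key, provided the encoding keeps ID boundaries unambiguous.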
☆20 · May 11, 2015 · Updated 10 years ago
Alternatives and similar repositories for ngrams
Users that are interested in ngrams are comparing it to the libraries listed below.
- SIMD-enabled column imprints ☆11 · Feb 12, 2018 · Updated 8 years ago
- Generate word-word similarities from Gensim's latent semantic indexing (Python) ☆11 · Jan 10, 2017 · Updated 9 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups ☆20 · Mar 17, 2026 · Updated last week
- Benchmark of common hash functions ☆10 · Sep 15, 2019 · Updated 6 years ago
- Library for Character/Word n-gram Analysis ☆23 · Mar 2, 2017 · Updated 9 years ago
- Bloom Filter Benchmark for Heterogeneous Hardware. ☆10 · May 19, 2019 · Updated 6 years ago
- Wrapper to pocketsphinx phoneme labeling tools ☆18 · Sep 9, 2016 · Updated 9 years ago
- Full and flexible code to simulate several Markowitz Portfolios using R and free stock market data. ☆13 · Nov 22, 2020 · Updated 5 years ago
- Tools for work with the US Dept. of Transport on-time data and MonetDB ☆13 · Mar 14, 2021 · Updated 5 years ago
- JSON Schema forms implemented in Angular. ☆11 · Nov 27, 2025 · Updated 3 months ago
- Query engine synthesizer based on our domain-specific language, VOILA ☆13 · Mar 2, 2021 · Updated 5 years ago
- ☆15 · Feb 11, 2018 · Updated 8 years ago
- Automatic Differentiation for OpenCL. ☆20 · Mar 4, 2015 · Updated 11 years ago
- A crowdsourcing website reassembling the social network of early modern Britain ☆34 · Nov 6, 2018 · Updated 7 years ago
- Fisher vectors for video classification ☆21 · May 7, 2018 · Updated 7 years ago
- ☆13 · Oct 30, 2025 · Updated 4 months ago
- Lupa for Torch ☆10 · Sep 16, 2015 · Updated 10 years ago
- Stand-alone implementation of the progressive indexing algorithms. ☆17 · Apr 22, 2020 · Updated 5 years ago
- Searching in-memory corpus with Corpus Query Language (CQL) ☆19 · Dec 2, 2024 · Updated last year
- An open relation extraction system ☆47 · Nov 23, 2021 · Updated 4 years ago
- ☆10 · Dec 8, 2017 · Updated 8 years ago
- General matrix multiplication for ndarrays ☆18 · Jan 7, 2016 · Updated 10 years ago
- Code for Findings of EMNLP 2022 short paper "CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model". ☆14 · May 22, 2023 · Updated 2 years ago
- A page describing how to ship torch binaries without sharing the source code of your scripts. ☆17 · Nov 2, 2015 · Updated 10 years ago
- A Lua wrapper for the Arcade Learning Environment ☆17 · May 9, 2014 · Updated 11 years ago
- ☆11 · Jan 22, 2015 · Updated 11 years ago
- ☆14 · Jul 29, 2019 · Updated 6 years ago
- UD513 - Data Structures and Algorithms in Python - Udacity Course ☆15 · Jun 18, 2019 · Updated 6 years ago
- Windows library for hooking functions across processes, injecting DLLs into other applications, and more. (Somewhat similar to MS Detours… ☆12 · Apr 2, 2013 · Updated 12 years ago
- A sample monorepo of several Python libraries and commands, using Bazel as build system ☆13 · Oct 11, 2017 · Updated 8 years ago
- ☆10 · Feb 17, 2024 · Updated 2 years ago
- Azure TTS (Text-to-Speech) plugin for Unreal Engine ☆21 · Sep 30, 2021 · Updated 4 years ago
- Wrapper of the arXiv API for Rust ☆18 · Nov 7, 2025 · Updated 4 months ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg… ☆129 · Feb 5, 2026 · Updated last month
- A collection of firmwares compiled for TKG (TMK Keymap Generator). ☆11 · Aug 8, 2018 · Updated 7 years ago
- ACTER is a manually annotated dataset for term extraction, covering 3 languages (English, French, and Dutch), and 4 domains (corruption, … ☆24 · Apr 8, 2022 · Updated 3 years ago
- ☆24 · Feb 3, 2012 · Updated 14 years ago
- TH4J - A wrapper of the torch TH library for Java (JVM languages). ☆11 · Nov 4, 2015 · Updated 10 years ago
- Yet another dependency parser, integrated with tokenizer, tagger and visualization tool. ☆11 · Mar 18, 2018 · Updated 8 years ago