A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency counting. Words are converted to unique IDs and encoded to more compact base 256 integers. It is a partial implementation of Dr. Vlado Keselj 's Text-Ngrams 1.6, which is a very flexible Ngram package in perl.
☆20May 11, 2015Updated 11 years ago
Alternatives and similar repositories for ngrams
Users that are interested in ngrams are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Generate word-word similarities from Gensim's latent semantic indexing (Python)☆11Jan 10, 2017Updated 9 years ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆20Updated this week
- My website developed in pelican☆14Dec 26, 2025Updated 5 months ago
- SP-10K is a large-scale human-annotated selectional preference set. Five selectional preference relations are included.☆12May 6, 2020Updated 6 years ago
- Wrapper to pocketsphinx phoneme labeling tools☆18Sep 9, 2016Updated 9 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- JSON Schema forms implemented in Angular.☆11Nov 27, 2025Updated 6 months ago
- ☆15Feb 11, 2018Updated 8 years ago
- C++ implementation of a part-of-speech (POS) tagger using the lookahead tagging algorithm.☆12Jul 2, 2019Updated 6 years ago
- Automatic Differentiation for OpenCL.☆20Mar 4, 2015Updated 11 years ago
- Fisher vectors for video classification☆21May 7, 2018Updated 8 years ago
- ☆13Oct 30, 2025Updated 7 months ago
- Lupa for Torch☆10Sep 16, 2015Updated 10 years ago
- Searching in-memory corpus with Corpus Query Language (CQL)☆19Dec 2, 2024Updated last year
- An open relation extraction system☆48Nov 23, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for Findings of EMNLP 2022 short paper "CDGP: Automatic Cloze Distractor Generation based on Pre-trained Language Model".☆14May 22, 2023Updated 3 years ago
- ☆10Dec 8, 2017Updated 8 years ago
- Official code for the paper: Scaling Transformers for Discriminative Recommendation via Generative Pretraining☆29Sep 1, 2025Updated 9 months ago
- Frida学习☆18Jun 8, 2021Updated 5 years ago
- Docker image for Tensorflow and Keras with CUDA support☆10Dec 1, 2016Updated 9 years ago
- ☆11Mar 20, 2023Updated 3 years ago
- UD513 - Data Structures and Algorithms in Python - Udacity Course☆15Jun 18, 2019Updated 6 years ago
- Automatic Gap-Fill Question Generation☆18May 30, 2024Updated 2 years ago
- The open infrastructure for Physical Intelligence. ROSClaw grounds AI agents into the physical world through e-URDF, sandbox safety, capa…☆105Jun 7, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Get your latitude/longitude via wifi access points☆15Sep 25, 2012Updated 13 years ago
- Windows library for hooking functions across processes, injecting DLLs into other applications, and more. (Somewhat similar to MS Detours…☆12Apr 2, 2013Updated 13 years ago
- A sample monorepo of several Python libraries and commands, using Bazel as build system☆13Oct 11, 2017Updated 8 years ago
- ☆10Feb 17, 2024Updated 2 years ago
- ☆14Feb 7, 2024Updated 2 years ago
- A collection of firmwares compiled for TKG (TMK Keymap Generator).☆11Aug 8, 2018Updated 7 years ago
- Redis tcp map for postfix☆12Jun 28, 2024Updated last year
- (wip) toaru c compiler☆20Nov 1, 2018Updated 7 years ago
- ☆24Feb 3, 2012Updated 14 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- TH4J- A wrapper of torch TH library for Java (JVM langauges).☆11Nov 4, 2015Updated 10 years ago
- A naive web crawler written in C☆28Mar 8, 2013Updated 13 years ago
- Yet another dependency parser, integrated with tokenizer, tagger and visualization tool.☆11Mar 18, 2018Updated 8 years ago
- .NET bindings for the Pytorch engine☆17Oct 26, 2019Updated 6 years ago
- ☆25Sep 7, 2021Updated 4 years ago
- An embedded mathematical expression evaluator in C99☆15Apr 7, 2017Updated 9 years ago
- Semantic Textual Similarity (STS) measures the degree of equivalence in the underlying semantics of paired snippets of text.☆97Oct 18, 2021Updated 4 years ago