jerry2yu / ngramsLinks
A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency counting. Words are converted to unique IDs and encoded to more compact base 256 integers. It is a partial implementation of Dr. Vlado Keselj 's Text-Ngrams 1.6, which is a very flexible Ngram package in perl.
☆20Updated 10 years ago
Alternatives and similar repositories for ngrams
Users that are interested in ngrams are comparing it to the libraries listed below
Sorting:
- My most frequently used learning-to-rank algorithms ported to rust for efficiency. Try it: "pip install fastrank".☆52Updated 6 months ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆42Updated 5 years ago
- Extractors whose input is a chunked sentence. Includes Relnoun, Nesty, and a scala interface for ReVerb.☆28Updated 7 years ago
- This repository includes code for replicating the results in the paper "Word Ordering Without Syntax" (2016).☆21Updated 8 years ago
- Tools for working with the TREC CAR dataset.☆35Updated last month
- Accompanying repository of our AAAI-20 paper "Fine-Grained Argument Unit Recognition and Classification."☆21Updated 5 years ago
- Tree-Structured, First- and Higher-Order Linear Chain, and Semi-Markov CRFs☆45Updated 5 years ago
- Dynamic Entity Summarization (DynES)☆20Updated 6 years ago
- Java code from the 2008 EMNLP paper "Bayesian Unsupervised Topic Segmentation" by Eisenstein and Barzilay☆36Updated 9 years ago
- ☆21Updated 8 years ago
- Extractive and Compressive Neural Summarization Based on Summary State Representations (NAACL 2019)☆16Updated 5 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated 2 years ago
- Neural topic modeling☆29Updated 5 years ago
- Analyzes news stories for event schemas and templates.☆17Updated 9 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆14Updated 8 years ago
- Standalone Neural Ranking Model (SNRM)☆76Updated 6 years ago
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆47Updated 4 years ago
- Entity Linking in Queries: Efficiency vs. Effectiveness☆18Updated 7 years ago
- Word embedding approach based on a dynamic log-linear model☆54Updated 8 years ago
- ☆14Updated 8 years ago
- source code of bison☆26Updated 5 years ago
- WordRank: Learning Word Embeddings via Robust Ranking☆51Updated 7 years ago
- Statistical discontinuous constituent parsing☆11Updated 7 years ago
- Knowledge graph based information retrieval☆13Updated 6 years ago
- CS224S Course Project☆14Updated 11 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 4 years ago
- Entity Linking in Queries: Tasks and Evaluation☆33Updated last year
- Experimental search engine in C/C++17 - still in early development.☆27Updated this week
- An Efficient Language Model Using Double-Array Structures☆17Updated 5 years ago
- Modularizing Unsupervised Sense Embedding☆29Updated 7 years ago