jerry2yu / ngramsLinks
A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency counting. Words are converted to unique IDs and encoded to more compact base 256 integers. It is a partial implementation of Dr. Vlado Keselj 's Text-Ngrams 1.6, which is a very flexible Ngram package in perl.
☆20Updated 10 years ago
Alternatives and similar repositories for ngrams
Users that are interested in ngrams are comparing it to the libraries listed below
Sorting:
- ☆21Updated 8 years ago
- Tree-Structured, First- and Higher-Order Linear Chain, and Semi-Markov CRFs☆45Updated 5 years ago
- WordRank: Learning Word Embeddings via Robust Ranking☆51Updated 7 years ago
- CS224S Course Project☆14Updated 11 years ago
- This repository contains source code to binarize any real-value word embeddings into binary vectors.☆48Updated 4 years ago
- Dynamic Entity Summarization (DynES)☆20Updated 6 years ago
- Standalone Neural Ranking Model (SNRM)☆76Updated 6 years ago
- Extractive and Compressive Neural Summarization Based on Summary State Representations (NAACL 2019)☆16Updated 5 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆129Updated 10 months ago
- Extractors whose input is a chunked sentence. Includes Relnoun, Nesty, and a scala interface for ReVerb.☆28Updated 8 years ago
- SMOR (Stuttgart Morphology) with alternative lemmatization component☆12Updated 2 years ago
- ☆43Updated 10 years ago
- Corpus preprocessing☆99Updated last year
- SWIG Wrapper for the SRILM toolkit☆35Updated 5 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆14Updated 8 years ago
- Entity Linking in Queries: Tasks and Evaluation☆33Updated 2 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Updated 8 years ago
- This repository includes code for replicating the results in the paper "Word Ordering Without Syntax" (2016).☆21Updated 8 years ago
- Language modeling scripts based on TensorFlow☆58Updated 6 years ago
- Easy-first dependency parser based on Hierarchical Tree LSTMs☆32Updated 8 years ago
- Semantic embeddings of entities☆66Updated 9 years ago
- cicada: a hypergraph-based toolkit for statistical machine translation based on {tree, string}-to-{tree, string} models☆42Updated 4 years ago
- Implicit relation extractor using a natural language model.☆24Updated 7 years ago
- Morfessor FlatCat☆13Updated 6 years ago
- Named Entity Recognition (NER) models (neural and sparse) implemented based on package LibN3L☆19Updated 8 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Updated 10 years ago
- Hacky implementation of ppjoin by Chuan Xia et Al☆19Updated 11 years ago
- Deep learning model of machine translation using attentional and structural biases☆13Updated 8 years ago
- Entity Linking in Queries: Efficiency vs. Effectiveness☆18Updated 7 years ago
- Fine-grained Entity Typing / Fine-grained Entity Classification☆12Updated 7 years ago