jerry2yu / ngramsLinks

A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency counting. Words are converted to unique IDs and encoded to more compact base 256 integers. It is a partial implementation of Dr. Vlado Keselj 's Text-Ngrams 1.6, which is a very flexible Ngram package in perl.

☆20

Alternatives and similar repositories for ngrams

Users that are interested in ngrams are comparing it to the libraries listed below

Sorting:

shangjingbo1226 / PL2M
☆16Updated 10 years ago
azpoliak / eco
Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)
☆14Updated 8 years ago
zhaohengyang / Generate-Parallel-Data-for-Sentence-Compression
Implement Overcoming the Lack of Parallel Data in Sentence Compression Katja Filippova and Yasemin Altun Google
☆14Updated 8 years ago
cod3licious / conec
Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings
☆21Updated 5 years ago
parry2403 / R2N2
RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis
☆12Updated 9 years ago
hasibi / EntityLinkingInQueries-Methods
Entity Linking in Queries: Efficiency vs. Effectiveness
☆18Updated 7 years ago
stephenroller / word2vecfz
Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.
☆21Updated 9 years ago
rudinger / nachos
Integrated tool for learning narrative chains a la Chambers and Jurafsky, 2008.
☆9Updated 9 years ago
rlebret / hpca
C++ implementation of the Hellinger PCA for computing word embeddings.
☆32Updated 8 years ago
swabhs / scaffolding
Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.
☆50Updated 3 years ago
jacobeisenstein / bayes-seg
Java code from the 2008 EMNLP paper "Bayesian Unsupervised Topic Segmentation" by Eisenstein and Barzilay
☆36Updated 9 years ago
arthurxlw / cytonMt
CytonMT: an Efficient Neural Machine Translation Open-source Toolkit Implemented in C++
☆21Updated 6 years ago
Avmb / deep-nmt-architectures
Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"
☆11Updated 7 years ago
iai-group / DynamicEntitySummarization-DynES
Dynamic Entity Summarization (DynES)
☆20Updated 6 years ago
dbd-challenge / dbdc3
☆10Updated 6 years ago
tticoin / AntonymDetection
Implementation of Word Embedding-based Antonym Detection using Thesauri and Distributional Information in NAACL2015
☆35Updated 3 years ago
ldmt-muri / alignment-with-openfst
☆21Updated 8 years ago
marcocor / smaph
The SMAPH system for query entity linking.
☆20Updated 6 years ago
ajaech / calm
Context Aware Language Models
☆28Updated 6 years ago
benob / openfst-utils
Utilities for manipulating finite state transducers with the OpenFst library.
☆31Updated 7 years ago
aghie / parsing-as-pretraining
Parsing only with Pretraining Networks
☆16Updated 10 months ago
mhajiloo / berkeleyaligner
The Berkeley Word Aligner
☆22Updated 9 years ago
shyamupa / xelms
☆19Updated 6 years ago
brunexgeek / nlp-tools
C++ implementation of a part-of-speech (POS) tagger using the lookahead tagging algorithm.
☆12Updated 5 years ago
LeonCrashCode / InOrderParser
TACL 2017
☆27Updated 7 years ago
qipeng / arc-swift
Implementation of "Arc-swift: A Novel Transition System for Dependency Parsing"
☆32Updated 6 years ago
Oneplus / twpipe
Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.
☆28Updated 6 years ago
jkkummerfeld / berkeley-parser-analyser
A tool for classifying mistakes in the output of parsers
☆40Updated last year
clab / lstm-parser-with-beam-search
☆13Updated 8 years ago
timvieira / vocrf
Variable-order CRFs with structure learning
☆16Updated 10 months ago