jerry2yu / ngramsLinks
A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency counting. Words are converted to unique IDs and encoded to more compact base 256 integers. It is a partial implementation of Dr. Vlado Keselj 's Text-Ngrams 1.6, which is a very flexible Ngram package in perl.
☆20Updated 10 years ago
Alternatives and similar repositories for ngrams
Users that are interested in ngrams are comparing it to the libraries listed below
Sorting:
- Extractors whose input is a chunked sentence. Includes Relnoun, Nesty, and a scala interface for ReVerb.☆28Updated 8 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆15Updated 8 years ago
- Tree-Structured, First- and Higher-Order Linear Chain, and Semi-Markov CRFs☆45Updated 6 years ago
- Dynamic Entity Summarization (DynES)☆20Updated 6 years ago
- WordRank: Learning Word Embeddings via Robust Ranking☆51Updated 7 years ago
- Easy-first dependency parser based on Hierarchical Tree LSTMs☆32Updated 9 years ago
- Entity Linking in Queries: Efficiency vs. Effectiveness☆18Updated 8 years ago
- C++ implementation of the Hellinger PCA for computing word embeddings.☆32Updated 9 years ago
- Hierarchical word clustering, following "Brown clustering" (Brown et al., 1992)☆70Updated 10 years ago
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆10Updated 10 years ago
- CS224S Course Project☆14Updated 11 years ago
- Entity Linking in Queries: Tasks and Evaluation☆33Updated 2 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Updated 8 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆20Updated 5 years ago
- Semantic embeddings of entities☆66Updated 9 years ago
- Experiment with document similarity via Matt Kusner's MWD paper☆24Updated 9 years ago
- Named Entity Recognition (NER) models (neural and sparse) implemented based on package LibN3L☆19Updated 9 years ago
- ☆47Updated 8 years ago
- Open-source tools for morphological tagging, segmentation and stemming.☆40Updated 6 years ago
- ☆19Updated 7 years ago
- ☆31Updated 8 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Updated 8 years ago
- Neural Reranking for Named Entity Recognition, accepted as regular paper at RANLP 2017☆23Updated 8 years ago
- Standalone Neural Ranking Model (SNRM)☆76Updated 7 years ago
- Visualize constituent and dependency parses as PDF or image formats, through GraphViz.☆32Updated 4 years ago
- Fast Word Clustering Software☆79Updated last year
- ☆14Updated 9 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- Keras implementation of ontology aware token embeddings☆49Updated 7 years ago
- ☆25Updated 2 years ago