jerry2yu / ngrams
A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency counting. Words are converted to unique IDs and encoded to more compact base 256 integers. It is a partial implementation of Dr. Vlado Keselj 's Text-Ngrams 1.6, which is a very flexible Ngram package in perl.
☆20Updated 10 years ago
Alternatives and similar repositories for ngrams
Users that are interested in ngrams are comparing it to the libraries listed below
Sorting:
- Simple Structured Perceptron tagger in Python☆10Updated 7 years ago
- ☆25Updated last year
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- Entity Linking in Queries: Efficiency vs. Effectiveness☆18Updated 7 years ago
- Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.☆21Updated 9 years ago
- A simple Python wrapper for the ClearNLP constituents-to-dependencies converter☆10Updated 9 years ago
- CytonMT: an Efficient Neural Machine Translation Open-source Toolkit Implemented in C++☆21Updated 6 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation"☆42Updated 3 years ago
- Neural Reranking for Named Entity Recognition, accepted as regular paper at RANLP 2017☆23Updated 7 years ago
- This is an implementation of the paper written by Yuhua Li, David McLean, Zuhair A. Bandar, James D. O’Shea, and Keeley Crockett☆20Updated 5 years ago
- Don't Count, Predict! An Automatic Approach to Learning Sentiment Lexicons for Short Text☆13Updated 8 years ago
- Dynamic Entity Summarization (DynES)☆20Updated 6 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆31Updated 7 years ago
- Zero-Shot Open Entity Typing as Type-Compatible Grounding, EMNLP'18.☆42Updated 5 years ago
- An entity linking prototype, developed using the datasets from the TAC-KBP sub-task.☆28Updated 8 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆21Updated 5 years ago
- Extractive and Compressive Neural Summarization Based on Summary State Representations (NAACL 2019)☆15Updated 5 years ago
- NER System Developed at CMU☆11Updated 7 years ago
- Tree-Structured, First- and Higher-Order Linear Chain, and Semi-Markov CRFs☆45Updated 5 years ago
- C++ implementation of a part-of-speech (POS) tagger using the lookahead tagging algorithm.☆12Updated 5 years ago
- Experiment with document similarity via Matt Kusner's MWD paper☆24Updated 8 years ago
- Implementation of Word Embedding-based Antonym Detection using Thesauri and Distributional Information in NAACL2015☆35Updated 3 years ago
- Easy-first dependency parser based on Hierarchical Tree LSTMs☆33Updated 8 years ago
- Entity Linking in Queries: Tasks and Evaluation☆33Updated last year
- Programme used to project the words having vector representation. It help to visualize, how efficiently words are represented☆7Updated 9 years ago
- a fork of Ronan Collobert's senna deep learning based NLP tools☆43Updated 12 years ago
- Named Entity Recognition (NER) models (neural and sparse) implemented based on package LibN3L☆19Updated 8 years ago
- ☆19Updated 6 years ago
- Fine-grained Entity Typing / Fine-grained Entity Classification☆12Updated 7 years ago
- CRFsuite with partial annotation. Used in our paper 'Domain adaptation for CRF-based Chinese word segmentation using free annotations'☆44Updated 9 years ago