jerry2yu / ngramsLinks
A package in C++ for character or word ngram analysis. It uses Ternary Search Tree instead of hashing table for faster ngram frequency counting. Words are converted to unique IDs and encoded to more compact base 256 integers. It is a partial implementation of Dr. Vlado Keselj 's Text-Ngrams 1.6, which is a very flexible Ngram package in perl.
☆20Updated 10 years ago
Alternatives and similar repositories for ngrams
Users that are interested in ngrams are comparing it to the libraries listed below
Sorting:
- ☆16Updated 10 years ago
- Code and data related to "Efficient, Compositional, Order-Sensitive n-gram Embeddings" (EACL 2017)☆14Updated 8 years ago
- Implement Overcoming the Lack of Parallel Data in Sentence Compression Katja Filippova and Yasemin Altun Google☆14Updated 8 years ago
- Context Encoders (ConEc) as a simple but powerful extension of the word2vec model for learning word embeddings☆21Updated 5 years ago
- RhetoricalRecursiveNeuralNetwork(R2N2) is recursive neural network using RST for NLP Tasks such as Sentiment Analysis☆12Updated 9 years ago
- Entity Linking in Queries: Efficiency vs. Effectiveness☆18Updated 7 years ago
- Dependency-based Word Embeddings (Levy and Goldberg, 2014) with BZ2 compression support.☆21Updated 9 years ago
- Integrated tool for learning narrative chains a la Chambers and Jurafsky, 2008.☆9Updated 9 years ago
- C++ implementation of the Hellinger PCA for computing word embeddings.☆32Updated 8 years ago
- Frame-Semantic and PropBank Semantic Role Labeling with Syntactic Scaffolding.☆50Updated 3 years ago
- Java code from the 2008 EMNLP paper "Bayesian Unsupervised Topic Segmentation" by Eisenstein and Barzilay☆36Updated 9 years ago
- CytonMT: an Efficient Neural Machine Translation Open-source Toolkit Implemented in C++☆21Updated 6 years ago
- Training scripts for paper Miceli Barone et al. 2017 "Deep Architectures for Neural Machine Translation"☆11Updated 7 years ago
- Dynamic Entity Summarization (DynES)☆20Updated 6 years ago
- ☆10Updated 6 years ago
- Implementation of Word Embedding-based Antonym Detection using Thesauri and Distributional Information in NAACL2015☆35Updated 3 years ago
- ☆21Updated 8 years ago
- The SMAPH system for query entity linking.☆20Updated 6 years ago
- Context Aware Language Models☆28Updated 6 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆31Updated 7 years ago
- Parsing only with Pretraining Networks☆16Updated 10 months ago
- The Berkeley Word Aligner☆22Updated 9 years ago
- ☆19Updated 6 years ago
- C++ implementation of a part-of-speech (POS) tagger using the lookahead tagging algorithm.☆12Updated 5 years ago
- TACL 2017☆27Updated 7 years ago
- Implementation of "Arc-swift: A Novel Transition System for Dependency Parsing"☆32Updated 6 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 6 years ago
- A tool for classifying mistakes in the output of parsers☆40Updated last year
- ☆13Updated 8 years ago
- Variable-order CRFs with structure learning☆16Updated 10 months ago