yohokuno / count-ngram
Count frequent n-gram from big data with limited memory.
☆59Updated 11 years ago
Alternatives and similar repositories for count-ngram
Users that are interested in count-ngram are comparing it to the libraries listed below
Sorting:
- Code for the ACL-2015 paper "Accurate Linear-Time Chinese Word Segmentation via Embedding Matching"☆38Updated 9 years ago
- ☆29Updated 9 years ago
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Updated 11 years ago
- Code for Exploring Segment Representations for Neural Segmentation Models☆30Updated 8 years ago
- A light-weight matrix factorization tool☆39Updated 7 years ago
- LibN3L: A light-weight neural network package for natural language☆82Updated 9 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 10 years ago
- Distributed LDA, takes raw text as input and outputs topic word table.☆16Updated 9 years ago
- Deep reinforcement learning with TensorFlow☆47Updated 7 years ago
- ☆70Updated 10 years ago
- Deep Learning for NLP resources☆17Updated 9 years ago
- Three open source versions of LDA with collapsed Gibbs Sampling, modified by nanjunxiao☆26Updated 9 years ago
- Sentiment Analysis with Ensemble☆244Updated 8 years ago
- java neural network☆16Updated 8 years ago
- A C++ toolkit for neural machine translation for CPU☆88Updated 5 years ago
- Word segmentation using neural networks based on package https://github.com/SUTDNLP/LibN3L☆23Updated 9 years ago
- ☆87Updated 8 years ago
- Cache efficient implementation for Latent Dirichlet Allocation☆164Updated 6 years ago
- This code is for Convolutional Latent Semantic Model, which is similay with DSSM(Deep Semantic Similarity Model).☆25Updated 9 years ago
- Train a CRF for syntactic chunking (CoNLL2000), and use word representations☆43Updated 15 years ago
- Chinese Words Segment Library based on HMM model☆166Updated 10 years ago
- An reimplementation of Microsoft DSSM☆12Updated 9 years ago
- NLTK Source☆31Updated 10 years ago
- The tensorflow implementation of NIPS2016 paper "LightRNN: Memory and Computation-Efficient Recurrent Neural Networks" (https://arxiv.org…☆56Updated 8 years ago
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm☆245Updated 12 years ago
- LASSO is a parallel regression model learning system☆69Updated 11 years ago
- compare embedding☆237Updated 9 years ago
- Chinese Word Similarity Computation based on HowNet☆27Updated 7 years ago
- ☆127Updated 8 years ago
- Parallelizing word2vec in shared and distributed memory☆190Updated 2 years ago