yingrui / mahjong
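Open-source Chinese word segmentation toolkit: Chinese word segmentation Web API, Lucene Chinese word segmentation, and mixed Chinese-English segmentation
☆42 Updated 3 years ago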
Related projects
Alternatives and complementary repositories for mahjong
- tyccl (同义词词林) is a Ruby gem that provides friendly functions to analyse similarity between Chinese words. ☆46 Updated 10 years ago
- Chinese natural language processing toolkit ☆85 Updated 9 years ago
- yet another segmenter ☆20 Updated 11 years ago
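- Java implementation of LDA ☆62 Updated 8 years ago
- Chinese tokenizer; new-word finder. A three-stage mechanical Chinese word segmentation algorithm; an out-of-vocabulary new-word discovery algorithm ☆95 Updated 8 years ago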
- A Chinese Words Segmentation Tool Based on Bayes Model ☆78 Updated 11 years ago
- Machine Learning Using Spark ☆7 Updated 9 years ago
- Auto-generate Chinese words from huge text. ☆24 Updated 10 years ago
- ☆10 Updated 9 years ago
- ☆10 Updated 9 years ago
- Code for the ACL-2015 paper "Accurate Linear-Time Chinese Word Segmentation via Embedding Matching" ☆38 Updated 8 years ago
- stan-cn-nlp: an API wrapper based on Stanford NLP packages for the convenience of Chinese users ☆56 Updated 8 years ago
- Parallelizing Stochastic Gradient Descent for Deep Convolutional Neural Network ☆45 Updated 8 years ago
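- Fudan's Chinese natural language toolkit ☆70 Updated 7 years ago
- Detect duplicated items. A content deduplication framework. ☆11 Updated 9 years ago
- LASER: A Scalable Response Prediction Platform For Online Advertising ☆47 Updated 10 years ago
- An attempt at Chinese word segmentation based on deep learning ☆85 Updated 9 years ago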
- DeepDriver is a Java framework for deep learning; it currently supports ANN/CNN/DNN/RNN/LSTM, and its authors hope it can be widely used for deep learning devel… ☆98 Updated 6 years ago
- Predictive analytics using Deeplearning4j and Spark ☆26 Updated 7 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015. ☆37 Updated 9 years ago
- Clone version of LingPipe 4.1.0, with support for unsupervised training ☆32 Updated 11 years ago
- Distributed text analysis suite based on Celery ☆95 Updated last year
- Stanford CoreNLP: A Java suite of core NLP tools. ☆8 Updated 8 years ago
- Spark-based implementation of LambdaMART ☆11 Updated 9 years ago
- A web application built with nutz + jetty + h2 ☆40 Updated 8 years ago
- word2vec variations ☆8 Updated 6 years ago