yingrui / mahjong
开源中文分词工具包,中文分词Web API,Lucene中文分词,中英文混合分词
☆43Updated 4 years ago
Alternatives and similar repositories for mahjong:
Users that are interested in mahjong are comparing it to the libraries listed below
- 中文自然语言处理工具包☆86Updated 9 years ago
- tyccl(同义词词林) is a ruby gem that provides friendly functions to analyse similarity between Chinese Words.☆46Updated 11 years ago
- Parallelizing Stochastic Gradient Descent for Deep Convolutional Neural Network☆45Updated 9 years ago
- 基于Akka开发的爬虫服务,非阻塞、高并发、实时☆26Updated 9 years ago
- Predictive analatics using deepLearning4j and Spark☆26Updated 8 years ago
- nutz+jetty+h2 做的一个web应用☆40Updated 8 years ago
- stan-cn-nlp: an API wrapper based on Stanford NLP packages for the convenience of Chinese users☆57Updated 8 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆107Updated 10 years ago
- Chinese Tokenizer; New words Finder. 中文三段式机械分词算法; 未登录新词发现算法☆95Updated 8 years ago
- Clone version of LingPipe 4.1.0, with support for unsupervised training☆32Updated 11 years ago
- this is my presentaion area .个人演讲稿展示区,主要展示一些平时的个人演讲稿或者心得之类的,☆57Updated 4 years ago
- Detect duplicated items。内容排重框架。☆11Updated 10 years ago
- 本项目转移到https://github.com/cocolian/cocolian-nlp☆34Updated 10 years ago
- 复旦的中文自然语言工具包☆72Updated 7 years ago
- Distributed optimization framework with parameter server☆23Updated 9 years ago
- LASER-A Scalable Response Prediction Platform For Online Advertising☆48Updated 10 years ago
- LDA 的java实现☆63Updated 9 years ago
- the Chinese NLP full stack toolkit☆41Updated 10 years ago
- ☆10Updated 9 years ago
- 基于Scala Akka的分布式主题网络爬虫☆14Updated 5 years ago
- Open-domain question answering system from UNC Charlotte☆61Updated 9 years ago
- An interface of mllib and ml algorithms implemented by jddata with spark☆23Updated 10 years ago
- the python code of the book:Machine Learning for Spark☆8Updated 8 years ago
- tools for chinese word segmentation and pos tagging written in python☆38Updated 11 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Updated 8 years ago
- Berkeley DB Java Edition is a open source, transactional storage solution for Java applications. The Direct Persistence Layer (DPL) API i…☆13Updated 11 years ago
- 文本去重算法,研究自推荐系统中新闻的去重,采用了雅虎的Near-duplicates and shingling算法,服务端用c实现,客户端用java实现,利用thrift框架进行通信,为了提高扩展性,去重可以在服务端实现,服务器也提供了计算的接口,方便客户端自己扩展☆24Updated 11 years ago
- word2vec的Java并行实现☆126Updated 8 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model☆79Updated 11 years ago
- A fork of cascading patterns, but implemented for trident☆71Updated last year