yingrui / mahjong
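Open-source Chinese word segmentation toolkit, Chinese word segmentation Web API, Lucene Chinese word segmentation, mixed Chinese-English word segmentation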
☆43 Updated 4 years ago
Alternatives and similar repositories for mahjong
Users that are interested in mahjong are comparing it to the libraries listed below
- LASER-A Scalable Response Prediction Platform For Online Advertising ☆48 Updated 10 years ago
- Open-domain question answering system from UNC Charlotte ☆61 Updated 9 years ago
- Java implementation of LDA ☆63 Updated 9 years ago
- This project has moved to https://github.com/cocolian/cocolian-nlp ☆34 Updated 11 years ago
- tyccl (同义词词林) is a Ruby gem that provides friendly functions to analyse similarity between Chinese words. ☆46 Updated 11 years ago
- Fudan's Chinese natural language toolkit ☆72 Updated 8 years ago
- Parallelizing Stochastic Gradient Descent for Deep Convolutional Neural Network ☆45 Updated 9 years ago
- Parallel Java implementation of word2vec ☆126 Updated 9 years ago
- Stand-alone recommender system from Myrrix ☆108 Updated last year
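- Crawler service built on Akka: non-blocking, highly concurrent, real-time ☆26 Updated 9 years ago
- Chinese natural language processing toolkit ☆86 Updated 10 years ago
- Text deduplication algorithm, developed for deduplicating news in recommender systems. It uses Yahoo's near-duplicates and shingling algorithm; the server is implemented in C and the client in Java, and they communicate through the Thrift framework. For better extensibility, deduplication can run on the server side, and the server also exposes computation interfaces so that clients can build their own extensions. ☆24 Updated 11 years ago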
- Ytk-mp4j is a fast, user-friendly, cross-platform, multi-process, multi-thread collective message passing java library which includes gat… ☆108 Updated 8 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model ☆79 Updated 12 years ago
- Distributed text analysis suite based on Celery ☆96 Updated 2 years ago
- Backup of papers read ☆17 Updated 9 years ago
- NanGe - A Rule-based Chinese-English Machine Translation System ☆20 Updated 7 years ago
- Detect duplicated items. Content deduplication framework. ☆11 Updated 10 years ago
- Introduction and implementation of strategies (including Thompson Sampling) for the multi-armed bandit problem ☆44 Updated 7 years ago
- Clone version of LingPipe 4.1.0, with support for unsupervised training ☆32 Updated 11 years ago
- Semantic, sentiment, and similarity analysis. ☆58 Updated 9 years ago
- Item-Based Collaborative Filtering Spark Job (uses cosine similarity) ☆37 Updated 8 years ago
- keywords extraction ☆18 Updated 9 years ago
- Similarity computation software package ☆190 Updated last year
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm ☆245 Updated 12 years ago
- Tools for Chinese word segmentation and POS tagging written in Python ☆38 Updated 11 years ago
- Distributed machine learning algorithm for new word discovery. ☆15 Updated 10 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark. ☆107 Updated 10 years ago
- ☆10 Updated 9 years ago
- ☆29 Updated 9 years ago