zhuomingliang / paoding
Lucene 中文分词“庖丁解牛” Paoding Analysis
☆25Updated 13 years ago
Alternatives and similar repositories for paoding:
Users that are interested in paoding are comparing it to the libraries listed below
- Paoding分詞器,基於Lucene4.x forked from http://git.oschina.net/zhzhenqin/paoding-analysis☆45Updated 10 years ago
- nutz+jetty+h2 做的一个web应用☆40Updated 8 years ago
- 新浪微博模拟登陆2014-04-01版☆22Updated 10 years ago
- 数据挖掘算法及工具教程☆27Updated 8 years ago
- HtmlExtractor是一个Java实现的基于模板的网页结构化信息精准抽取组件。☆157Updated 6 years ago
- Apache Nutch Plugins for AJAX page fetch, parse, index☆87Updated 6 years ago
- 基于Apache Nutch和Htmlunit的扩展实现AJAX页面爬虫抓取解析插件☆123Updated 9 years ago
- 本项目转移到https://github.com/cocolian/cocolian-nlp☆34Updated 10 years ago
- Yo demo made by YunBa android SDK☆25Updated 10 years ago
- 自定制的精准短文本搜索服务☆18Updated 3 years ago
- grab directed data.☆20Updated 9 years ago
- 通过web服务器对word分词的资源进行集中统一管理☆17Updated 7 years ago
- A lite distributed Java spider framework :-)☆146Updated 7 years ago
- ☆68Updated 9 years ago
- solr分词器大补贴, 包括IK ANSJ、过滤器,动态加载词库☆59Updated 10 years ago
- A free-style benchmarking tool that can test anything callable by Java. And it produces apache-ab-like results☆56Updated 6 years ago
- ☆233Updated 4 months ago
- 基于hadoop思维的分布式网络爬虫。☆86Updated 8 years ago
- Sharding tables in database,just like taobao tddl.☆49Updated 11 years ago
- 天亮分词器第12个小版本☆8Updated 10 years ago
- Paoding Analysis Plugin for ElasticSearch☆21Updated 11 years ago
- 数据虫巢(微信号blogchong)公众号技术文章合集。虫巢出品,不说优品,最起码也得算个良品呐~~☆25Updated 8 years ago
- An iterative computing framework for both Hadoop MapReduce and Hadoop YARN.☆71Updated 2 years ago
- HanLP Chinese Analysis Plugin for Elasticsearch http://www.elasticsearch.org☆20Updated 8 years ago
- Scala SDK for http://www.douban.com☆40Updated 8 years ago
- 自动抽取网页正文的算法,用JAVA实现☆107Updated 7 years ago
- Library to use HBase as a spout from within Storm.☆52Updated 4 years ago
- Neo4j中文手册☆48Updated 11 years ago
- An open source word breaker with lucene supported.☆80Updated 5 years ago
- Set up Wechat Pub with Docker.☆31Updated 7 years ago