ATLANTBH / nutch-pluginsLinks
Apache Nutch extensions
☆35Updated 3 years ago
Alternatives and similar repositories for nutch-plugins
Users that are interested in nutch-plugins are comparing it to the libraries listed below
Sorting:
- The src for http://solr-vs-elasticsearch.com☆71Updated 7 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- Apache Nutch Plugins for AJAX page fetch, parse, index☆87Updated 6 years ago
- ☆18Updated 8 years ago
- Distributed text analysis suite based on Celery☆96Updated 2 years ago
- Vagrant config to create a virtualized Storm cluster☆35Updated 11 years ago
- Code to index HDFS to Solr using MapReduce☆52Updated 6 years ago
- Example RAML Specification for InfoQ article.☆15Updated 2 years ago
- ☆32Updated last year
- ☆66Updated 8 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python.☆25Updated 12 years ago
- Distributed Web Crawler, Parser and Search Engine.☆10Updated 8 years ago
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 7 years ago
- 天亮分词器第12个小版本☆8Updated 11 years ago
- A custom SimilarityProvider example for Elasticsearch☆36Updated 9 years ago
- Elasticsearch Combo Analyzer☆85Updated 8 years ago
- Storm / Solr Integration☆19Updated last year
- Stand-alone recommender system from Myrrix☆108Updated last year
- nutz+jetty+h2 做的一个web应用☆40Updated 8 years ago
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities☆26Updated 4 months ago
- Paoding Analysis Plugin for ElasticSearch☆21Updated 12 years ago
- A tool for semantic relation extraction. The program finds pairs of semantically related words based on the text definitions coming from …☆26Updated 10 years ago
- 通过web服务器对word分词的资源进行集中统一管理☆20Updated 8 years ago
- Solr Redis Extensions☆53Updated last year
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Parses Solr's log file to get some basic query statistics☆20Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- Building recommenders with Elastic Graph!☆37Updated 4 years ago
- Content Analysis System is a framework for mining scientific publications using Apache Hadoop.☆27Updated 3 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 10 years ago