ATLANTBH / nutch-plugins
Apache Nutch extensions
☆36 Updated 3 years ago
Alternatives and similar repositories for nutch-plugins
Users who are interested in nutch-plugins are comparing it to the repositories listed below.
- word2vec-java ☆7 Updated 7 months ago
- Distributed Web Crawler, Parser and Search Engine. ☆10 Updated 8 years ago
- Elasticsearch Combo Analyzer ☆85 Updated 8 years ago
- ☆66 Updated 8 years ago
- Example RAML Specification for InfoQ article. ☆15 Updated 2 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 Updated 9 years ago
- Clone version of LingPipe 4.1.0, with support for unsupervised training ☆32 Updated 11 years ago
- This plugin provides a useful feature for multi-language ☆14 Updated 2 years ago
- An Apache Lucene TokenFilter that uses word2vec vectors for term expansion. ☆24 Updated 11 years ago
- Stand-alone recommender system from Myrrix ☆108 Updated last year
- Storm / Solr Integration ☆19 Updated last year
- ☆32 Updated last year
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 Updated 9 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This … ☆16 Updated 8 years ago
- Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimiza… ☆36 Updated 4 years ago
- Keyword query search engine on semantic store/linked data web ☆9 Updated 9 years ago
- Chef cookbook to Manage Apache Solr ☆20 Updated 9 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text ☆35 Updated 8 years ago
- The first Open Source document analysis platform ☆65 Updated 3 years ago
- ☆18 Updated 8 years ago
- A project to demonstrate maximum entropy models for extracting quotes from news articles in Python. ☆25 Updated 12 years ago
- Includes Code for Inference and Evaluation of Topic Models for Selectional Preferences ☆16 Updated 2 years ago
- Implementation of the Chinese Whispers graph clustering algorithm ☆8 Updated 7 years ago
- A Chinese Words Segmentation Tool Based on Bayes Model ☆79 Updated 11 years ago
- ☆22 Updated last year
- ☆9 Updated 6 years ago
- Apache Nutch Plugins for AJAX page fetch, parse, index ☆87 Updated 6 years ago
- Structured Data Extractor. An application to extract structured data from web pages. It uses Data Extraction Based on Partial Tree Alignm… ☆49 Updated 12 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 10 years ago
- Kafka River Plugin for ElasticSearch ☆87 Updated 11 years ago