endgameinc / elasticsearch-term-pluginLinks
Term List Matching Plugin for ElasticSearch
☆26Updated 12 years ago
Alternatives and similar repositories for elasticsearch-term-plugin
Users that are interested in elasticsearch-term-plugin are comparing it to the libraries listed below
Sorting:
- Text classification using Naive Bayes and Elasticsearch☆152Updated 9 years ago
- A custom SimilarityProvider example for Elasticsearch☆36Updated 10 years ago
- Naive Bayes Classifier implemented with Elasticsearch Aggregations☆51Updated 11 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 10 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.☆64Updated 5 years ago
- Github mirror of "search/highlighter" - our actual code is hosted with Gerrit (please see https://www.mediawiki.org/wiki/Developer_access…☆104Updated 2 months ago
- Analysis plugin for ElasticSearch providing capability for processing inline annotations in documents.☆35Updated 12 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 9 years ago
- A DSL to build Lucene text queries in Python.☆38Updated 9 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated last week
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆93Updated 3 months ago
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55Updated last year
- r³ is a map-reduce engine written in python using redis as a backend☆344Updated 13 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆107Updated 12 years ago
- A python library detect and extract listing data from HTML page.☆108Updated 8 years ago
- A scrapy pipeline which send items to Elastic Search server☆98Updated 8 years ago
- Visualization and summarization of a collection of documents.☆20Updated 3 years ago
- Index URLs in Common Crawl☆198Updated 8 years ago
- Python language Plugin for elasticsearch☆103Updated 7 years ago
- Find which links on a web page are pagination links☆29Updated 9 years ago
- Carrot2 plugin for ElasticSearch☆294Updated 3 years ago
- Elasticsearch Combo Analyzer☆86Updated 8 years ago
- Python client for Elasticsearch Watcher (deprecated)☆23Updated 7 years ago
- Readability/Boilerpipe extraction in Python☆55Updated 9 years ago
- Facilitates the indexing of content from a CSV into ElasticSearch☆27Updated 12 years ago
- An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors☆35Updated 10 years ago
- Site Hound (previously THH) is a Domain Discovery Tool☆23Updated last week
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi…☆48Updated 4 years ago
- extract difference between two html pages☆32Updated last week