bejean / crawl-anywhere
Crawl-Anywhere - Web Crawler and document processing pipeline with Solr integration.
☆96Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for crawl-anywhere
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- ☆65Updated 7 years ago
- The Common Crawl Crawler Engine and Related MapReduce code (2008-2012)☆212Updated last year
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆28Updated 6 years ago
- solr-logstash☆43Updated 8 years ago
- Solr Redis Extensions☆52Updated 9 months ago
- Solrstrap is a Query-Result interface for Solr written in JavaScript, HTML and CSS☆86Updated 7 years ago
- Norconex Crawlers (or spiders) are flexible web and filesystem crawlers for collecting, parsing, and manipulating data from the web or fi…☆183Updated this week
- CommonCrawl WARC/WET/WAT examples and processing code for Java + Hadoop☆56Updated 3 years ago
- ☆28Updated 8 years ago
- ☆18Updated 8 years ago
- crawler for YouTube☆48Updated 10 years ago
- Elasticsearch Index Termlist☆117Updated 5 years ago
- Feed discovery to share :)☆40Updated 8 years ago
- This is a Fact based Question Answering System using Apache Solr as backend search engine, Wikipedia dumps as information source, Apache …☆25Updated 2 years ago
- Storm / Solr Integration☆19Updated 9 months ago
- Lucene Auto Phrase TokenFilter implementation☆59Updated 6 years ago
- Twitter River Plugin for elasticsearch (STOPPED)☆204Updated 3 months ago
- Mirror of Apache Stanbol (incubating)☆112Updated 8 months ago
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 3 years ago
- Sample Mobile / PhoneGap App built with Backbone.js and Ratchet☆124Updated 10 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 4 years ago
- Solr Query Segmenter for structuring unstructured queries☆21Updated 3 years ago
- Skywalker for Elasticsearch is like Luke for Lucene☆79Updated 4 years ago
- Example code for the book "Indexing Data in Apache Solr"☆43Updated 4 years ago
- distributed realtime searchable database☆115Updated 10 years ago
- An extension to the demo template of ElasticUI a beautiful AngularJS frontend to ElasticSearch for faceted navigation☆39Updated 9 years ago
- Dice.com tutorial on using black box optimization algorithms to do relevancy tuning on your Solr Search Engine Configuration from Simon H…☆28Updated 5 years ago
- FacetView is a pure javascript frontend for ElasticSearch.☆291Updated 9 years ago