momer / nutch-selenium-grid-plugin
A Nutch 2.2.1 plugin that lets users hand off page retrieval to a Selenium hub/node system. Nutch relies on Selenium/Firefox to fetch pages and load their JavaScript-rendered content, while Nutch stays in charge of what it does best: crawling and further parsing.
☆16 Updated 8 years ago
Related projects:
- The first Open Source document analysis platform ☆65 Updated 3 years ago
- Storm / Solr Integration ☆19 Updated 7 months ago
- Distributed processing framework for search solutions ☆81 Updated last year
- Preliminary Solr DQ / Data Quality experiments and prototype, and SolrJ wrapper utilities ☆25 Updated 2 years ago
- The next generation of open source search ☆90 Updated 7 years ago
- Hadoop Plugin for ElasticSearch ☆61 Updated last month
- A big data cluster management tool that creates and manages clusters of different technologies. ☆21 Updated 9 years ago
- Lucene plugin for indexing and searching files stored in Baratine distributed filesystem ☆16 Updated 8 years ago
- Crabs is a SQL-like JDBC driver and command line for elastic search. With it you may use elasticsearch as simply as using SQL with tradit… ☆25 Updated 9 years ago
- Nutch 2.3.1 plugin for whitelisting/blacklisting specific HTML elements ☆12 Updated 2 years ago
- [Deprecated] Simple docker image to run a Glassfish server ☆12 Updated 7 years ago
- Object Search Engine Mapping for ElasticSearch ☆17 Updated 10 years ago
- Baratine Auction Application ☆11 Updated 7 years ago
- A Vert.x based micro service framework ☆12 Updated 8 years ago
- Spring integration for Dropwizard ☆67 Updated 9 years ago
- Servlet transport for Elasticsearch ☆41 Updated last month
- Vert.x 2.x is deprecated - use instead ☆49 Updated 7 years ago
- Kafka River Plugin for ElasticSearch ☆88 Updated 11 years ago
- Helper classes for Elasticsearch client ☆20 Updated 7 years ago
- Collects multimedia content shared through social networks. ☆19 Updated 9 years ago
- Vert.x elasticsearch service with event bus proxying ☆57 Updated 7 years ago
- Secure REST service to index, search, retrieve and aggregate content from heterogeneous sources. ☆19 Updated 9 months ago
- Docker, An easy way to try Apache Storm ☆39 Updated 7 years ago