momer / nutch-seleniumLinks
☆28Updated 9 years ago
Alternatives and similar repositories for nutch-selenium
Users that are interested in nutch-selenium are comparing it to the libraries listed below
Sorting:
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Updated 9 years ago
- Storm / Solr Integration☆19Updated last year
- A Query Autofiltering SearchComponent for Solr that can translate free-text queries into structured queries using index metadata☆28Updated 6 years ago
- Code to index HDFS to Solr using MapReduce☆52Updated 6 years ago
- Elasticsearch Index Termlist☆117Updated 6 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 9 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 3 years ago
- WARC (Web Archive) Input and Output Formats for Hadoop☆35Updated 10 years ago
- The next generation of open source search☆92Updated 8 years ago
- How to spot first stories on Twitter using Storm.☆125Updated last year
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- ☆49Updated 8 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark☆51Updated 11 years ago
- A spark sbt blueprint to build your own spark apps off of (for cloud native runtime, see the kube/spark examples)☆56Updated 6 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Updated 10 years ago
- Educational Examle of a custom Lucene Query & Scorer☆48Updated 5 years ago
- Lucene Auto Phrase TokenFilter implementation☆59Updated 6 years ago
- Silk is a port of Kibana 4 project.☆70Updated 9 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Cascading on Apache Flink®☆54Updated last year
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud.☆90Updated 6 years ago
- ElasticSearch Prediction Generator and Plugin☆22Updated 9 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- Text classification using Naive Bayes and Elasticsearch☆154Updated 8 years ago
- Spark RDD with Lucene's query and entity linkage capabilities☆128Updated 2 weeks ago
- An implementation of locality sensitive hashing with Hadoop☆57Updated 10 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- ☆66Updated 8 years ago
- Large-scale ML & graph analytics on Giraph☆78Updated 9 years ago