meabed / nutch-cassandra-docker
Nutch with Cassandra and Elasticsearch on Docker
☆17 Updated 3 years ago
Alternatives and similar repositories for nutch-cassandra-docker:
Users who are interested in nutch-cassandra-docker are comparing it to the repositories listed below
- PredictionIO Classification Engine Template (Scala-based parallelized engine) ☆39 Updated 5 years ago
- ☆27 Updated 9 years ago
- ☆28 Updated 8 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This … ☆16 Updated 8 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch ☆49 Updated 9 years ago
- This repository contains a Docker image of the latest version of the Neo4j community server ☆48 Updated 5 years ago
- A set of scripts and config files to run a Cassandra cluster from Docker ☆216 Updated 11 years ago
- Big GeoSpatial Data Points Visualization Tool ☆19 Updated 9 years ago
- Docker containers for Druid nodes ☆27 Updated 8 years ago
- On demand presto cluster with mesos, marathon and docker. ☆30 Updated 7 years ago
- IoT - It's the thing you want! And so here's a full-stack demo. ☆62 Updated 8 years ago
- Apache Kafka HTTP Endpoint for producing and consuming messages from topics ☆153 Updated 10 years ago
- Scripts for running Apache Kafka on Mesosphere's Marathon ☆14 Updated 9 years ago
- Graph Analytics Engine ☆259 Updated 10 years ago
- A dataset downloaded from the deep and scientific web across three major Polar data centers for use in research. ☆13 Updated 7 years ago
- A spark sbt blueprint to build your own spark apps off of (for cloud native runtime, see the kube/spark examples) ☆56 Updated 5 years ago
- Kaltura's next generation Analytics solution based on Spark, Cassandra and Kafka ☆12 Updated 2 years ago
- PredictionIO word2vec engine template (Scala-based parallelized engine) ☆12 Updated 10 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark) ☆35 Updated 10 years ago
- UberSocialNet: applying the Lambda Architecture ☆30 Updated 11 years ago
- A Storm based web crawler with Cassandra backend ☆28 Updated 11 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition ☆136 Updated 9 years ago
- Data Science Research Architecture, Data Center OS ☆21 Updated 8 years ago
- Storm Spout + Kafka State Inspector ☆58 Updated 5 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services. ☆25 Updated 10 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained. ☆49 Updated 5 years ago
- Elastic Search on Spark ☆112 Updated 10 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud. ☆90 Updated 6 years ago
- PredictionIO vanilla engine template (Scala-based parallelized engine) ☆26 Updated 5 years ago