meabed / nutch-cassandra-dockerLinks
Nutch with Cassandra and Elasticsearch on Docker
☆17Updated 3 years ago
Alternatives and similar repositories for nutch-cassandra-docker
Users that are interested in nutch-cassandra-docker are comparing it to the libraries listed below
Sorting:
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Updated 9 years ago
- ☆27Updated 9 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch☆49Updated 9 years ago
- Integration of Samza and Luwak☆99Updated 10 years ago
- PredictionIO Classification Engine Template (Scala-based parallelized engine)☆39Updated 6 years ago
- Data Science Research Architecture, Data Center OS☆21Updated 9 years ago
- Docker containers for Druid nodes☆27Updated 9 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud.☆90Updated 6 years ago
- Graph Analytics Engine☆260Updated 10 years ago
- TinkerPop 3 implementation on Elasticsearch backend☆70Updated 9 years ago
- UNRELEASED. An opinionated framework for analytics-on-write on event streams using key-value storage☆14Updated 9 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Updated 10 years ago
- This repository contains a Docker image of the latest version of the Neo4j community server☆48Updated 5 years ago
- An Elasticsearch Plugin that notifies about changes to indices☆92Updated 9 years ago
- ☆28Updated 9 years ago
- Jetstream is a streaming processing framework☆113Updated 9 years ago
- IoT - It's the thing you want! And so here's a full-stack demo.☆62Updated 8 years ago
- Lucene based indexing in Cassandra☆61Updated 9 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 8 years ago
- Twitter Streaming API Example with Kafka Streams in Scala☆49Updated 8 years ago
- Scala client for the Lightning data visualization server (WIP)☆47Updated 6 years ago
- A DC/OS time series demo☆62Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- A platform for real-time streaming search☆102Updated 9 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Google Compute Engine Cloud plugin for Elasticsearch☆59Updated 6 years ago
- A Cascading Workflow Visualizer☆83Updated 2 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)☆35Updated 10 years ago
- TinkerPop3 (Moved To Apache TinkerPop)☆214Updated 8 years ago