meabed / nutch-cassandra-dockerLinks
Nutch with Cassandra and Elasticsearch on Docker
☆17Updated 4 years ago
Alternatives and similar repositories for nutch-cassandra-docker
Users that are interested in nutch-cassandra-docker are comparing it to the libraries listed below
Sorting:
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆382Updated 3 years ago
- ☆28Updated 9 years ago
- ☆25Updated 10 years ago
- TinkerPop 3 implementation on Elasticsearch backend☆70Updated 10 years ago
- Elasticsearch entity resolution plugin based on Duke☆209Updated 5 years ago
- Graph Processing Algorithms on top of Neo4j☆39Updated 8 years ago
- Integration of Samza and Luwak☆100Updated 11 years ago
- A platform for real-time streaming search☆102Updated 9 years ago
- IoT - It's the thing you want! And so here's a full-stack demo.☆62Updated 9 years ago
- A java library for stored queries☆378Updated 2 years ago
- Docker image for general apache spark client☆116Updated 8 years ago
- https://github.com/apache/incubator-myriad is our new home. See☆253Updated 10 years ago
- ☆110Updated 8 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆329Updated 3 years ago
- ☆76Updated 10 years ago
- Run PredictionIO inside Docker☆200Updated 7 years ago
- Behemoth is an open source platform for large scale document analysis based on Apache Hadoop.☆283Updated 7 years ago
- RDF-Centric Map/Reduce Framework and Freebase data conversion tool☆149Updated 4 years ago
- [DEPRECATED] This project is deprecated. It will be archived on December 1, 2017.☆184Updated 8 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆75Updated 3 years ago
- Elasticsearch reindexing tool.☆46Updated 2 years ago
- Graphify is a Neo4j unmanaged extension used for document and text classification using graph-based hierarchical pattern recognition.☆378Updated 5 years ago
- Apache Nutch fork tunned for web services and data discovery.☆10Updated 10 years ago
- TinkerPop3 (Moved To Apache TinkerPop)☆214Updated 9 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet.☆331Updated 10 years ago
- Beyond Piwik Analytics with Scala and Apache Spark☆46Updated 11 years ago
- This project combines Apache Spark and Elasticsearch to enable mining & prediction for Elasticsearch.☆212Updated 11 years ago
- Google Compute Engine Cloud plugin for Elasticsearch☆58Updated 7 years ago
- Mazerunner extends a Neo4j graph database to run scheduled big data graph compute algorithms at scale with HDFS and Apache Spark.☆127Updated 10 years ago
- [DEPRECATED] This project is deprecated. It will be archived on December 1, 2017.☆147Updated 9 years ago