meabed / nutch-cassandra-docker
Nutch with Cassandra and Elasticsearch on Docker
☆17Updated 3 years ago
Alternatives and similar repositories for nutch-cassandra-docker:
Users that are interested in nutch-cassandra-docker are comparing it to the libraries listed below
- PredictionIO Classification Engine Template (Scala-based parallelized engine)☆39Updated 5 years ago
- ☆27Updated 8 years ago
- Parse wikipedia dumps and index (some) page data to elasticsearch☆49Updated 9 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This …☆16Updated 8 years ago
- Integration of Samza and Luwak☆99Updated 10 years ago
- Data Science Research Architecture, Data Center OS☆21Updated 8 years ago
- A DC/OS time series demo☆62Updated 9 years ago
- ☆28Updated 8 years ago
- Collects multimedia content shared through social networks.☆19Updated 9 years ago
- UberSocialNet—applying the Lambda Architecture☆29Updated 11 years ago
- Apache Nutch fork tunned for web services and data discovery.☆9Updated 9 years ago
- Docker image for apache zeppelin☆38Updated 7 years ago
- An Akka Extension for easy integration of spark and cassandra in Akka micro services.☆25Updated 10 years ago
- ☆50Updated 9 years ago
- Open source analytics platform powered by Apache Cassandra, Spark, and Kafka☆34Updated 9 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago
- Graph Analytics Engine☆260Updated 10 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 7 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Sample migration from Titan 0.5.4 to Titan 1.0.0☆17Updated 8 years ago
- TinkerPop 3 implementation on Elasticsearch backend☆70Updated 9 years ago
- Additional opennlp mapping type for elasticsearch in order to perform named entity recognition☆136Updated 8 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 6 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- The first Open Source document analysis platform☆65Updated 3 years ago
- IoT - It's the thing you want! And so here's a full-stack demo.☆62Updated 8 years ago
- A distributed computing cluster-in-a-box: Mesos, zookeeper, chronos, marathon, storm + add your own. Use other physical computers to add …☆25Updated 10 years ago
- A Storm based web crawler with Cassandra backend☆28Updated 11 years ago
- Fabric-based framework for deploying and managing SolrCloud clusters in the cloud.☆90Updated 5 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)☆35Updated 9 years ago