smorin / hadoop-single-node-cluster
Install a single-node Hadoop 2 cluster with one command
☆21 Updated 6 years ago
Alternatives and similar repositories for hadoop-single-node-cluster
Users who are interested in hadoop-single-node-cluster are comparing it to the repositories listed below.
- Scala cheat sheet ☆23 Updated 11 years ago
- training material ☆47 Updated 8 months ago
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon. ☆44 Updated 10 years ago
- A Storm based web crawler with Cassandra backend ☆28 Updated 11 years ago
- Templates for projects based on top of H2O. ☆38 Updated 4 months ago
- Distributed Web Crawler, Parser and Search Engine. ☆10 Updated 9 years ago
- A Nutch 2.2.1 plugin which allows users to shuffle off the responsibility for retrieving pages to a selenium hub/node spoke system. This … ☆16 Updated 9 years ago
- Repository for SF QConf 2015 Workshop ☆16 Updated 8 months ago
- Dockerfiles to create Fuse containers in docker.io ☆33 Updated 10 years ago
- Load your MongoDB collection into Hive. Supports complex JSON structure. ☆24 Updated 10 years ago
- Dockerfiles and scripts for Spark and Shark Docker images ☆259 Updated 11 years ago
- Simple Spark Application ☆76 Updated last year
- Hadoop Cluster Configurations ☆33 Updated 3 years ago
- Learning problem-solving, logic/set, math, physics, economics through functional programming using Haskell ☆19 Updated 9 years ago
- Twitter-Kafka Data Pipeline ☆16 Updated 8 months ago
- THIS REPOSITORY IS VERY OUTDATED. See Ansible Galaxy instead. ☆28 Updated 6 years ago
- Spark examples ☆41 Updated last year
- A collection of efficient utilities for a data scientist. ☆41 Updated 10 years ago
- Docker Cloudera Quick Start Image ☆93 Updated 7 years ago
- Hadoop Map-Reduce Design Patterns ☆73 Updated 2 years ago
- This is an introduction of Apache Spark DataFrames. ☆41 Updated 10 years ago
- Sparking Using Java8 ☆17 Updated 10 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- ☆9 Updated 2 years ago
- PredictionIO Recommendation Engine Template (Scala-based parallelized engine) ☆80 Updated 6 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems ☆19 Updated 11 years ago
- Personal development repository to prepare contributions and patches for Apache Mahout ☆16 Updated 15 years ago
- Offline Elasticsearch index generator ☆26 Updated 4 years ago
- Mirror of Apache Hadoop common ☆15 Updated 4 years ago
- Laboratory Material for the course on Cloud Computing ☆68 Updated 3 years ago