smorin / hadoop-single-node-cluster
Install a single-node Hadoop 2 cluster with 1 command
☆21 Updated 6 years ago
Alternatives and similar repositories for hadoop-single-node-cluster:
Users who are interested in hadoop-single-node-cluster are comparing it to the repositories listed below.
- training material ☆47 Updated 6 months ago
- ☆55 Updated 11 years ago
- DEPRECATED! Use https://github.com/h2oai/sparkling-water repository! H2O and Spark interoperability based on Tachyon. ☆44 Updated 10 years ago
- ADMM based large scale logistic regression ☆337 Updated last year
- Facebook Presto docker image for development and testing purposes. https://hub.docker.com/r/zhicwu/presto/ ☆10 Updated 8 years ago
- k-means + a linear model = good results ☆55 Updated 10 years ago
- Github contest ☆40 Updated 15 years ago
- Distributed Web Crawler, Parser and Search Engine. ☆10 Updated 8 years ago
- You wanna learn how to use Hadoop, start here! ☆39 Updated 12 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 8 years ago
- Scala cheat sheet ☆23 Updated 11 years ago
- personal cheatsheets on various technologies ☆25 Updated 8 years ago
- A collection of efficient utilities for a data scientist. ☆41 Updated 9 years ago
- Automates Spark standalone cluster tasks with Puppet and Fabric. ☆43 Updated 10 years ago
- Templates for projects based on top of H2O. ☆38 Updated last month
- Spark MLlib code optimized to efficiently support sparse data ☆51 Updated 8 years ago
- Simple Spark Application ☆76 Updated last year
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Stand-alone recommender system from Myrrix ☆108 Updated last year
- Scala port of the word2vec toolkit. ☆11 Updated 8 years ago
- Using Spark SQLContext, HiveContext & Spark DataFrames API with ElasticSearch, Cassandra & MongoDB ☆22 Updated 8 years ago
- Stanford CoreNLP: A Java suite of core NLP tools. ☆8 Updated 8 years ago
- Weka on Spark ☆32 Updated 6 years ago
- A chef cookbook for deploying spark ☆30 Updated 12 years ago
- Apache Spark jobs such as Principal Coordinate Analysis. ☆74 Updated 8 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data ☆47 Updated 9 years ago
- Day 20 demo application ☆50 Updated 11 years ago
- Spark examples ☆41 Updated 11 months ago
- Training materials for Strata, AMP Camp, etc ☆149 Updated 9 years ago
- Code used in "Pro Spark Streaming: The Zen of Real-time Analytics using Apache Spark" published by Apress Publishing. ☆48 Updated 9 years ago