martinprobson / vagrant-hadoop-hive-spark
Vagrant project to spin up a single node VM running current versions of Hadoop, Hive and Spark
☆67Updated 3 years ago
Alternatives and similar repositories for vagrant-hadoop-hive-spark:
Users that are interested in vagrant-hadoop-hive-spark are comparing it to the libraries listed below
- Real-world Spark pipelines examples☆83Updated 6 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- Apache Spark and Apache Kafka integration example☆124Updated 7 years ago
- Vagrant project to spin up a single virtual machine running current versions of Hadoop, Hive and Spark☆75Updated 6 years ago
- Ambari YARN UTILS☆30Updated last year
- Project for James' Apache Spark with Scala course☆127Updated 4 years ago
- ☆54Updated 10 years ago
- Spark Structured Streaming / Kafka / Cassandra / Elastic☆183Updated 2 years ago
- Dockerized HDP Cluster☆84Updated 7 years ago
- ☆245Updated 5 years ago
- Example Maven configuration for a Spark, Scala project☆54Updated 2 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 9 years ago
- Examples of Spark 2.0☆211Updated 3 years ago
- Few scripts to automate daily data loads from RDBMS to Partitioned Avro Hive table☆29Updated 10 years ago
- Spark Examples☆125Updated 3 years ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 8 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆88Updated 11 months ago
- Sample processing code using Spark 2.1+ and Scala☆51Updated 4 years ago
- Simple example for reading and writing into Kafka☆55Updated 4 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆63Updated 5 years ago
- A Docker container with a full Hadoop cluster setup with Spark and Zeppelin☆65Updated 5 years ago
- Docker Cloudera Quick Start Image☆91Updated 7 years ago
- Spark, Spark Streaming and Spark SQL unit testing strategies☆218Updated 8 years ago
- Code for the deployment of Hadoop clusters, written in Bourne or Bourne Again shell.☆34Updated last year
- ansible playbook to deploy cloudera hadoop components to the cluster☆52Updated 6 years ago
- Lightweight proxy to expose the UI of an Apache Spark cluster that is behind a firewall☆98Updated 4 years ago
- An implementation of a real-world map-reduce workflow in each major framework.☆151Updated 8 years ago
- Build configuration-driven ETL pipelines on Apache Spark☆159Updated 2 years ago
- Apache Spark in your IDE with gradle☆38Updated 3 years ago