alexholmes / vagrant-hadoop-spark-hive
Vagrant project to spin up a single virtual machine running current versions of Hadoop, Hive and Spark
☆75Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for vagrant-hadoop-spark-hive
- Example of use of Spark Streaming with Kafka☆90Updated 10 years ago
- Elastic Search on Spark☆112Updated 10 years ago
- Code for Tutorial on designing clickstream analytics application using Hadoop☆54Updated 9 years ago
- Apache Spark applications☆70Updated 6 years ago
- Simple Spark Application☆76Updated 11 months ago
- Kite SDK Examples☆99Updated 3 years ago
- SequenceIQ Hadoop examples☆115Updated 9 years ago
- ☆71Updated 7 years ago
- Vagrant project to spin up a cluster of 4 32-bit CentOS6.5 Linux virtual machines with Hadoop v2.6.0 and Spark v1.1.1☆126Updated 8 years ago
- NRT Sessionization with Spark Streaming landing on HDFS and putting live stats in HBase☆51Updated 10 years ago
- Apache Spark and Apache Kafka integration example☆124Updated 6 years ago
- ☆48Updated 6 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆92Updated 7 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 5 years ago
- Code for Packt Publishing's Scala Data Analysis Cookbook.☆49Updated 8 years ago
- An example of using Avro and Parquet in Spark SQL☆60Updated 9 years ago
- Source code of Blog at☆52Updated 5 months ago
- A Spark WordCountJob example as a standalone SBT project with Specs2 tests, runnable on Amazon EMR☆118Updated 8 years ago
- Counting Twitter hashtags using Spark Streaming and Cassandra☆41Updated 9 years ago
- ☆54Updated 10 years ago
- Based off the design of SparkOnHBase. This Repo will support Spark, Spark Streaming, and Spark SQL integration with Kudu.☆51Updated 8 years ago
- A super simple utility for testing Apache Hive scripts locally for non-Java developers.☆72Updated 7 years ago
- Example programs and scripts for accessing parquet files☆30Updated 6 years ago
- ☆92Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Helpful user defined fuctions / table generating functions for Hive☆101Updated 8 years ago
- Spark structured streaming with Kafka data source and writing to Cassandra☆64Updated 4 years ago