bryanyang0528 / docker-spark-hive-ipython
Spark + Jupyer + Hive
☆16Updated 9 years ago
Related projects ⓘ
Alternatives and complementary repositories for docker-spark-hive-ipython
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆28Updated last year
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- Kafka, Spark Streaming, Kudu integration examples☆17Updated 6 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- GeoIP Functions for hive☆49Updated 4 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Updated last year
- Learning Spark SQL, published by Packt☆40Updated last year
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 3 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆75Updated 5 years ago
- Spark Streaming HBase Example☆22Updated 8 years ago
- ☆47Updated 4 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 5 years ago
- This project describes how to write full ETL data pipeline using spark.☆15Updated 2 years ago
- Spark + HDFS cluster using docker compose☆47Updated 6 years ago
- Flume-to-Spark-Streaming Log Parser☆23Updated 8 years ago
- Mastering Spark for Data Science, published by Packt☆46Updated last year
- An Apache Spark app for making data movement between Apache Hive and Apache Phoenix/HBase☆14Updated 8 years ago
- Deploy your Spark Production Cluster on Kubernetes☆47Updated 4 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆83Updated 4 years ago
- ElasticSearch integration for Apache Spark☆47Updated 8 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- graphx example☆24Updated 8 years ago
- Demo showcasing Spark Streaming, Kafka, Kudu - all in Python☆27Updated 7 years ago
- ☆105Updated 4 years ago
- A collection of Hive UDFs☆75Updated 4 years ago
- Code files uploaded by Packt publishing☆31Updated 3 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆33Updated last year
- Scala and Spark for Big Data Analytics, published by Packt☆34Updated last year
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- ☆38Updated 6 years ago