bryanyang0528 / docker-spark-hive-ipython
Spark + Jupyter + Hive
☆16Updated 9 years ago
Alternatives and similar repositories for docker-spark-hive-ipython:
Users who are interested in docker-spark-hive-ipython are comparing it to the repositories listed below
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆29Updated 2 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆41Updated 7 years ago
- A project with examples of using a few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 3 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Learning Spark SQL, published by Packt☆42Updated 2 years ago
- Example project showing how to use Hive UDFs in Apache Spark☆55Updated 6 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆84Updated 5 years ago
- ☆41Updated 8 years ago
- Spark 3.0.0 Structured Streaming Kafka Avro Demo☆15Updated 2 years ago
- Spark + HDFS cluster using docker compose☆48Updated 6 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- These are some code examples☆55Updated 5 years ago
- This tutorial provides a quick introduction to using Spark☆57Updated 9 years ago
- Spark Streaming HBase Example☆22Updated 9 years ago
- Kafka, Spark Streaming, Kudu integration examples☆17Updated 7 years ago
- Code repository for Large Scale Machine Learning with Spark by Packt☆20Updated 2 years ago
- graphx example☆24Updated 9 years ago
- Examples of all Machine Learning Algorithms in Apache Spark☆15Updated 7 years ago
- Supporting content (slides and exercises) for the Addison-Wesley (Pearson) video series covering best practices for developing scalable S…☆67Updated 9 years ago
- ☆38Updated 7 years ago
- ElasticSearch integration for Apache Spark☆47Updated 9 years ago
- A plugin to Apache Airflow to allow you to run Spark Submit Commands as an Operator☆73Updated 5 years ago
- Mastering-Scala-Machine-Learning☆36Updated 2 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- Multi docker container images for main Big Data Tools. (Hadoop, Spark, Kafka, HBase, Cassandra, Zookeeper, Zeppelin, Drill, Flink, Hive, …☆34Updated 4 months ago
- Apache Spark docker container image (Standalone mode)☆35Updated 4 years ago
- ☆24Updated 8 years ago
- ☆23Updated 8 years ago
- Using Spark SQLContext, HiveContext & Spark DataFrames API with ElasticSearch, Cassandra & MongoDB☆22Updated 8 years ago
- Examples To Help You Learn Apache Spark☆77Updated 6 years ago