garystafford / pyspark-setup-demo
Demo of PySpark and Jupyter Notebook with the Jupyter Docker Stacks
☆35Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for pyspark-setup-demo
- Source code for 'PySpark Recipes' by Raju Kumar Mishra☆25Updated 4 years ago
- Interactive Notebooks that support the book☆38Updated 4 years ago
- Dockerizing an Apache Spark Standalone Cluster☆43Updated 2 years ago
- Repository used for Spark Trainings☆53Updated last year
- A Spark cluster setup running on Docker containers☆60Updated 4 years ago
- Business Data Analysis by HiPIC of CalStateLA☆20Updated 6 years ago
- spark on kubernetes☆105Updated last year
- Spark on Kubernetes using Helm☆34Updated 4 years ago
- Real-world Spark pipelines examples☆83Updated 6 years ago
- ☆20Updated 5 years ago
- AWS Big Data Certification☆25Updated last year
- One click deploy docker-compose with Kafka, Spark Streaming, Zeppelin UI and Monitoring (Grafana + Kafka Manager)☆119Updated 3 years ago
- Cloud Dataproc: Samples and Utils☆198Updated 2 weeks ago
- Code that was used as an example during the Data+AI Summit 2020☆15Updated 3 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆172Updated 11 months ago
- Spark and Delta Lake Workshop☆22Updated 2 years ago
- Repo for all my code on the articles I post on medium☆105Updated 2 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 7 years ago
- A project with examples of using few commonly used data manipulation/processing/transformation APIs in Apache Spark 2.0.0☆25Updated 3 years ago
- ☆109Updated last year
- ( These solutions tested on 4 node Hortonwork cluster on my laptop. Do not test on your production environment until you test... :)☆20Updated 4 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 3 months ago
- This repository contains code for Spark Streaming☆21Updated 3 years ago
- How to manage Slowly Changing Dimensions with Apache Hive☆55Updated 5 years ago
- Materials of the Official Helm Chart Webinar☆27Updated 3 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆83Updated 4 years ago