jacobceles / intro-to-colab-pyspark-emr

A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics like EMR sizing, Google Colaboratory, fine-tuning PySpark jobs, and much more.
18Updated 3 years ago

Related projects

Alternatives and complementary repositories for intro-to-colab-pyspark-emr