roshankoirala / pySpark_tutorial

Implementation of Spark code in Jupyter notebooks. Topics include RDDs and DataFrames, exploratory data analysis (EDA), handling multiple DataFrames, visualization, and machine learning.
28 · Updated 4 years ago

Related projects

Alternatives and complementary repositories for pySpark_tutorial