roshankoirala / pySpark_tutorial
Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning
☆28Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for pySpark_tutorial
- ☆19Updated 6 years ago
- Building simple ML apps with Streamlit☆25Updated 3 years ago
- Small example on how you can detect multicollinearity☆13Updated 3 years ago
- ☆11Updated 3 years ago
- Book Projects☆24Updated 3 years ago
- This repository consist of a 50-day program. All the statistics required for the complete understanding of data science will be uploaded …☆25Updated 2 years ago
- ☆18Updated 3 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆16Updated 4 years ago
- Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer C…☆10Updated 4 years ago
- A simple example to showcase machine learning model deployment with an API☆10Updated 2 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Kubeflow installation on windows 10/11☆16Updated last year
- Streamlit example showing Scikit Learn & Pyspark ML over Healthcare data ! Its simple !!☆30Updated 3 years ago
- The repository for the course in Udemy☆17Updated 5 years ago
- Binary classification using scikitlearn and xgboost☆18Updated 5 years ago
- Detailed Tensorflow2 Object Detection Tutorial Step by Step Explained☆23Updated 4 years ago
- Laptop Prices Predictor is an end-to-end data science project that accurately predicts laptop prices using machine learning algorithms. T…☆14Updated 3 months ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆18Updated 3 years ago
- Predict the number of deaths due to covid19 in the next two weeks☆11Updated 2 years ago
- In this Complete process in machine learning is discussed and done with pyspark .☆18Updated 4 years ago
- Contains slides and hands-on tutorials for understanding and implementing Transformers in Natural Language Processing. Uses the HuggingFa…☆27Updated 4 years ago
- Portofolio repository for Udacity Data Scientist Nanodegree☆38Updated 4 years ago
- Customer-base segmentation over e-commerce sales data☆24Updated 4 years ago
- Recency, Frequency, and Monetary are three behavioral attributes and are quite simple, in that they can be easily computed for any databa…☆15Updated last year
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python"☆28Updated 4 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 5 years ago
- ☆13Updated 3 years ago
- Contains relevant notebooks for the hands-on NLP workshop for the GIDS AIML Conference -2020 Edition☆23Updated 3 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆9Updated 3 years ago
- The repository of the book: Deep Learning with Python by Francois Chollet☆16Updated 5 years ago