roshankoirala / pySpark_tutorial
Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple DataFrames, visualization, Machine Learning
☆29Updated 4 years ago
Alternatives and similar repositories for pySpark_tutorial:
Users that are interested in pySpark_tutorial are comparing it to the libraries listed below
- ☆18Updated 6 years ago
- PySpark Tutorial for Beginners on Google Colab: Hands-On Guide☆16Updated 4 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Building simple ML apps with Streamlit☆24Updated 4 years ago
- Kubeflow installation on windows 10/11☆17Updated 2 years ago
- Small example on how you can detect multicollinearity☆13Updated 3 years ago
- This is a guided certification project, as a part of Data Science for Social Good initiative☆17Updated 5 years ago
- The repository of the book: Deep Learning with Python by Francois Chollet☆18Updated 5 years ago
- ☆11Updated 4 years ago
- This is code depository for my upcoming session. Will update details post the session☆40Updated 2 years ago
- Book Projects☆24Updated 4 years ago
- Tutorials for using Tensorflow and Keras for NLP☆14Updated 7 years ago
- Binary classification using scikitlearn and xgboost☆18Updated 5 years ago
- Predict the number of deaths due to covid19 in the next two weeks☆11Updated 2 years ago
- The repository for the course in Udemy☆16Updated 5 years ago
- Code repo for Packt course I developed, "Beginning Data Wrangling with Python"☆30Updated 4 years ago
- Hands-on examples showcasing popular NLP applications☆19Updated 5 years ago
- Use Multiple Linear Regression, Python, Pandas, and Matplotlib to analyze the lifetime value and the key factors of the ‘Telco Customer C…☆10Updated 4 years ago
- This repository is to host template for calculating ROI on Artificial Intelligence projects☆44Updated 5 years ago
- A simple example to showcase machine learning model deployment with an API☆10Updated 3 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics …☆20Updated 3 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated 2 years ago
- Contains relevant notebooks for the hands-on NLP workshop for the GIDS AIML Conference -2020 Edition☆23Updated 3 years ago
- ☆31Updated 5 years ago
- Course on Udemy by Jose Portilla☆99Updated 7 years ago
- This is a repo for all the time series related notebook for AIENgineering☆2Updated 4 years ago
- Pyspark in Google Colab: A simple machine learning (Linear Regression) model☆36Updated 6 years ago
- In this Complete process in machine learning is discussed and done with pyspark .☆19Updated 4 years ago
- A packaged and flexible version of the CRAFT text detector and Keras CRNN recognition model.☆11Updated 4 years ago
- Deployment of a Machine Learning Model to Heroku Cloud☆20Updated 2 years ago